March 30, 2006

Google Sandbox and TrustRank Algorithm

Google's PageRank was the key to Google's success because it gave a plausible answer to a simple question: "How valuable a site is?". Let's see who links to it and how important these sites are. But as time went by, Google realized that many links from less important sites could outweigh few links from major sites. And that's especially true for blogs. So Google wanted to fight spam from search results.

Jon Galloway explains how Google changed their ranking system:

Jagger's [Google index update from October 2005] main change is the switch from the elegant but overly trusting PageRank system to the more realistically cynical TrustRank system, which is designed to only count votes from sites it trusts.

TrustRank imitates human behavior - if a stranger on a train recommends a movie, I'm going to value it a lot less than a recommendation from a close friend or movie critic, both of whom have earned my trust by either how long I've known them or by their reputation. Trust comes from two sources - site age and links from trusted sources. From my movie recommendation analogy above, site age is the close friend who has gained trust through the age of the relationship, whereas trusted sources are sites who has been granted a position of authority by links form a small seed group of trused sites.

Another way to look at this is from the point of view of a content publisher with a new site. At first, your links will be untrusted and will not contribute to the Page Rank of the page they link to. The site has to undergo an aging delay to before it is considered authoritative, which has led to discussion of the "Sandbox" (or the "Trustbox"). The idea is that new sites are sandboxed so they can't mess up the rankings until they've proven themselves, at which time they can participate in Page Rank voting.

There are two ways to gain trust and escape the Trustbox:
* Acquire links from highly trusted sources (the "movie critic recommendation")
* Acquire links from somewhat trusted sources and let them age (the "friend recommendation")

Google Sandbox is a filter whose criteria is the age of a site. After let's say 4-6 months or when the site acquires highly trusted links, a site is given credit for what it has achieved, for the backlinks it has established: its PageRank increases and it's more visible in the search results.

Related:
Expertrank: authoritative search
The future of search

3 comments:

  1. Thanks for the link. I don't work for Microsoft; the weblogs.asp.net weblogs system is mostly non-Microsoft developers who use Microsoft technologies.

    ReplyDelete
  2. Sorry for that, I've always associated you with Microsoft.

    ReplyDelete
  3. Well that does sound great but it kind of eliminates the opportunity for new retail companies to get their sites going. I have a online store that has been up since nov and I just can't get my page rankup enough to be able to profit from my site. If there is no visitors there is no buyers. They need to find a better way. Just because a site is not very old dosn't mean that they don't offer quality products that deserve a better rank.

    ReplyDelete

Note: Only a member of this blog may post a comment.