![]() ![]() |
Hall of Fame![]() Group: Members
Joined: 27-July 04
Posts: 937
From: Seattle, WA
|
Apr 7 2005, 01:52 PM |
|
|
Bwelford's post in the reciprocal links thread got me thinking about the functionality of how links pass 'relevance' and power according to the major search engines.
As we all know, PageRank's initial formula discriminated against pages with more links on them and lowered the link power of each link based on how many internal/external links a page had. As time has passed and PageRank is now just "pixie dust", is it still the general consensus that more links on a page means less power per link? I haven't been able to think of a good universal reason why this should be so. In my opinion, a link on a page with 3, 6 or 10 other links, should be equally valuable - the number of external links shouldn't dampen the weight of each link, until you get into higher numbers - 20, 50, 100 external links (at which point you're probably dealing with a list of references, rather than a content subject based link). I'd love to hear the community's thoughts. |
||
| Offline | ![]() |
Moderator![]() Group: Moderators
Joined: 6-March 03
Posts: 7,962
From: Langley, British Columbia, Canada
|
Apr 7 2005, 05:13 PM |
|
|
I believe my view is in line with that set out by Grumpus and by invision.
I'm not sure that all the major search engines will handle this question in the same way. As has been pointed out, we're now talking about something different than the mere passage of PageRank through to referenced (linked) web pages. It's the amount of extra authority that the linked page gets as a result of the link. It's been noted elsewhere that Google looks not only at the link text but also at the text near to the link. I'll illustrate my view on Google by discussing a simple example. Suppose you had three directories listing different types of manufacturing companies. Thinking about the web page for Engineering manufacturing companies, we'll look at typical entries in each directory. All three directories have say 100 links on this particular page. Directory A gives only the company name and the URL. A typical entry looks like this QUOTE WilsonMachine URL http://www.wilsonmachine.com Directory B gives a few words describing the type of manufacturing. QUOTE WilsonMachine, an industrial gear manufacturer URL http://www.wilsonmachine.com Directory C gives a short paragraph describing the company. QUOTE WilsonMachine is one of the leading precision gear and rotating machinery manufacturers in North America. It has been successfully meeting the needs of its mechanical power transmission customers for over 90 years. It can handle projects that few other companies in the world can handle with the same accuracy and quality. It is ISO 9001 certified and a long-standing member of the American Gear Manufacturers Association. URL http://www.wilsonmachine.com My view is that C gives much more relevance or authority to the link than B, which in turn gives much more authority than A. I also believe that this amount of authority will largely be the same for both B and C if the page contains only 20 links or, as suggested above, 100 links. On the other hand, the authority in the case of A would depend to some extent on the number of links on the page. |
||
| Offline | ![]() |
Site Administrator![]() ![]() Group: Site Admin
Joined: 4-September 02
Posts: 6,887
From: Melbourne, Australia
|
Apr 7 2005, 05:24 PM |
|
|
Excellent analogy Stock - thanks for sharing.
|
||
| Offline | ![]() |
Star Member![]() ![]() Group: 1000 Post Club
Joined: 9-January 05
Posts: 1,532
From: Perth, Western Australia
|
Apr 8 2005, 09:51 AM |
|
|
Are you saying that theory doesn't hold water ?
|
||
| Offline | ![]() |
MemberGroup: Members
Joined: 16-November 04
Posts: 21
|
Apr 8 2005, 10:22 AM |
|
|
Water pipe analogy is a good example, but why would you limit yourself with 5 billion websites.. ? As we know page rank refers to a page and not to a site and every new page that google knows about already has its own PR which is (1-d) according to the original formula and the amount of web pages is growing every day.
Also, when you think about road maps - they lie in the same plane and links are not - roads usually have two-way direction and links are mostly one way vectors (how often you see links back from the exactly the same page that being linked to). So, in this way water pipes are closer to me. If they were in the same plane it doesn't matter how wide the pipes are - the water level will be even everywhere. In our case the leak has obvious direction, so there are different levels between outgoing and incoming points - in water pipes they have to be in order to pass the water, but in vectors, levels don't affect direction. This sounds like pure math and originally it is (I am not good with math though), and I agree with Grumpus that a value of a page attached to a link, since vector is a "movement of a ..." from one point to another. You should look at PR as vector value - this is how it referred in the original works and studies. Of course not all the rules of vector apply to links and PR, but I think it worth looking into it just not to get on the wrong track from the start. Also, here is a good review of topic sensitive PR calculation work that has been written before and may be some of the aspects will be implemented by google http://pr.efactory.de/e-pagerank-themes.shtml I also found this work http://dbpubs.stanford.edu:8090/pub/1999-31 and this one which is a separate study http://informatics.indiana.edu/fil/Class/b...first-draft.pdf The stanford's work is pretty interesting from the standpoint of using resources to calculate page rank and rank the page; I'm not sure how it limits their abilities in changing algo and increasing calculations and processing time etc since they use low scale machines, but really interesting.. ..kinda got lost in my own talk.. didn't really want to make a point, and I didn't answer any questions, but may be someone will come up with a good idea from all that stuff I just wrote if any of this makes sense. |
||
| Offline | ![]() |
MemberGroup: Members
Joined: 16-November 04
Posts: 21
|
Apr 8 2005, 10:29 AM |
|
|
btw, just found all the papers by Taher Haveliwala http://www.stanford.edu/~taherh/
and here is also interesting link http://labs.google.com/papers.html Krishna Bharat - Publications (Principal Scientist at Google Inc) http://searchwell.com/krishna/publications.html |
||
| Offline | ![]() |
![]()
|
|
2 Pages 1 2 >
|
|
| Lo-Fi Version | Time is now: 9th February 2010 - 03:15 PM |
| Meet our Moderators: | cre8pc : projectphp : sanity : Black Phoenix : bwelford : EGOL : Ruud : rustybrick : AbleReach : swainzy : joedolson: eKstreme: dazzlindonna : SEOigloo: iamlost : RisaBB |