Reply to this topicStart new topic
> Andrei Broder Joins Yahoo!

Moderator Alumni

Group Icon
Group: Hall Of Fame
Joined: 31-August 02
Posts: 15,634
post Nov 18 2005, 09:58 AM
The official Yahoo! press release is here:

http://home.businesswire.com/portal/site/g...147&newsLang=en


Andrei Broder, who was the chief search scientist for Altavista from 1999 to 2002, and then one of the technology heads of IBM since, has joined Yahoo! as vice president of emerging search technology.

There are a lot of search patents and white papers with Andrei Broder's name upon them. He has written papers and patents in collaboration with some search engineers presently working with Google and Yahoo!, including Monika Henzinger and Krishna Bharat of Google.
Offline Go to the top of the page

Solid Contributor

Group: Members
Joined: 16-November 05
Posts: 66
From: Chennai
post Nov 19 2005, 02:31 PM
thats something gr8 for yahoo to cheer about smile.gif
Offline Go to the top of the page

Moderator Alumni

Group Icon
Group: Hall Of Fame
Joined: 31-August 02
Posts: 15,634
post Nov 19 2005, 04:43 PM
It is indeed.

I think having Andrei Broder at Yahoo! will not only add to their knowledge base of things related to search, but also help attract other folks to Yahoo!.

Chris Sherman and Garry Price had a post at the Search Engine Watch Blog which lists a number of the papers that Andrei Broder has worked upon. See:

http://blog.searchenginewatch.com/blog/051118-122544

Here are a few of the patents that he worked with others upon as an inventor at Altavista:

Connectivity server for locating linkage information between Web pages

This patent describes a way of indexing the links between pages. Here's a snippet from the document:

QUOTE
The invention provides linkage information for a significant portion of the Web. The information can be used by programs that rank Web pages according to their connectivity, for instance, pages with many connections could be considered authoritative pages, or \"hubs.\" The information can be used to build Web visualization and navigation tools. The information can be used in conjunction with search engine results to lead users to portions of the Web that store content which may be of interest. In addition, the invention can be used to optimize the design and implementation of web crawlers based on statistics derived from the in and out degrees of nodes.



Method and apparatus for finding mirrored hosts by analyzing connectivity and IP addresses

The reason for the process described in this patent:

QUOTE
Often search engines index only one copy of a mirrored page. In the process, they may fetch replicas and discard them. If mirroring information were available, a search engine could avoid fetching replicas from known mirrored hosts. The search engine could also distribute fetches of the remaining pages between the mirrors for load balancing, or choose the best mirror in terms of response time



Method for determining the resemining the resemblance of documents

This one looks at parts of a pair of documents to see how similar they are, and could even be used to build clusters of similar documents.


Method for clustering closely resembling data objects

This one builds upon the previous patent mentioned, and shows a way for a search engine to only index one copy or a document that is very similar to being considered for indexing. The concept of shingles, or fingerprints, is repeated in both, and explained more fully, and in more practical terms when it comes to indexing pages, in this one.

There are a number of others in the US Patent and Trademark Office database listing him as inventor or co-inventor. Those are worth taking a look at if you are interested in exploring his work some more.
Offline Go to the top of the page
Reply to this topic Start new topic
1 User(s) are reading this topic (1 Guests and 0 Anonymous Users)
0 Members:
Jump to Forum:
 
Lo-Fi Version Time is now: 9th February 2010 - 06:33 AM
Meet our Moderators: cre8pc : projectphp : sanity : Black Phoenix : bwelford : EGOL : Ruud : rustybrick : AbleReach : swainzy : joedolson: eKstreme: dazzlindonna : SEOigloo: iamlost : RisaBB
Cre8asite RSS Feed