![]() ![]() |
Moderator Alumni![]() Group: Hall Of Fame
Joined: 31-August 02
Posts: 15,634
|
Aug 21 2006, 10:42 AM |
|
|
Hi all,
One of the curses of having an English degree in literature (who would think that would come in useful) is that you pay too much attention to language patterns, and the way people speak. I noticed for instance, that it was pretty common, at the recent SES in San Jose, for search engineers, giving presentations, or in conversations tend to start many sentences with the word "So," as in: "So, if you think about..." "So, if we were to...." "So, you have this..." I guess that's part of the cost of presenting hypothetical situations and theories to others. Another word that kept on showing up in the presentations and conversations from these search engine representatives was the word "signals." Mike Grehan and I were both in the audience during a presentation titled "Link Building Q&A" and Mike wrote a column for Clickz which emphasizes the word: Sending Signals to Search Engines I liked this quote from Mike's article a lot: QUOTE I feel certain, though, that search engines can glean a lot more temporal analysis related to linkage data when it's combined with the other signals they talk about, such as burstiness, end-user behavior data, social search, and personalization. I do think that search engines still place a lot of reliance on links, but am convinced that understanding user queries - like seen in our thread on the AOL data, is playing a larger and more important role in determining relevance. There have been a number of mentions in recent white papers and patents which mention search engines collecting information from ISPs about user behavior. I wrote about a patent that was granted last week for Overture/Yahoo which describes some ways of collecting and using that information in: How a Search Engine Might Use Information from an ISP While Capturing Traffic Flows I think one of the most important aspects of that is the idea that it is a "traffic flow" that a search engine is looking at, and they can capture information over time, which shows how some sites are more popular than others and are attracting traffic in response to searches and other methods of getting people to a site. What role might that have in ranking or reranking a page? We've talked about Google's patent involving Information Retrieval based upon Historic Data in a number of threads here. Here's a paragraph from that which discusses some signals based upon this type of information: QUOTE [0107] In addition to history of positions (or rankings) of documents for a given query, search engine 125 may monitor (on a page, host, document, and/or domain basis) one or more other factors, such as the number of queries for which, and the rate at which (increasing/decreasing), a document is selected as a search result over time; seasonality, burstiness, and other patterns over time that a document is selected as a search result; and/or changes in scores over time for a URL-query pair. This is a paraphrase of an answer that I received when asking a question during that Link Building Q&A session: QUOTE There are a lot of "signals" that search engines look at when determining relevancy and rankings. Thinking about an information stream like this, I think that we need to expand our thoughts of rankings and relevancy beyond what we commonly call "on page" factors and "off page" factors to another additional category which looks at how our sites fit within that information stream. How well do our sites perform when we look at them from the perspective of users clicking on pages dealing with topical subjects? This paper talks about burstiness some more: Bursty and Hierarchical Structure in Streams One of the reasons why I like blogging so much is that it gives me a chance to write about subjects that are topical, and that will attract viewers based upon what is popular and interesting today. By keeping an eye on conversations going on upon the web within my targeted niche, and taking part in them, I become part of that information stream. For example, Google recently acquired Neven Vision, a company that makes face and object recognition software. I conducted research at the US Patent Office and wrote about the patents which the company had been granted, which helped my site become part of the topical information stream. This is something that can be done in many different industries, by keeping an eye on what people are thinking about and talking about. It's one of the potential benefits of blogging. What signals do you think are becoming more important, and how are you taking advantage of being within an information stream? |
||
| Offline | ![]() |
Moderator Alumni![]() Group: Hall Of Fame
Joined: 31-August 02
Posts: 15,634
|
Aug 22 2006, 10:01 PM |
|
|
Got a kick out of this blog post:
Extracting Meaning from Bay Area Sentences Thanks, Chris. QUOTE Your questions require some thought and self-evaluation, and some others may reach the same conclusion as me: "Am I swimming in the stream or just dipping my feet into it?" Sometimes you have to be the ones starting the conversation, and creating the buzz, too. This video from Yahoo, presenting Jon Kleinberg of Cornell University talks about how information flows through social sites, blogs, emails, and has some interesting ideas in it: Diffusion and Cascading Behaviors in Social Networks I think those are excellent points regarding brand and name information. |
||
| Offline | ![]() |
Moderator![]() Group: Moderators
Joined: 31-July 06
Posts: 1,665
|
Aug 25 2006, 03:53 AM |
|
|
Dear Bill -
I read you post, and read Animas post, and I just went and wrote this one: http://www.solaswebdesign.net/wordpress/ I hope this isn't considered self-promotion, but you have struck upon a keen point with me - the vagaries of the English language! I'm so glad to know I am not the only one who thinks about HOW people say things! Thanks, Bill! Miriam This post has been edited by SEOigloo: Aug 25 2006, 03:55 AM |
||
| Offline | ![]() |
Moderator Alumni![]() Group: Hall Of Fame
Joined: 31-August 02
Posts: 15,634
|
Aug 25 2006, 11:36 AM |
|
|
Nice blog post.
I'm thinking about how the word "so" was used in those conversations, and I think that it was more a signal that what was being said was a hypothetical more than anything else. The equivalent of perhaps starting a sentence with "Imagine that..." or, "We can speculate..." In a recent Matt Cutts blog post, SEO Advice: Writing useful articles that readers will love, a lot of folks in the comments section start discussing the words that Matt uses rather than the advice he provides. I think that they might have cared a little too much about the signal itself, and not the information that it was carrying. But the search engines do look at those signals. They consider the words used, and consecutive sequences of words (n-grams), in understanding messages sent. I referred to the Jon Kleinberg video, and I agree with Chris on its importance. Getting past the math in the video, the message in there is important - that trying to understand how information spreads from person to person may be helpful in finding that information for someone who might be looking for it. QUOTE I'm so glad to know I am not the only one who thinks about HOW people say things! I think that's part of what has me interested in seeing how a search engine tries to understand and interpret signals. |
||
| Offline | ![]() |
![]()
|
|
| Lo-Fi Version | Time is now: 9th February 2010 - 06:49 PM |
| Meet our Moderators: | cre8pc : projectphp : sanity : Black Phoenix : bwelford : EGOL : Ruud : rustybrick : AbleReach : swainzy : joedolson: eKstreme: dazzlindonna : SEOigloo: iamlost : RisaBB |