![]() ![]() |
Moderator Alumni![]() ![]() Group: Hall Of Fame
Joined: 15-May 04
Posts: 2,648
From: Londonish, England
|
May 29 2006, 05:23 AM |
|
|
For the benefit of all concerned, (apart from Naveen
QUOTE Latent semantic analysis (LSA) is a technique in natural language processing, in particular in vectorial semantics, invented in 1990 [1] by Scott Deerwester, Susan Dumais, George Furnas, Thomas Landauer, and Richard Harshman. In the context of its application to information retrieval, it is sometimes called latent semantic indexing (LSI). from Wikipedia It's a complex looking subject Naveen. Where would you like to start the discussion? |
||
| Offline | ![]() |
Technical Administrator![]() ![]() Group: Technical Administrators
Joined: 8-March 06
Posts: 2,650
From: Minneapolis/Saint Paul, MN
|
May 29 2006, 05:50 AM |
|
|
There have been a number of great posts on this at SEOmoz - Michael Martinez recently wrote a very detailed post on latent semantic indexing, specifically talking about how it's not really been implemented yet in any significant way due to the computational power it requires - even Google can't altogether cut it.
Rand talked about the idea in February of 2005 and gave an interesting early perspective, as well. Aaron Wall wrote about it at Search Engine Journal, and rustybrick made some extensive comments at SE Roundtable. It seems to me that the complexity of natural language and the construction of meaning may be a significant barrier to the kind of large-scale analysis a search engine needs to do. A reduced test case may be practical, but at least for the time being I think it's beyond the capability of search engines. But I don't think that LSA needs to fully "track" natural language, on the other hand - it needs to be able to learn and change; not maintain a fixed idea of how language works. Even with an incomplete implementation, it may well work in an extremely sophisticated and effective manner. As send2paul said, it's a complex subject - possibly beyond any meaningful application in SEO, and certainly beyond my mathematics |
||
| Online | ![]() |
Moderator Alumni![]() Group: Hall Of Fame
Joined: 1-September 02
Posts: 9,213
From: UK
|
May 29 2006, 09:06 AM |
|
|
QUOTE(Naveen) We have carried out and updating our extensive reaserch on this latest technolgy of search engine. if anyexpert from search engine industry, want to share/discuss his /her expertise and experience on LSI. They are most welcome. I hate to rain on a parade for 'the latest search engine technology', but we first covered this subject a few years ago. In fact, within a few months of first opening these forums in 2002. http://www.cre8asiteforums.com/forums/inde...p?showtopic=593 In 2003, we were already directly applying advice regarding LSA to sites and techniques being discussed. http://www.cre8asiteforums.com/forums/inde...latent+semantic If LSI/LSA were really the latest technology then those hundreds of Information Retrieval scientists haven't been doing much in the years since. LSA is far older than many more significant updates to technology, including the infamous 'Florida' update which itself now seems quite ancient history. Far more recently (a mere 2+ years ago, in January 2004) there was mention of a specific use of Semantic Analysis by a specific engine in the following discussion. http://www.cre8asiteforums.com/forums/inde...?showtopic=5394 A good place to start with studying LSA would be the engines themselves. http://www.google.com/search?q=latent%20se...indexing&num=50 |
||
| Offline | ![]() |
Technical Administrator![]() ![]() Group: Technical Administrators
Joined: 8-March 06
Posts: 2,650
From: Minneapolis/Saint Paul, MN
|
May 30 2006, 02:11 PM |
|
|
QUOTE I can't believe that you guys are not using LSI!!! It is one of my secret weapons for competitive terms. If you would like to learn more on LSI Google Michael Marshall. He is the SEO who basically invented LSI and SEO. I'm not sure exactly what you mean by "using" latent semantic indexing. Latent semantic indexing is a technique which construct an algorithm which can identify the essential meaning of your query or website and use that knowledge to make connections despite the lack of a direct key-term relationship. (To attempt to describe it, however imprecisely.) If you're talking about the same Michael Marshall I know of, then he has written some interesting articles about writing content with LSI in mind. However, as interesting as this article is, Michael Marshall certainly did not actually "invent" LSI or SEO. I'd be interested in hearing how LSI has explicitly aided your SEO campaigns - how have you applied the concepts of LSI to your content and what demonstrates that this has helped you? |
||
| Online | ![]() |
Moderator Alumni![]() Group: Hall Of Fame
Joined: 31-August 02
Posts: 15,634
|
May 30 2006, 03:28 PM |
|
|
QUOTE It is a one of the latest and crucial technology for search engine and help the search engine to retrive the data of web site and present the result in a LSI based technolgy. The concept of Indexing by Latent Semantic Analysis (pdf) was probably introduced in 1990, though it relies on a lot of research from the 1960s, 70s, and 80s (see the list of citations in the paper I linked to.) The biggest public plunge that Google has probably made concerning any type of semantic indexing was their purchase of the company Applied Semantics, with its Conceptual Information Retrieval and Communication Architecture (CIRCA) technology. A number of similar ideas surfaced in two recent patent applications from Google: Phrase-based searching in an information retrieval system Multiple index based information retrieval system Think about Google's supplemental index while looking at this one above. See also the Applied Semantics Patents: Meaning-based information organization and retrieval Meaning-based advertising and document relevance determination |
||
| Offline | ![]() |
Untested![]() Group: Members
Joined: 11-April 06
Posts: 3
|
Jun 12 2006, 08:28 AM |
|
|
Hi,
Some quick Fact about LSI – 1. LSI is 30% more effective than popular word matching method. 2. LSI uses a powerful and fully automatic statistical method (Singular Value Decomposition) 3. It is very effective in cross-languages retrievals. 5. LSI can retrieve relevant information that does not contain query words, 6. It finds more relevant information than other methods. There are some examples for LSI based SEO and Non LSI based SEO, Please see and analyze the difference. Result on Google for Query “Laptop” without LSI (Kindly analyze the result on Top 10 pages format of Google) http://www.google.com/search?num=100&compl...en&lr=&q=laptop Result on Google for query “ ~Laptop “ with LSI http://www.google.com/search?num=100&compl...lr=&q=%7Elaptop Result on Google for Query “Mobile” without LSI http://www.google.com/search?num=100&compl...le+&btnG=Search Result on Google for Query “~Mobile” with LSI http://www.google.com/search?num=100&compl...r=&q=%7Emobile+ we have written a 37 e-book on LSI. it includes the some fact about LSI. kindly let me know if you enjoy reading.I will mail it by PM. Google is implementing it its semantic result. we are making our research more extensive with the latest fact and figure. Thanks Naveen This post has been edited by web based expertise: Jun 12 2006, 08:42 AM |
||
| Offline | ![]() |
Moderator Alumni![]() Group: Hall Of Fame
Joined: 1-September 02
Posts: 9,213
From: UK
|
Jun 12 2006, 09:22 AM |
|
|
Using "~keyword" in a search has nothing whatever to do with Latent Semantic Analysis. That's standard semantics, such as applied semantics, with nothing latent about it.
The ~ operator is simply used to find synonyms and stem-variants of a given word. Please tell me that your 37 ebooks on LSA do indeed know what LSA is, and what it is not. This thread from the Search Engine Watch Forums discusses c-index methods for building up 'thesaurus' of related words by co-occurrence. As Orion states there, c-indexes date back to the 70s at least, and so far predate any existing search engine. However, these too are mainly about 'active' semantics, not 'latent' semantics. Latent semantics are the things you only tend to spot from a distance, with fractals or some other plotting/graphing analysis. The best way of explaining the difference might be to look at ways of blocking spam (such as emails). If we use exact negative matching (allow no email that uses the word 'viagra' for example) this can get a wide range of false positives, blocking emails that were not actually spam at all. Using semantics, we might look for variants of the word viagra, especially misspellings that are attempting to bypass the first kind of exact matching we just mentioned - like those that use "VlAGRA" (the I there is actually a lower-case L) Latent semantics is the system that noted you could block over 70% of all spam by just blocking anything that used bold red exclaimation points as punctuation. Bayesian spam filtering methods are far more along the line of latent semantics than most other systems. |
||
| Offline | ![]() |
Star Member![]() ![]() Group: 1000 Post Club
Joined: 22-May 06
Posts: 1,632
|
Jun 12 2006, 09:34 AM |
|
|
If you click here web based expertise you will get to learn the importance of anchor text! What you pointing is actually irrelevant and misleading. Self bragging about your SEO skills and e-book about LSI technology leaves me very cold and is very unprofessional at least in this forum. You have actually contibuted nothing to this post other than telling us how good you are!
The proof is in the padding! Here is a challenge. Post a before and after 'LSI Technology' page and let the pros here comment on the techniques employed. Please spell check the page before you post! You prove your techniques with the relevant mathematics and I will eat my words and my summer hat! |
||
| Offline | ![]() |
![]()
|
|
3 Pages 1 2 3 >
|
|
| Lo-Fi Version | Time is now: 9th February 2010 - 06:05 PM |
| Meet our Moderators: | cre8pc : projectphp : sanity : Black Phoenix : bwelford : EGOL : Ruud : rustybrick : AbleReach : swainzy : joedolson: eKstreme: dazzlindonna : SEOigloo: iamlost : RisaBB |