![]() ![]() |
Solid ContributorGroup: Members
Joined: 24-March 08
Posts: 54
|
May 25 2008, 07:42 AM |
|
|
Is this enough proof?
http://jonathanleger.com/why-post-duplicat...-to-your-blogs/ Is duplicated content a myth? I mean this is the actual proof: http://www.google.com/search?hl=en&q=G...amp;btnG=Search As you can see two identical articles take the first and second position on Google out of 300k+ results. |
||
| Offline | ![]() |
Moderator Alumni![]() Group: Hall Of Fame
Joined: 31-August 02
Posts: 15,634
|
May 25 2008, 08:35 AM |
|
|
The issue is much more complicated than is set forth in the article that you link to.
Is there a "duplicate content penalty" is a different question than "will sites get filtered out of search results because of duplicate content?" There are many forms of duplicate content on the web, from wire news stories, to syndicated articles, to the same pages on a site existing at more than one URL, to public domain stories, to official documents (declaration of independence, for example), to mirrored sites. When a site is mirrored, so that it has exactly the same content, and the same URL structure, then there is a very real possibility that a search engine will decide to not index one version of the site at one or more domains that it may exist at. When a site uses a content management system that regularly duplicates pages under different URLs at the same domain, that may cause pages from that site to not get indexed. It happens a lot. The pages that show up in the search results that you point towards are not duplicate content, even though they share a large amount of content between them. They are a somewhat different problem, known as "near duplicate content" and there are differences upon the pages. Again, this is something that a search engine will likely try to figure out, and possibibly filter out, but it's not the same as duplicate content. It's a more difficult issue to solve. Is one copy or the declaration of independence on one site more preferable than another? Should a search engine allow the top ten search results to show variations of the same content across all of the top ten results for a page? I wrote a little about some of the research going on by search engineers, to address "near duplicate content" at: http://www.seobythesea.com/?p=999 Search engines want to try to index and return results that have unique content. It's possible that pages with near duplicate content will show in search results. I'm not sure that I would build a long term strategy for success on the web solely upon regularly copying other people's content. |
||
| Offline | ![]() |
Solid ContributorGroup: Members
Joined: 24-March 08
Posts: 54
|
May 25 2008, 08:57 AM |
|
|
Fantastic, It does clear up much confusion that was created, maybe this comic can illustrate it:
http://www.mattcutts.com/blog/something-is...n-the-internet/ Your post in your site is very helpful too. It is nice maybe even vital to see that Google is trying to address the thorny issue of copyrighting on the internet. |
||
| Offline | ![]() |
Technical Administrator![]() ![]() Group: Technical Administrators
Joined: 3-February 03
Posts: 3,926
From: Sydney Australia
|
May 25 2008, 06:06 PM |
|
|
All three snippets (the easiest filter) are different. Try this search:
http://www.google.com/search?hl=en&q=%...art=20&sa=N 29 are unique enough to get a guernsey, almost 200 are not. I hate the word penalised. It is too conotative. Filtered is a more accurate word. Step back, the goal of all this is to supply users with markedly different content so they can choose. Bill has some good comments on this, but another few comments I would have are: 1. Your example was very long. Multiple word searches more than likely have the least factors in the algo, for several reasons, starting with the time it takes to process multi-word searches, to their infreuency (and thus unlikelihood of being cached), and lastly to the fact their is less ambiguity. a search for [Seal] wants what? Animal, navy man or singer? A search for [what is the name of the model seal is married to] likely won't have much ambiguity at all, and really needs just one page: the one with the answer. 2. For some searches, Google probably use multiple algorithms. Not just universal search, with the blending of different content types, but also different regular algorithms, with the goal to specifically increase diversity. 3. Just because having duplicate content isn't causing an issue on some searches doesn't make it a terribly useful strategy to use. I think there are probably better ways to acheive one's goals longterm (short term, no comment!) |
||
| Offline | ![]() |
Hall of Famer![]() ![]() Group: Hall Of Fame
Joined: 3-November 05
Posts: 3,461
From: CHeeseland
|
May 27 2008, 01:42 AM |
|
|
Regarding "fine" ... I think the Google Webmaster Guidelines are pretty clear on this:
QUOTE Quality guidelines (...) Don't create multiple pages, subdomains, or domains with substantially duplicate content. There's even a whole article on duplicate content which mentions QUOTE However, in some cases, content is deliberately duplicated across domains in an attempt to manipulate search engine rankings or win more traffic. Deceptive practices like this can result in a poor user experience, when a visitor sees substantially the same content repeated within a set of search results. (...) In the rare cases in which Google perceives that duplicate content may be shown with intent to manipulate our rankings and deceive our users, we'll also make appropriate adjustments in the indexing and ranking of the sites involved. As a result, the ranking of the site may suffer, or the site might be removed entirely from the Google index, in which case it will no longer appear in search results. I think it's probably pretty clear that if you are a serious webmaster, your time is better spent on creating and publishing your own unique and compelling content John |
||
| Offline | ![]() |
![]()
|
|
| Lo-Fi Version | Time is now: 9th February 2010 - 09:17 AM |
| Meet our Moderators: | cre8pc : projectphp : sanity : Black Phoenix : bwelford : EGOL : Ruud : rustybrick : AbleReach : swainzy : joedolson: eKstreme: dazzlindonna : SEOigloo: iamlost : RisaBB |