9 Pages V  1 2 3 > »   
Reply to this topicStart new topic
> SEO Myths

Moderator Alumni

Group Icon
Group: Hall Of Fame
Joined: 31-August 02
Posts: 15,634
post Nov 26 2005, 11:59 PM
In another thread, a suggestion was made that we explore some SEO myths. I've come up with a list to start us off, and would be happy to discuss these in more detail. For now, I'm just going to list them. If you have more to add, please do so.

SEO Myths:

1. SEO copywriting means writing strong copy, then inserting keywords within the copy a number of times.

2. keyword density is important, and you don't want too much or too little.

3. You should place content above menus to have it "crawled first"

4. Meta tags are the key to high rankings

5. The revisit meta tag can tell search engines to come back on a regular basis

6. <META name="robots" Content="Index, Follow"> is important

7. The more links the better

8. Pagerank is dead

9. There is a duplicate content penalty

10. There is a certain percentage of duplication that you can get away with before your page will be filtered in the results

11. There is a sandbox

12. There are thousands of search engines to submit your site to

13. You need to submit your site to the major search engines.

14. Resubmitting sites to search engines helps your site get crawled more frequently

15. Appearing in DMOZ is essential to high rankings

16. High rankings are the aim of SEO

17. Optimizing for specific keyword phrases is the goal of SEO

18. You shouldn't optimize a site for terms that are too broad or too narrow.

19. Once you've achieved a top ranking with a page, you should adopt a hand's off approach to that page.

20. Hidden links or text in a page can get your page ranked higher.

21. Software, which compares your use of keyword phrases in titles, headlines, meta tags, etc., with pages that rank highly for those terms can help you rank highly for those phrases.

22. Participating in pay-per-click or paid impression programs on the search engines can help or harm your search rankings.

23. Search engines don't index dynamic sites
Offline Go to the top of the page

Solid Contributor

Group: Members
Joined: 30-September 04
Posts: 84
From: london.on.ca
post Nov 27 2005, 12:21 AM
3. You should place content above menus to have it "crawled first"

I think it is good SEO to have your content first sourced even above you menu. I never heard anyone ever call that a myth before.

7. The more links the better

Confirm. Long as they are not ROS, not added more than 10-20% of yoru total a month, in or from bad nebs, I am for the more links the better. I don't think thats a myth.

8. Pagerank is dead

Pagerank real, directory and toolbar is alive and well. The ranking weight of Pagerank is dead. That isn't a myth.

9. There is a duplicate content penalty

LOL. First I thought you might have forget to insert the "not" but after reading the "myths" I think you really don't believe in dup penalties. Now that one takes the cake.

10. There is a certain percentage of duplication that you can get away with before your page will be filtered in the results

Confirm.

11. There is a sandbox

Confirm.

16. High rankings are the aim of SEO

Confirm. High rankings for convertiable keywords.

17. Optimizing for specific keyword phrases is the goal of SEO

Confirm. Woa you have a lot of strange myths.

21. Software, which compares your use of keyword phrases in titles, headlines, meta tags, etc., with pages that rank highly for those terms can help you rank highly for those phrases.


Confirm. Long as it is comparing them to good examples. While I hold that Googles ranking weight is 90% plus off site it is not going to help much. But with MSN 70% plus and Yahoo 80% plus off page it makes some what of a difference.
Offline Go to the top of the page

Star Member

Group: Members
Joined: 18-November 05
Posts: 976
From: Paris - France
post Nov 27 2005, 12:24 AM
24. W3C Validation is necessary to rank high

25. SEO is easy and can be done after reading a 50$ junk Ebook
Offline Go to the top of the page

Solid Contributor

Group: Members
Joined: 30-September 04
Posts: 84
From: london.on.ca
post Nov 27 2005, 12:30 AM
25. SEO is easy and can be done after reading a 50$ junk Ebook

not the junk one but the aaron one yes. and his is $79. SEO is easy, it's like throwing mud at a wall. Find out what other people who have their mud sticking or using for a mix and us that. It really is simple.

bragadocchio: for an admin you are sporting alot of interesting myths.
Offline Go to the top of the page

Star Member

Group: Members
Joined: 18-November 05
Posts: 976
From: Paris - France
post Nov 27 2005, 12:45 AM
Yes seobook is a good one but there is a lot of junk ebooks down there that people should be wary of.
Offline Go to the top of the page

Star Member

Group: Members
Joined: 18-November 05
Posts: 976
From: Paris - France
post Nov 27 2005, 12:52 AM
3. You should place content above menus to have it "crawled first"

I think it is good SEO to have your content first sourced even above you menu. I never heard anyone ever call that a myth before.

I was wondering if that was true or not and mentionned to bragadocchio that I've seen sites ranking well and they do have content first. The example I had in mind was sites that rank well for "seo company" on Google. And sites like seoimage.com, your site, etc have content first... So do you think it helped you get the ranking you have?
Offline Go to the top of the page

Moderator

Group Icon
Group: Moderators
Joined: 27-July 05
Posts: 2,936
post Nov 27 2005, 12:53 AM
QUOTE
2. keyword density is important, and you don't want too much or too little.   (you need at least one occurrence of the word or a related word... then if you stuff you will not do well)

7. The more links the better.  (Right, as long as they are not junk links)

9. There is a duplicate content penalty  

10. There is a certain percentage of duplication that you can get away with before your page will be filtered in the results  (sure)

11. There is a sandbox (ha ha... the \"expert\" SEOs deny the sandbox  adamantly to gain business then they can't deliver)

17. Optimizing for specific keyword phrases is the goal of SEO (you betcha - and lots of non specific if you are clever)

21. Software, which compares your use of keyword phrases in titles, headlines, meta tags, etc., with pages that rank highly for those terms can help you rank highly for those phrases. (if you don't know Jack it can actually be useful - but I bet on myself!)


I'll bet a big buck that some of these are true.
Offline Go to the top of the page

Star Member

Group Icon
Group: 1000 Post Club
Joined: 18-November 05
Posts: 1,392
From: GMT+1
post Nov 27 2005, 12:59 AM
Hehe, there's gotta be some non-myths in there, or else there would be no need for discussion....
Offline Go to the top of the page

Moderator Alumni

Group Icon
Group: Hall Of Fame
Joined: 31-August 02
Posts: 15,634
post Nov 27 2005, 01:40 AM
I'll definitely address those, Bob. smile.gif

Before I dig too deeply into them, let's see if anyone else has more to add, or issues with any of the ones that I've listed.

I'll start with one of them that you've indicated "takes the cake."

QUOTE
9. There is a duplicate content penalty  

LOL. First I thought you might have forget to insert the \"not\" but after reading the \"myths\" I think you really don't believe in dup penalties. Now that one takes the cake.


We know from a number of patents from Altavista, that there are ways of identifying mirrored sites, and sites that are very similar to other sites on a number of levels, and that search engines can decide not to include those pages in their index. I'm not sure that is what we are talking about when we talk about a "duplicate penalty." If it's what you are considering a duplicate penalty, I'll concede that aspect of a duplicate penalty.

Method and apparatus for finding mirrored hosts by analyzing urls

Method and apparatus for finding mirrored hosts by analyzing connectivity and IP addresses

We also know that there are methods of indentifying pages that are very similar from Altavista and Google patents.

Method for determining the resemining the resemblance of documents

The abstract from this Alstavista patent:

QUOTE
A method for facilitating the comparison of two computerized documents. The method includes loading a first document into a random access memory (RAM), loading a second document into the RAM, reducing the first document into a first sequence of tokens, reducing the second document into a second sequence of tokens, converting the first set of tokens to a first (multi)set of shingles, converting the second set of tokens to a second (multi)set of shingles, determining a first sketch of the first (multi)set of shingles, determining a second sketch of the second (multi)set of shingles, and comparing the first sketch and the second sketch. The sketches have a fixed size, independent of the size of the documents. The resemblance of two documents is provided using a sketch of each document. The sketches may be computed fairly fast and given two sketches the resemblance of the corresponding documents can be computed in linear time in the size of the sketches.


Method for identifying near duplicate pages in a hyperlinked database

The abstract from this Altavista patent:

QUOTE
A method is described for identifying pages that are near duplicates in a linked database. In the linked database, pages can have incoming links and outgoing links. Two pages are selected, a first page and a second page. For each selected page, the number of outgoing links is determined. The two pages are marked as near duplicates based on the number of common outgoing links for the two pages


Method for indexing duplicate records of information of a database

The abstract for this Altavista patent:

QUOTE
A computer implemented method indexes duplicate information stored in records having different unique addresses in a database. A fingerprint is generated for each record, the fingerprint is a singular value derived from all of the information of the record. The fingerprint is stored in the index as a unique fingerprint if the fingerprint is different than a previously stored fingerprint of the index. A reference to the unique address of the record is stored with the fingerprint. If the fingerprint is identical to the previously stored fingerprint, then store the reference to the address of the record with the previously stored fingerprint.


This last one includes the clearest statement that a page would be deleted from the index if it was a duplicate of a "master page." The difficulty there may not be determining duplicates as much as it is identifying which document is the "master page" and which is the duplicate.

We don't know how closely any of the search engines are presently using the ideas in those patents from Altavista, but some of the folks who worked upon them have worked on similar documents for other search engines. For instance, Monika Henzinger has worked one of a couple of patents from Google that look mostly at duplicate content issues.


Detecting duplicate and near-duplicate files

The abstract from this Google Patent, filed January 24, 2001 and granted December 2, 2003:

QUOTE
Improved duplicate and near-duplicate detection techniques may assign a number of fingerprints to a given document by (i) extracting parts from the document, (ii) assigning the extracted parts to one or more of a predetermined number of lists, and (iii) generating a fingerprint from each of the populated lists. Two documents may be considered to be near-duplicates if any one of their fingerprints match.


This document does a nice job of raising issues that weren't considered in the Altavista patents, while building upon ideas mentioned in a number of them. While it is possible that some duplicates may be removed from the database, this statement leads me to believe that what is more likely to happen is a filtering of results:

QUOTE
The present invention may function to generate clusters of near-duplicate documents, in which a transitive property is assumed. Each document may have an identifier for identifying a cluster with which it is associated. In this alternative, in response to a search query, if two candidate result documents belong to the same cluster and if the two candidate result documents match the query equally well, only the one deemed more likely to be relevant (e.g., by virtue of a high Page rank, being more recent, etc.) is returned.


It does note that there are some benefits to not including duplicate page. But, we do see from a number of search queries where we search for a unique string of text upon a page that Google will show "very similar" pages in search results, by including a link to click upon to see those. So Google is filtering results for duplicates. Does that mean that they don't delete sites, or penalize them? How do they know which page is the original? Which page is the one that shows up when one page is filtered, and another isn't? That's probably a more important question to ask then if some pages are penalized for duplicate content. How does a search engine know which is the original, and which is the duplicate?

William Pugh, one of the co-inventors on that document has a pdf presentation that I've seen appear and disappear and reappear and disappear again from the web at: http://www.cs.umd.edu/~pugh/google/Duplicates.pdf . (If you do a search for it, you can see the "HTML version" which is cached in Google. Here are a couple of lines from that document:


QUOTE
False Positive Rate

•0.1% seems like a pretty low false positive rate

•unless you are indexing billions of web pages

•Need to be very careful about deciding to discard web pages from index

•Less careful about eliminating near duplicates from query results


He does note that he isn't sure if Google has adopted the method described in this patent.

A patent that was filed (October 6, 2000) and granted (September 2, 2003) earlier than the one above from Google is:

Detecting query-specific duplicate documents

The abstract from this Google patent:

QUOTE
An improved duplicate detection technique that uses query-relevant information to limit the portion(s) of documents to be compared for similarity is described. Before comparing two documents for similarity, the content of these documents may be condensed based on the query. In one embodiment, query-relevant information or text (also referred to as \"snippets\") is extracted from the documents and only the extracted snippets, rather than the entire documents, are compared for purposes of determining similarity.


Here we have an approach that is conducting a determination the duplicate nature of documents based upon snippets being returned for queries. Under this method, pages are filtered when results are served rather than possessing penalties while sitting in an index. Here's a quote from the document that describes some of the issues addressed by this patent:

QUOTE
Some duplicate avoidance techniques are effected during the automated indexing operation. Similar documents can be flagged by (i) defining a similarity measure between two documents, and (ii) defining the two documents as \"duplicates\" if the similarity measure exceeds a predetermined threshold.  

Unfortunately, however, often duplicate information may be found in documents that are not exactly the same or even very similar. For example: (i) identical content may be presented with different formatting. (e.g., plain text versus HTML); (ii) different headers and/or footers may be prepended and/or appended, respectively, to identical content; (iii) hit counters may be appended to identical content; (iv) last modified dates may be appended, to identical content; and (v) one web site may include a copy of content found elsewhere (e.g., as a part of a compilation or aggregation of content, or simply as an insertion). Cases (ii)-(iv) are illustrated by the Venn diagrams of FIGS. 1 and 2. FIG. 1 illustrates the case where a second document merely adds a small amount of information (e.g., a counter, a footer, etc.) to a first document, whereas FIG. 2 illustrates the case where a second document slightly changes some information (e.g., a last modified date) of a first document. The present invention may be used to detect such \"duplicates\" with slight changes.  

Furthermore, the present invention may be used to detect duplicate content within documents that have a lot of different information, such as documents with different formatting codes or documents that aggregate or incorporate other content. Many prior techniques are not well-suited for such cases. For example, assume that documents A and B each contain basic financial information about companies. Assume further that document A has information on 50 companies, while document B has information on 100 companies, at least some of which are the same as those in document A. (For example, document B could be a later, expanded version of document A.) The Venn diagrams of FIGS. 3 and 4 illustrate such examples.



When I see people writing about a "duplicate content penalty," it is often in the context of "how much duplicate content can my page have before it is considered a duplicate of another page, and given a penalty by the search engines?" If we can rely upon some of the documentation above, the answer is that some content may not be indexed at all, especially if it is a site that is mirrored. Duplicate pages may or may not be indexed, but are possibly more likely to be filted out when search results are returned.

And that is why I say that a duplicate content penalty is a myth. Of course, if you can provide some information otherwise, I would be appreciative.
Offline Go to the top of the page

Star Member

Group: Members
Joined: 18-November 05
Posts: 976
From: Paris - France
post Nov 27 2005, 01:55 AM
I love when people bring proven data to confirm their statements...

I think brag did a good job here, and I'm sure he can help us understand each of the things that we have decided to call "myths" in a SEO point of view.
Offline Go to the top of the page

Moderator Alumni

Group Icon
Group: Hall Of Fame
Joined: 31-August 02
Posts: 15,634
post Nov 27 2005, 02:22 AM
QUOTE
7. The more links the better  

Confirm. Long as they are not ROS, not added more than 10-20% of yoru total a month, in or from bad nebs, I am for the more links the better. I don't think thats a myth.


I often see this one spouted in ads for "SEO companies" (I'll use that term loosely) that will hook people up with thousands of links for $129/month or some similar price. Saw some advertising their SEO linking services on Amazon.com recently.

Quality over quantity is the way to go here. Focus upon getting links that will bring you traffic, links from sites that may be considered authorities, links from popular sites, and popular pages, links that may lead to conversions and fulfilling the objectives of a site.

Of course, I'd rather have a link from the front page of each of these sites than a few thousand from other pages:

Adobe.com
Apple.com
Energy.gov
FirstGov.gov
Google.com
Harvard.edu
Macromedia.com
NASA.gov
NSF.gov
NYTimes.com
Real.com
StatCounter.com
W3.org
WebStandards.org
Blogger.com

But, ignoring the high page ranks of those sites, I want links from pages that are relevant and that people will follow while looking forward to reading the material on the site the link points to.

I do think that you should work to try to increase the number of links to your site, whether you do it by building content that people what to link to, or by some other means. But a straightforward, "More links is better" is just a myth - from a link popularity stance, and from an approach that looks at the opportunity to make a conversion and meet the objectives of the site.

I'll echo the words of Eric Ward here:

QUOTE
Are more links better or not?  


No, more links are not better, unless all of them are high-quality links. Numbers aren't as important as context and relevancy. It is better to have a few links from sites that are similar in content and topic to yours, a few links from the portals, and a few links from site reviewers, than to have 1,000 links on Free For All (FFA) links pages.


Since I dragged a few patents into my last post, I'll bring one into this post, too.

Ranking search results by reranking the results based on local inter-connectivity

Does Google resort their returned results based upon links between the pages being returned? This patent from them describes how they would do that. So, it's possible that a handful of links from pages that may show up in the results of the same query your site is showing up within may provide some value to you. If all of the links pointing to your site are from pages that have nothing to do with the topic of your site, and aren't relevant, this patent, if in use, might mean that your page could be sorted so that it is behind sites within the results that link to each other.

There may be sites that share a topic with you that aren't your direct competitors and may be willing to point a link to you if you provide content that they find interesting, or may be of use to their visitors. That's an area where relevance may be more important than numbers.
Offline Go to the top of the page

Moderator Alumni

Group Icon
Group: Hall Of Fame
Joined: 31-August 02
Posts: 15,634
post Nov 27 2005, 02:26 AM
QUOTE
bragadocchio: for an admin you are sporting alot of interesting myths.


I hear, and see, and get asked about a lot of these myths. smile.gif

QUOTE
I think brag did a good job here, and I'm sure he can help us understand each of the things that we have decided to call \"myths\" in a SEO point of view.


Thanks. smile.gif

Though I'm happy to let others describe their thoughts on many of these before I jump to defend their status as "myths." I will do that though.
Offline Go to the top of the page

Moderator Alumni

Group Icon
Group: Hall Of Fame
Joined: 31-August 02
Posts: 15,634
post Nov 27 2005, 02:54 AM
QUOTE
2. keyword density is important, and you don't want too much or too little. (you need at least one occurrence of the word or a related word... then if you stuff you will not do well)


I'll give the short answer here, with an example:

This page ranks #1 out of 2,890,000,000 for the phrase "click here" (without quotation marks.)

http://www.adobe.com/products/acrobat/readstep2.html

Neither word appears upon the page. No occurrences at all. Of course, my answer is a shortcut, but it shows that a document can have a keyword density of 0.00% and still rank number one out of 2 billion results.

This article does a nice job of discussing keyword density:

The Keyword Density of Non-Sense.

Keyword density is the percentage of use of the keywords within the document itself. That's different from term frequency, which is the number of times the word appears within the document. I believe that's what EGOL is pointing towards, and having the word or words appear upon the page can make it easier for the phrase to rank in search engines, though we still have instances like the Adobe one I pointed towards above.
Offline Go to the top of the page

Moderator/Blog Editor

Group Icon
Group: Site Admin
Joined: 18-January 05
Posts: 5,375
From: Olympia WA, USA
post Nov 27 2005, 06:17 AM
QUOTE(Bill)
16. High rankings are the aim of SEO  
 
17. Optimizing for specific keyword phrases is the goal of SEO
Agreed.

Ranking is only a step. Targeting is only a step.

High rankings without something behind them would be like putting up a Very Big Advertisement without further mission.

If optimizing for specific keyword phrases was "the" goal, yoiy, the mind boggles. Is that myth where gibberish generators come from? That which begat Adsense pages with automatically generated nonsense text?

Background reading - thread on Role of an SEM

Elizabeth
Offline Go to the top of the page

Moderator

Group Icon
Group: Moderators
Joined: 6-March 03
Posts: 7,962
From: Langley, British Columbia, Canada
post Nov 27 2005, 06:27 AM
What a great thread. Just thought it might be helpful to chime in with a thought about the word myth. Nothing is ever black and white. Myth-ness is a scale. It's usefullness is only in deciding whether you're going to put a lot of effort into using the myth as a working rule.

I don't think anyone of us would argue that 'The keywords metatag is important in SEO' is absolutely a myth. However it doesn't take much effort to put in a keywords metatag so, in case things change, many of us do. I'm with Bill on most of his list of myths. I would use them as a guide in where I put my SEO efforts.
Offline Go to the top of the page

Quarter Grand Poster

Group: Members
Joined: 20-March 03
Posts: 289
From: London England
post Nov 27 2005, 06:39 AM
QUOTE(Bragadocchio)
You should place content above menus to have it \"crawled first\"


Is it helpful in the slightest ~B?

I saw the word "should", which I can't disagree that giving some the advice of you "should" put the content above the menus, but I read here at Cre8 it is good to get the content as high up the crawl as possible. Is that just a myth please?
Offline Go to the top of the page

Moderator/Blog Editor

Group Icon
Group: Site Admin
Joined: 18-January 05
Posts: 5,375
From: Olympia WA, USA
post Nov 27 2005, 06:44 AM
QUOTE(Barry)
I don't think anyone of us would argue that 'The keywords metatag is important in SEO' is absolutely a myth. However it doesn't take much effort to put in a keywords metatag so, in case things change, many of us do.
Makes sense to me, in a sliding scale sort of a way.

I recently noticed meta descriptions coming up in the SE result blurbs for a few blogs. It caught me a bit by surprise, because you don't see a straight quote from metas very often any more. Not world peace, just a reminder to go ahead and do the basics like a good kid. tongue.gif

Elizabeth
Offline Go to the top of the page

Technical Administrator

Group Icon
Group: Technical Administrators
Joined: 3-February 03
Posts: 3,926
From: Sydney Australia
post Nov 27 2005, 06:45 AM
QUOTE
9. There is a duplicate content penalty

Ah, a semantic myth. There is actually a Duplicate content filter wink-2.gif
Offline Go to the top of the page

Moderator/Blog Editor

Group Icon
Group: Site Admin
Joined: 18-January 05
Posts: 5,375
From: Olympia WA, USA
post Nov 27 2005, 06:48 AM
QUOTE(projectphp)
Quote:
9. There is a duplicate content penalty

Ah, a semantic myth. There is actually a Duplicate content filter wink.gif
LOL...
Offline Go to the top of the page

Star Member

Group Icon
Group: 1000 Post Club
Joined: 18-November 05
Posts: 1,392
From: GMT+1
post Nov 27 2005, 07:00 AM
QUOTE(AbleReach)
High rankings without something behind them would be like putting up a Very Big Advertisement without further mission.

If optimizing for specific keyword phrases was \"the\" goal, yoiy, the mind boggles. Is that myth where gibberish generators come from? That which begat Adsense pages with automatically generated nonsense text?


It's funny: I thought these weren't myths at all. I only started to do SEO as a hobby, because I wanted my first site to rank #1 for a particular phrase. I've been ranking #1 with that site for ages now, and it hasn't brought me a single "conversion" (could be because the site is crappy, I know, and because it is in fact a non-commercial site). But just for the sport of it, I wanted to do that.

IMV that's pure SEO, without the SEM and commercial stuff that often come with it.
Offline Go to the top of the page
Fast ReplyReply to this topic Start new topic
9 Pages V  1 2 3 > » 
1 User(s) are reading this topic (1 Guests and 0 Anonymous Users)
0 Members:
Jump to Forum:
 
Lo-Fi Version Time is now: 9th February 2010 - 06:34 PM
Meet our Moderators: cre8pc : projectphp : sanity : Black Phoenix : bwelford : EGOL : Ruud : rustybrick : AbleReach : swainzy : joedolson: eKstreme: dazzlindonna : SEOigloo: iamlost : RisaBB
Cre8asite RSS Feed