Pdf Files And Pagerank
Posted 18 January 2008 - 01:41 PM
First, we know that Google will index PDF documents:
So if it indexes those documents...does it:
1. Follow the links within that PDF?
2. Pass PageRank juice through that PDF?
Just some random things to think about
Posted 18 January 2008 - 01:56 PM
Great question, though.
Posted 18 January 2008 - 02:16 PM
If you look at the source code for a PDF document it is not at all like a text file. So Google will have to be doing extra workarounds to grab the content. How much would they do? Only they know.
Posted 18 January 2008 - 02:23 PM
Sounds like a good test is needed. Anyone got a higher PR site that would link to a pdf document that links to a page that has no other links to it and see if that page gets indexed or not?
Posted 18 January 2008 - 02:34 PM
One other thing that I have noticed is that these documents are not indexed very regularly - which is natural - in that G probably guesses that pdf documents don't get updated too often.
They also index word files i.e rdf documents, code and many other things. Nothing escapes!
Posted 18 January 2008 - 02:43 PM
Posted 18 January 2008 - 03:07 PM
Posted 18 January 2008 - 03:16 PM
I went into Google Webmaster Tools and downloaded my backlink files. In it I see that google recognizes links to my site from various document types on many other domains.
I see lots of links from .pdf (Acrobat files), .xls (Excel files), .doc (Word documents), .ppt (PowerPoint presentations)
I publish files of all of these types on my own site. In each one I include a link back to my homepage. I believe that I get linkjuice from these when other domains link to them. Why not if google indexes the document and spiders the link to my site?
Posted 18 January 2008 - 04:00 PM
I believe that a couple of years ago Matt Cutts said (sorry unable to find reference) that Google can/may treat plain text links as backlinks. If a URL, linked or not, is treated as a backlink there is no reason for it to be unable to pass PR - but that is just my opinion.
The usual location for certain filetypes, i.e. pdf, is in some deadend repository receiving limited backlinks (there are notable exceptions), with very diluted trickledown PR and uncertain onpage factor oomph to add to none to few out-links (which does indicate a certain potential...).
0 user(s) are reading this topic
0 members, 0 guests, 0 anonymous users