The pdf files also have a small amount of text and a clickable link to the homepage of my website. They are also optimized to display an SEOed title tag and rank well in the SERPs. Lots of people have linked to these pdfs.
I think that these pdfs are causing a duplicate content problem or a trivial content problem. To solve that I need to use rel=canonical in a way that attributes them back to the html page that the visitor visits to download them. Unfortunately there is no way to place an rel=canonical in a .pdf document (or I don't know of any way to do it). So I am going to fix this by htaccess following instructions from SearchEnginePeople.
http://www.searcheng...ccess-file.html
My htaccess lines will look as follows
<FilesMatch "brass-widget-1.pdf">
Header set Link '< http://www.mysite.co...rel="canonical"'
</FilesMatch>
================================
I think that the above will work fine, but I have lots of files and want to use wildcards in the .htaccess such as this...
<FilesMatch "brass-*.pdf">
Header set Link '< http://www.mysite.co...rel="canonical"'
</FilesMatch>
================================
I have two questions....
1) Do you think that my method of dealing with this is the right method..... and
2) Do you know if my use of the wildcard in the .htaccess is correct?
Thank you!






