While there have been many theses, patents, and whitepapers about content weighting I have seen little directly referencing images. What is particularly pleasing, to me, is that if you have been following webdev best practices with usability and standards in mind you are well on your way to 'doing it right' for the SEs as well. Yes, this patent is Microsofts but the logic is likely similar for all.
Basically there is an image relevance scoring system, i.e. algo input and/or filter, which considers:
1. Image level features (based on the image):
* photograph or CGI (computer generated image)
* person or not
* contrast, colour, blur
* size, dimension ratio
2. Page level features (based upon the relationship of image to page):
* x, y positions relative to page
* size relative to page
* dimensions relative to page dimensions
3. Site level features (based upon the relationship of the image to the site):
* image stored on/off site
* image usage frequency on pages within site.
And a text-image relevance scoring system:
1. metadata such as caption, alt text, anchor text (if image is a link).
2. surrounding text - especially within the same 'container', i.e. td or p or div.
Which all raises some questions such as:
* what about a fluid page vs. fixed page regarding the page-image relative size calculation?
* what about using an large image and sizing down on page via CSS?
* how does reuse frequency affect image value?
Some testing would seem to be in order - on these and several others that simply jumped out and said 'let's game this!'
As images are joined by multimedia the non-text content is going to become increasingly considered by the SEs. Every bit of insight such as this patent - thankfully in Bill's thoughtful 'plain English' translation - is gratefully received - and carefully tested.






