http://www.profitbooks.com/blog/wp-content/plugins/sociofluid/images/digg_32.png http://www.profitbooks.com/blog/wp-content/plugins/sociofluid/images/reddit_32.png http://www.profitbooks.com/blog/wp-content/plugins/sociofluid/images/dzone_32.png http://www.profitbooks.com/blog/wp-content/plugins/sociofluid/images/stumbleupon_32.png http://www.profitbooks.com/blog/wp-content/plugins/sociofluid/images/delicious_32.png http://www.profitbooks.com/blog/wp-content/plugins/sociofluid/images/blinklist_32.png http://www.profitbooks.com/blog/wp-content/plugins/sociofluid/images/blogmarks_32.png http://www.profitbooks.com/blog/wp-content/plugins/sociofluid/images/furl_32.png http://www.profitbooks.com/blog/wp-content/plugins/sociofluid/images/newsvine_32.png http://www.profitbooks.com/blog/wp-content/plugins/sociofluid/images/technorati_32.png http://www.profitbooks.com/blog/wp-content/plugins/sociofluid/images/magnolia_32.png http://www.profitbooks.com/blog/wp-content/plugins/sociofluid/images/google_32.png http://www.profitbooks.com/blog/wp-content/plugins/sociofluid/images/myspace_32.png http://www.profitbooks.com/blog/wp-content/plugins/sociofluid/images/facebook_32.png http://www.profitbooks.com/blog/wp-content/plugins/sociofluid/images/yahoobuzz_32.png

Behind The 8 BallA debate is raging over whether search engines penalized websites for providing duplicate content. Search engines keep their algorithms secret (along with just about everything else they do) better than Alan Greenspan and the Federal Reserve. With all this secrecy the only way to the truth is from observation and a few ‘off the cuff’ quips from search engine principals.

Investigation reveals that Google, AltaVista and others have filed patent applications concerning duplicate content filters. These filings are public record and open for public scrutiny. Examination of the filings shows that the search engines are indeed interested in filtering duplicate pages and penalizing sites that use duplicate content. Traffic studies on this site (www.profitbooks.com) and others clearly show there is not only a page by page duplicate content penalty but also a site wide penalty. Additionally, articles created for redistribution show a search engine duplicate content filter works over a period of a few weeks to remove pages duplicating the the article.

It is worthwhile to note that there are at least two types of duplicate content. There are duplicate content pages that contain similar or duplicate content that appears on other pages of the same website and there duplicate content pages that contain similar or duplicate content that appears on pages of a different website. The former being the considered the most objectionable and often classified as search engine spam. We’ll confine our discussion here to the latter, that is, content pages that are similar to pages on other websites.

Duplicate content pages would include pages of content “scraped” from copyrighted material or other websites, rss, syndicated and other content feeds, the reprinting of articles intended for redistribution and public domain content. Are all forms of duplicate content penalized?

The answer seems to be yes, if you use too much duplicate content on a page. The key to avoiding a duplicate content penalty is to mix the duplicate content with original content on a page and re-organize the content duplicated. Try to use your own page title and description meta tags and keep the duplicated content to no more than 70% of the content on a page.

Next time we’ll discuss some possible ways you can still use articles and other duplicate content without running afoul of the search engines. In the mean time, don’t go overboard filling your website with copied articles, feeds and syndicated content.

More Popular Articles

YNC News

Duplicate Annihilator Is In TheMacBundles September Bundle
Brattoo Propaganda Software today announced that Duplicate Annihilator is part of TheMacBundles along with 14 other great Mac applications. Duplicate Annihilator compares images in your iPhoto library using effective algorithms to make sure that no duplicates escape. When found the duplicate will either be marked with a description of your choice to make it searchable or simply moved to iPhotos


Symantec Reveals Deduplication Appliance, Cloud Storage Service
NetBackup 5000 can burn through 4.3TB of data an hour Symantec has announced a new de duplication appliance based on its existing software product. It also announced a cloud storage service for NetBackup and Backup Exec customers. The company also announced a new version of its archive software, Enterprise Vault 9.0, which combines email from Microsoft Exchange Online with on premise archived


Frontier CEO Pledges Better Service, Stable Pricing
Maggie Wilderotter was in Beaverton on Tuesday to meet with employees and businesses.

Like this post? Subscribe to my RSS feed and get loads more!