I’m working on a CFM database driven site and Google thinks we have hundreds of duplicate title tags and descriptions because pages on the site can be accessed using the normal page # and/or the page # plus navigation query strings.
Example: (these 3 urls all go to the same page and Google is logging them as 3 different pages in my Google Webmaster View)
To avoid a duplicate content penalty I cleaned up my sitemap.xml to only include the page # with no query strings. (Example = body.cfm?id=19â€Ž) In my robots.txt file I’ve also added the disallow code to block any file with ‘TopId’ in the url. I’m hoping this will help…have you experienced this type of problem before?
Your question was the source of some debate over here, so thanks for bringing it up! There is a question as to whether Google will actually index pages with session ID’s, and the general thinking is no, so you may be in the clear.
You seem to be handling the problem of duplicate content with database driven sites well, however. It’s best to pick one of the URL’s to include in your site map. You can also be sure NOT to link to any of these pages with duplicate content, from within your site. If you do need to link to these pages, be sure to use “no follow” tags on your link.
Best of luck, Nick Loeser