Hi Kalena…
I have a question regarding Server Side Includes. Do search engines index them? The navigation menus in my company’s website are SSI files, meaning that their content is not visible in the main index.htm file. Do you reckon search engines ignore the content of the navigation menus in my case?
Marco
Hi Marco
For my readers that aren’t familiar with Server Side Includes, they are a way to tell the server to insert something into a web page before it is sent to the viewer. What is inserted may be the content of a plain text file or it may be the output generated by another program running on the server such as the processing of a form return using a PERL script.
I don’t use SSI much but I was pretty sure the code generated by the SSI would be indexed. To be sure, I asked my SEO homies on Twitter to confirm this and they said:
“They are normally fine, spiders just see the html thats outputted” (thanks Patrick)
“So long as they are generating HTML I can’t see why they would not be indexed” (thanks Dave)
So just to confirm - YES - search engines will index the HTML generated by your Server Side Includes.
Another way to look at it… Take the words “Server Side.” The includes are processed at the server before it is even processed and sent back over the internet. There are two main aspects of the process. Front side (the actual final output in the browser) and back side (what happens at the server).
So… Yes, server side includes are indexed because they are in the final output. This comment is included with a Server Side Include.
No, they will not index the included file itself. You can even make sure aren’t included by using the robots.txt file.
I am heavily involved in the creation of CMS systems during redesign work in respect of clients taking up my SEO services and use SSI to avoid unnecessary duplication of navigation etc. For me it is an essential part of web design and I have found that there are no penalties for using SSI strategies. I agree with Chris Estes and will add that the output HTML does not exist until the page has been requested by a browser or spider so the chances of the code in raw server side format being read are non-existent.