Q and A: Is re-writing content from another blog a legitimate SEO tactic?

Hello Kalena

My sister has just hired an SEO company based in the Philippines to provide weekly content for her company blog. As I’m a bit more web-savvy than she is, she asked me to look over their service outline just to be sure she made the right decision.

Problem is, this “Google optimized content” they are providing seems to consist of copying popular blog posts from other sites in the same industry (women’s health and beauty) and re-writing them in a slightly different way before publishing. I don’t know a lot about SEO, but I am sceptical that Google would approve it. Besides the SEO consideration, this tactic just doesn’t sit right with me.

Is this a legitimate SEO tactic or could it harm my sister’s site in any way?

Thank you

Leon

————————————-

Hi Leon

You are absolutely right to be sceptical. By the sound of things, this *SEO* firm employs a technique called site scraping – where the content of other sites is copied or “scraped” and either republished unchanged on a different site, or re-written slightly and THEN republished.

Long term readers of this blog might recall my hilarious battle with site scrapers in the past and the revenge I took on them. I’ve got no problem outing site scrapers, especially when all attempts at communication have been ignored. Their tactics are not only unprofessional, but go directly against Google’s published Webmaster Guidelines.

Take BrainySEO for example. This “blog” (run by some clown called Mayank Jain in Singapore) blatantly scrapes the content of hundreds of blogs across the net, including mine. What’s hilarious is that the scraped content is run through some bizarre automated plagiarist thesaurus (I’m guessing Babel Fish) to translate it into a slightly different version of the same content as a way to avoid Google’s duplicate content filters. It is then published on servers based in the UK.

Compare these two posts:

1) My Fast Five post from week 39 (original)

2) BrainySEO’s scraped Babel Fish version (scraped)

The second (scraped) version reads like a drunk Aunty.

The service that your sister has signed up for sounds suspiciously similar. As Google reiterates in its Quality Guidelines:

“Scraped content will not provide any added value to your users without additional useful services or content provided by your site; it may also constitute copyright infringement in some cases”.

Typically, Google and other engines will ignore or filter scraped content out of the search results for target search terms. But that’s not the only negative impact it can have.

Sites like ScamAudit.com provide a rudimentary way of measuring the trustworthiness of sites and, fittingly, BrainySEO is rated as *seems suspicious*.

So my prediction is that, at best, the content your sister pays for will be worthless. At worst, it may damage the reputation of her business and the trust of her customers.

My advice is that she should sever the contract immediately, perhaps citing Google’s Quality Guidelines as her justification.

Let us know what happens!

——————————————————————-

Need to learn more about legitimate SEO tactics but not sure where to start? Access your Free SEO Lessons. No catch!

 


Q and A: Will Google penalize me for redirecting my old site to my new site with duplicate content?

Hello Kalena

I have a current subdomain webpage that is ranking on page 12 of the Google SERPs. I just bought a new domain name and created a new website with pretty much duplicate content so I could use that as my prime domain. What I did was redirect my subdomain to the new prime URL.

My new site has been indexed, but not yet ranked by Google. I intend to delete the sub-domain page as soon as the new page starts appearing in the SERPs. My question is, because of the duplicate content, is Google going to ban me for this?

Thanks,
Paul

————————————-

Hi Paul

Ah yes, the old hosted sub-domain ranking conundrum.

For the benefit of readers who might not understand your situation, here’s a brief explanation. Paul’s current website is free-hosted on a sub-domain provided by his hosting company. For example, instead of having his site at www.PaulsPlace.com, it’s currently at PaulsPlace.hostingplace.com. This means that any links pointing to his site contribute to the hosting site’s link popularity and not his own. It also means that he is helping his hosting company to rank better in search engines, rather than his own brand and content.

To avoid this, Paul has done the right thing: he purchased his own domain name, transferred all his site content over to the new domain, and then put an automatic signpost on his old sub-domain site that redirects people to the new domain whenever they hit the old site or click on one of its links in search engine results.

Paul, provided you used a 301 redirect on your sub-domain, there shouldn’t be any problem at all with duplicate content. In fact, this is the recommended process to use, according to Google. Just don’t forget to remove the redirect (and dump your old site) once you see your pages start to appear in the search results. You can hurry this along by creating an XML sitemap for the new site and uploading it to Google via Webmaster Tools.
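For readers wondering what that looks like in practice: if the old sub-domain host supports .htaccess, a site-wide 301 can be done with a couple of Apache mod_rewrite lines. This is only a sketch using the example domains above; your hosting company may require a different method, such as a redirect setting in its control panel.

# .htaccess on the old sub-domain site (sketch only)
RewriteEngine On
# Permanently (301) redirect every request to the same path on the new domain
RewriteRule ^(.*)$ http://www.PaulsPlace.com/$1 [R=301,L]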

Hope this helps.

——————————————————————-

Need to learn SEO but not sure where to start? Access your Free SEO Lessons. No catch!

 


Q and A: Do I need to use rel=canonical to tell Google my preferred domain?

Hello Kalena

I’ve been a reader of your blog for many years but have never submitted a question. Until now!

With Google’s recent changes to the algorithm, we have noticed a drop in traffic and rankings for our site (we sell ready-made crafting kits for kids). I suspect it might be related to duplicate content as I’ve been reading how Google will penalize sites that can be loaded with www and also without www. Our site loads for both addresses and I’m worried this means we have been penalized.

I also read that you can fix this issue by using coding called rel=canonical or something like that? I have looked into this briefly, but to be honest, although I’m responsible for the content of our site, I’m a sales and marketing person, not a programmer and I don’t think I have the coding knowledge to use this tool.

Is there a simpler way I can remove the duplicate pages or have our site load just with the www? Or will I need to pay our original web designers to fix this?

Thanks for any advice

Sally

————————————-

Hello Sally

Sorry to hear of your traffic drop, but I highly doubt it is due to your site loading for both www and non-www versions of your domain. The algorithm changes over the past 18 months have been related to more complex issues than domain versions.

Even if Google has indexed both versions of your domain, the algorithm is almost always able to recognise that content loading at both addresses is one and the same. In this situation, Google will usually choose one version and consistently show it in the search results.

But if you want to instruct Google which version to use in the search results, you can do this from within your Webmaster Tools account by setting the Preferred Domain (sometimes this is referred to as the canonical domain). The Preferred Domain tool enables you to tell Google if you’d like URLs from your site crawled and indexed using the www version of the domain (http://www.example.com) or the non-www version of the domain (http://example.com).

Simply click on the gear icon at the top right when viewing your Webmaster Tools dashboard and then choose *Site Settings* and the Preferred Domain option will come up as per the image here:

[Screenshot: setting the Preferred Domain under Site Settings in Webmaster Tools]
The recommended use of rel=canonical is on a page by page basis, to indicate to Google which version of a page URL to use, if there are several URLs leading to the same page content.

For example, imagine if these URLs all led to the same page content:

1) http://www.blog.com/blue-suede-shoes/
2) http://www.blog.com/blue-suede-shoes&id=72
3) http://www.blog.com/?p=12890

Now imagine that you only wanted 1) to be shown in Google search results. You could achieve this by adding the rel=canonical link element to the <head> tag of each of those pages, specifying http://www.blog.com/blue-suede-shoes/ as the preferred URL.
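For the curious, the link element itself is a single line placed in the <head> of each duplicate page. This is a sketch only, using the hypothetical blog.com URLs from the example:

<head>
  <!-- other head elements -->
  <link rel="canonical" href="http://www.blog.com/blue-suede-shoes/" />
</head>

Pages 2) and 3) would each carry that same line, telling Google to consolidate them under URL 1).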

However, in your situation, the easiest thing would be to use the Preferred Domain tool in Webmaster Tools.

Hope this helps!

——————————————————————-

Need to learn SEO but not sure where to start? Access your Free SEO Lessons. No catch!

 


SMX Sydney 2012 – Anne Kennedy – Duplication, syndication & all that other mess

This is a summary of Anne Kennedy’s presentation at the Search Marketing Expo / Online Marketer Conference held in Sydney, 1-2 May 2012.
[Photo: Anne Kennedy at SMX Sydney 2012]

Anne Kennedy has co-authored the first book on international SEO and PPC, called Global Search Engine Marketing. Anne provides search engine consulting to hundreds of companies worldwide and formed an international online marketing consortium with Nordic eMarketing in Reykjavik, London, Stockholm, Rome and Beijing.

Duplicate content happens, says Anne. URL duplication is a big one. This is where you see several different versions of the same page being indexed and/or linked to. For example:

– http://www.site.com
– http://site.com
– http://www.site.com/index.shtml
– http://site.com/index.shtml

and so on.

You should always use the rel=canonical tag to point duplicate versions of a page at the canonical version, and also let Google know in Webmaster Tools which version of your pages to index.

Anne says to watch your crawl budget. Your crawl budget is the percentage of your site that Googlebot will crawl. Googlebot rarely crawls your entire site, so keep your low-quality pages out of the index by excluding them from your sitemap and blocking them using robots.txt.
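As a rough illustration (the directory names below are made up, not part of Anne’s presentation), keeping low-value sections out of the crawl via robots.txt looks like this:

User-agent: *
# Hypothetical low-value sections you don't want crawled
Disallow: /print/
Disallow: /internal-search/
Disallow: /tag/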

Common Duplication Causes

A very common duplicate content mistake is to have printer-friendly versions of your content. Whatever you do, lose the print-friendly versions from your sitemap!

Use 301 redirects on your pages, but only when necessary because not all link value will transfer to your replacement pages. PageRank will not transfer 100 percent over to pages if you 301 redirect them – keep that in mind.

Think about using a separate XML feed for your product pages, says Anne. Separate out your e-commerce or product-specific pages from your main sitemap and create a sitemap just for them. Upload the two sitemaps separately in your Google Webmaster Tools account.
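One way to organise this (a sketch with placeholder file names and URLs) is a sitemap index that references a main sitemap and a product-only sitemap; alternatively, just submit the two files separately in Webmaster Tools:

<?xml version="1.0" encoding="UTF-8"?>
<sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <sitemap>
    <loc>http://www.example.com/sitemap-main.xml</loc>
  </sitemap>
  <sitemap>
    <loc>http://www.example.com/sitemap-products.xml</loc>
  </sitemap>
</sitemapindex>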

Content syndication and site scraping can cause duplicate content headaches. If you are an article syndicator or blogger, make sure you link back to the original article with the title in the anchor text within the article body, not the footer, because some syndication sites strip links out of footers. Require syndicators to use the canonical URL version, or require a noindex (exclusion) of the article link in their robots.txt. This will ensure Google finds the original article more easily.

Another trick is to give syndicators a logo or image to go with the article, wrapped in a link to your article and with the article title in the alt tag of the logo/image. Syndicators will often miss those.
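The embed code you hand to syndicators might look something like this (the URLs and article title are hypothetical):

<a href="http://www.yourblog.com/original-article/">
  <img src="http://www.yourblog.com/images/logo.png"
       alt="Original article: Your Article Title Here" />
</a>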

Be sure to update your XML sitemap immediately whenever you publish a new article or blog post; WordPress plugins can update your sitemap automatically for you.

If your article is out of date or no longer accurate and you want it gone from the SERPs for good, use a 410 status code to tell Google the article is GONE. This is a more permanent signal than a 404.
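On an Apache server, for example, a single mod_alias line in .htaccess will return a 410 for a retired URL (the path below is a placeholder):

# Tell crawlers this article is gone for good (410), not just temporarily missing (404)
Redirect gone /retired-article/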

Don’t put your international content on your English TLD. If you want your content to rank well in a particular international market, you should put the content on a related TLD, e.g. a German-language site should sit on site.de or, at the very least, de.site.com. Your international content will rank better in regional markets if you have links pointing to it from related TLDs, e.g. site.de will rank better in Google.de if it has plenty of .de sites linking to it.

And finally – don’t leave it up to the bots! Take control of your content.


SMX Sydney 2010: Duplicate Content

Up now is Todd Friesen of Position Tech to talk about duplicate content.

Todd is a *reformed* black hat spammer. Duplicate content was standard practice back in those days when he used to do SEO for Viagra and other pharma websites. Build 100 websites and slap up duplicate content.

Todd makes the point that duplicate content can happen easily, even with your home page. He showed 5 or 6 examples of homepage and web server configuration issues. Google *may* decide on the right version of the home page, but you should really tell them which one if you can.

Rel=canonical / 301 redirect = your friend to solve this issue

Faceted Navigation

– products exist in many categories e.g. Tigerproducts.com uses it, Dell uses it

– categories are flexible and unordered

– the result is crazy amounts of duplicate content

– the problem: the website ends up with 200K products but 4 million URLs

No need to worry, Todd says. Create a directory structure that encourages Googlebot to come in via a specific path, but block all the dupe pages out of Google’s index.
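A rough sketch of that approach in robots.txt (the paths and parameter names here are made up) might look like this:

User-agent: *
# Let Googlebot in via the clean category structure
Allow: /products/
# Keep the faceted duplicates generated by sort and filter parameters out
Disallow: /*?sort=
Disallow: /*?colour=
Disallow: /*&filter=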

Regional Domains

– AU, US, CA, UK etc.

– country.example.com

– example.com/country

– example.com/index.htm?lang=en

– country specific TLDs

This is easy to resolve, says Todd. You can use rel=canonical or simply log in to Google Webmaster Tools and tell Google what country your domain, sub-domain, or folder is associated with. Do this!

Multiple Sites and Microsites

– Keyword Domains. Bleurgh

– Why are you doing this?

– Stop it. Stop it now.

– Consolidate your sites and your effort (and this will concentrate all your link popularity to one site as well).

– Actually, Bing likes microsites. If you have to do it, do it for Bing. They love it.
