Duplicate content FAQ: What is it, and how should you deal with it?

There are a few questions that have been confusing the SEO industry for many years. No matter how many times Google representatives try to clear the confusion, some myths persist.

One such question is the widely discussed issue of duplicate content. What is it, are you being penalized for it, and how can you avoid it?

Let’s try to clear up some of the confusion by answering some frequently asked (or frequently wondered) questions about duplicate content.

How can you diagnose a duplicate content penalty?

It’s funny how some of the readers of this article are rolling their eyes right now reading the first subheading. But let’s deal with this myth first thing.

There is no duplicate content penalty. None of Google’s representatives has ever confirmed the existence of such a penalty; there have been no algorithmic updates called “duplicate content”; and there can never be such a penalty, because in the overwhelming majority of cases duplicate content is a natural thing with no evil intent behind it. We know that, and Google knows that.

Still, lots of SEO experts keep “diagnosing” a duplicate content “penalty” when they analyze every other website.

Duplicate content is often mentioned in conjunction with updates like Panda and Fred, but in those cases it is used to identify bigger issues, namely thin or spammy (“spun”, auto-generated, etc.) content and stolen (scraped) content.

Unless you have one of those issues, a few instances of duplicate content throughout your site cannot cause an isolated penalty.

Google keeps urging website owners to focus on high-quality expert content, which is your safest bet for avoiding having your pages flagged for thin content.

You do want to handle your article republishing strategy carefully, because you don’t want to confuse Google when it comes to finding the actual source of the content. You don’t want to have your site pages filtered when you republish your article on an authoritative blog. But if it does happen, chances are, it will not reflect on how Google treats your overall site.

In short, duplicate content is a filter, not a penalty, meaning that Google has to choose one of the URLs with non-original content and filter out the rest.

So should I just stop worrying about internal duplicate content then?

In short, no. It’s like a recurring headache that you don’t want to ignore: a headache is not a disease on its own, but it may be a symptom of a more serious condition, so you want to rule such conditions out, or treat them if there are any.

Duplicate content may signal some structural issues within your site, preventing Google from understanding what they should rank and what matters most on your site. And generally, while Google is getting much better at understanding how to handle different instances of the same content within your site, you still don’t want to ever confuse Google.

Internal duplicate content may signal a lack of original content on your site too, which is another problem you’ll need to deal with.

Google wants original content in their SERPs for obvious reasons: They don’t want their users to land on the same content over and over again. That’s a bad user experience. So Google will have to figure out which non-unique pages they want to show to their users and which ones to hide.

That’s where a problem can occur: the more pages on your site have original content, the more positions in Google they can hold across different search queries. Pages that get filtered out as duplicates don’t add to that.

If you want to know whether your site has any internal duplicate content issues, try using tools like SE Ranking, which crawls your website and checks whether there are any URLs with duplicate content that Google may be confused about.
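
To make the idea of such a check concrete, here is a minimal sketch of how exact internal duplicates could be spotted. It is not how SE Ranking or Google works; the `pages` dictionary, the URLs and the helper names are hypothetical, and the approach only catches pages whose text is identical after whitespace and case are normalized (near-duplicates would need a fuzzier comparison).

```python
import hashlib
import re
from collections import defaultdict


def fingerprint(text: str) -> str:
    """Hash the text after collapsing whitespace and lowercasing it."""
    normalized = re.sub(r"\s+", " ", text).strip().lower()
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def find_duplicate_groups(pages: dict[str, str]) -> list[list[str]]:
    """Return sorted lists of URLs that share the same text fingerprint."""
    groups = defaultdict(list)
    for url, text in pages.items():
        groups[fingerprint(text)].append(url)
    # Only fingerprints shared by more than one URL are duplicates.
    return [sorted(urls) for urls in groups.values() if len(urls) > 1]


# Hypothetical crawl output: URL -> extracted body text.
pages = {
    "https://example.com/shoes": "Red running shoes, size 42.",
    "https://example.com/shoes?sort=price": "Red  running shoes,\nsize 42.",
    "https://example.com/about": "About our shop.",
}

duplicate_groups = find_duplicate_groups(pages)
print(duplicate_groups)
```

In this made-up example the two `/shoes` URLs end up in one group, which is the typical shape of an internal duplicate: the same page reachable with and without a URL parameter.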

How does Google choose which non-original URLs to rank and which to filter out?

You’d think Google would want to choose the more authoritative post (based on various signals including backlinks), and they probably do.

But they also choose the shorter URL when they find two or more pages with identical content.
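
The "filter, not penalty" behavior can be illustrated with a toy sketch: given a group of URLs with identical content, keep one and filter out the rest. This is only an illustration of the shorter-URL preference described above, not Google's actual selection logic, which likely weighs other signals too. The function names and URLs are hypothetical.

```python
def pick_representative(urls: list[str]) -> str:
    """Pick the shortest URL; ties are broken alphabetically."""
    return min(urls, key=lambda u: (len(u), u))


def filter_duplicates(urls: list[str]) -> tuple[str, list[str]]:
    """Split a duplicate group into the URL to keep and the URLs filtered out."""
    keep = pick_representative(urls)
    filtered = [u for u in urls if u != keep]
    return keep, filtered


# Hypothetical group of URLs that all serve the same article.
group = [
    "https://example.com/blog/2017/11/duplicate-content-faq",
    "https://example.com/duplicate-content-faq",
    "https://example.com/duplicate-content-faq?ref=home",
]

keep, filtered = filter_duplicates(group)
print("shown:", keep)
print("filtered out:", filtered)
```

Note that nothing here lowers the standing of the site as a whole: the filtered URLs simply don't appear next to the one that was chosen.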

How about international websites? Can translated content pose a duplicate content issue?

This question was addressed by Matt Cutts back in 2011. In short, translated content doesn’t pose any duplicate content issues even if it’s translated very closely to the original.

There’s one word of warning though: Don’t publish automated translations made with tools like Google Translate, because Google is very good at identifying those. If you do, you run the risk of having your content labeled as spammy.

Use real translators, whom you can find on platforms like Fiverr, Upwork and Preply. You can find high-quality translators and native speakers there on a low budget.

Look for native speakers in your target language who can also understand your base language

You are also advised to use the hreflang attribute to point Google to the actual language you are using on a regional version of your website.
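
As a rough sketch of what that looks like in practice, the snippet below builds the `<link rel="alternate" hreflang="...">` tags for a set of language or regional versions. The domains, language codes and the `hreflang_links` helper are made up for illustration; the optional `x-default` entry marks the fallback version for visitors who match none of the listed languages.

```python
from html import escape
from typing import Optional


def hreflang_links(versions: dict[str, str], default_url: Optional[str] = None) -> list[str]:
    """Build one <link rel="alternate"> tag per language/region version."""
    tags = [
        f'<link rel="alternate" hreflang="{escape(code)}" href="{escape(url)}" />'
        for code, url in sorted(versions.items())
    ]
    if default_url is not None:
        # Fallback for users whose language or region is not listed.
        tags.append(
            f'<link rel="alternate" hreflang="x-default" href="{escape(default_url)}" />'
        )
    return tags


# Hypothetical regional versions of the same page.
versions = {
    "en-us": "https://example.com/",
    "en-gb": "https://example.co.uk/",
    "de": "https://example.de/",
}

links = hreflang_links(versions, default_url="https://example.com/")
print("\n".join(links))
```

The same set of tags would normally go into the `<head>` of every version, so that each page lists itself and all of its alternates.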

How about different versions of the website across different localized domains?

This can be tricky, because it’s not easy to come up with completely different content when putting up two different websites with the same products for the US and the UK, for example. But you still don’t want to leave it to Google to choose which version to show.

Two workarounds:

  • Focus on local traditions, jargon, history, etc. whenever possible
  • Choose the country you want to focus on from within Search Console for all localized domains except .com.

There’s another old video from Matt Cutts which explains this issue and the solution.

Are there any other duplicate-content-related questions you’d like to be covered? Please comment below!

Ann Smarty is Brand and Community manager at InternetMarketingNinjas.com and a contributor to Search Engine Watch.

Source: searchenginewatch.com