Why Canonical Tags are Essential

Tristan Pirouz
Tristan Pirouz

On 12th June 2014 • 7 min read

 

Canonical Tags are an Essential Part of Every Site Architecture: Here's Why

Canonical tags are a powerful way to tell Google and other search engines which URLs you want them to index.

They can prevent duplicate content issues if you have different versions of the same page: for example, an original and print version of the same page, session IDs or colour variations of the same product.

 

What is a Canonical URL?

The canonical URL is the primary version of your content. It is the URL that you want to appear in Google’s search results.

The full set of canonical URLs on your site are created using a set of rules to ensure they are consistent. For example, you might decide that your canonical URLs should always end with trailing slash. Or you might decide the canonical URLs should not include any URL parameters.

 

What is a Canonicalised URL?

A canonicalised URL is a page with a canonical tag and a different URL inside its canonical tag (the canonical URL).

By including a different URL in the canonical tag on a page, you are instructing Google to index the canonical URL instead of the page’s URL.

Authority signals collected on the canonicalised URLs are also consolidated to favour the canonical URL.

 

Where and how do I add a canonical tag?

Insert the following tag into the of the page you want to canonicalise.

<link rel=”canonical” href=”http://www.example.com/a-different-page” />

Alternatively you can include them in the HTTP headers.

Link: <http://www.example.com/a-different-page>; rel=”canonical”

 

Adding canonical tags: the rules

In order for canonical tags to work properly, they must be used correctly and consistently:

 

What’s the difference between these and 301 redirects?

A canonical tag is only visible to search engines so it allows the user to remain on the URL, whereas a 301 will redirect users and search engines.

A redirected URL won’t be stored in your analytics, whereas a canonicalised URL will be tracked.

If you want a URL to be accessible to users then you should use canonical tags, otherwise you can redirect.

 

What can go wrong?

Search engines will ignore your canonical tags in the following situations:

Content different on canonical and canonicalised URL:
Google may choose to ignore canonical tags if the canonical URL and the canonicalised URL are different.

Page missing a canonical tag:
All pages should contain a canonical tag to prevent any possible duplication, including on the canonical page.

Canonicalising to the wrong URL:
If the canonical URL is not similar enough to the canonicalised one, then Google will probably ignore it.

Multiple canonical tags:
If the canonical tags on the same page are different, then Google will ignore both.

Canonical loop:
A page canonicalises to a page that canonicalises back.

Unlinked canonical pages:
Most canonical URLs would probably be linked internally at least once because they are usually an important part of the site.

If a canonical URL is not linked directly it may indicate the canonical URL is wrong.

Redirecting canonical URL:
If the canonical URL redirects to another URL, then it can’t be a true canonical URL.

Broken canonical URL:
If the canonical URL isn’t a valid URL then Google will probably just ignore it but it will still waste time, which reduces crawling efficiency.

Empty canonical tag:
Canonical tag does not include a URL.

 

Using canonical tags for mobile

If you have a mobile website on separate URLs, Google recommend that you canonicalise your mobile site to your desktop site to compliment a rel-alternate.

 

Canonical tags for pagination

If you have implemented pagination with a view-all page (typically used for articles broken up into many pages), then Google recommend canonicalising all the paginated parts to the full version.

 

Keep track of your canonical tags with DeepCrawl

DeepCrawl’s three canonical tag reports will show you all of your canonicalised pages, pages without acanonical tag, and unlinked canonical pages.

 

1. Find canonicalised pages

This report will find all pages with URLs that are different to the canonical URL specified in the canonical tag, in either the HTML or HTTP header.

Go to: Indexation > Canonicalised pages.

You will then see a list of all your canonicalised pages, plus their location and the canonical URL:

indexation canonicalised pages DeepCrawl
 

2. Identify pages without a canonical tag

Go to Validation > Pages without Canonical Tag

This view will show an at-a-glance view all of your pages that are missing a canonical tag.

validation pages without canonical tags DeepCrawl
 

3. Find unlinked canonical pages

Go to Validation > Unlinked Canonical Pages

Here you will find a list of all pages found in canonical tags that are not linked:

validation unlinked canonical pages DeepCrawl

DeepCrawl will always follow any canonical URL, so if any of these are broken then you can find them in the other error reports.

Author

Get the knowledge and inspiration you need to build a profitable business - straight to your inbox.

Subscribe today