For web pages to appear in search results, they must be in Google’s index. Search engine indexing is a complex topic that depends on a number of different factors. Our Hangout Notes on indexing cover a range of best practice advice to help ensure your website’s important pages are indexed by search engines.

Indexing Paginated and Search Results Pages

March 27, 2015 Source

Search results pages can be made indexable. Including only the first page of a paginated set in the index is an option, provided that all the detail/product pages can still be reached.

Canonicalise Product Variants

December 23, 2014 Source

A discussion of when to canonicalise pages to other pages, e.g. for colour variations of product pages.
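As a rough illustration of canonicalising colour variants, the sketch below derives one canonical URL per product by dropping variant-only query parameters. The parameter names (`colour`, `color`) and the example.com URLs are assumptions for illustration, not something taken from the Hangout; whether a variant should be canonicalised at all depends on how different the pages really are.

```python
from html import escape
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit

# Hypothetical: query parameters that only select a variant of the same product.
VARIANT_PARAMS = {"colour", "color"}


def canonical_url(url, variant_params=VARIANT_PARAMS):
    """Return the URL with variant-only query parameters (and any fragment) removed."""
    parts = urlsplit(url)
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key.lower() not in variant_params
    ]
    return urlunsplit((parts.scheme, parts.netloc, parts.path, urlencode(kept), ""))


def canonical_link_tag(url):
    """Build the <link rel="canonical"> element to place in the variant page's <head>."""
    return '<link rel="canonical" href="%s">' % escape(canonical_url(url), quote=True)
```

For example, `canonical_link_tag("https://example.com/shirt?colour=red")` would point the red variant at `https://example.com/shirt`.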

Break XML Sitemaps into Small Chunks

December 23, 2014 Source

Breaking up XML Sitemaps into smaller groups can give you more feedback on indexing issues, which are reported separately for each Sitemap in Webmaster Tools.
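A minimal sketch of splitting one large URL list into several small Sitemaps plus a Sitemap index file is shown below. The chunk size of 1,000, the file naming scheme, and the example.com base URL are assumptions chosen for illustration; the Sitemap protocol itself allows up to 50,000 URLs per file.

```python
from xml.sax.saxutils import escape

SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def chunk(urls, size):
    """Yield consecutive slices of at most `size` URLs."""
    for start in range(0, len(urls), size):
        yield urls[start:start + size]


def build_sitemap(urls):
    """Render a single Sitemap file as an XML string."""
    entries = "".join("  <url><loc>%s</loc></url>\n" % escape(u) for u in urls)
    return ('<?xml version="1.0" encoding="UTF-8"?>\n'
            '<urlset xmlns="%s">\n%s</urlset>\n' % (SITEMAP_NS, entries))


def build_sitemaps(urls, size=1000):
    """Return {filename: xml} with at most `size` URLs per Sitemap (size is an assumed value)."""
    return {
        "sitemap-%d.xml" % number: build_sitemap(group)
        for number, group in enumerate(chunk(urls, size), start=1)
    }


def build_index(base_url, filenames):
    """Render a Sitemap index file that lists every generated Sitemap."""
    entries = "".join(
        "  <sitemap><loc>%s</loc></sitemap>\n" % escape(base_url.rstrip("/") + "/" + name)
        for name in filenames
    )
    return ('<?xml version="1.0" encoding="UTF-8"?>\n'
            '<sitemapindex xmlns="%s">\n%s</sitemapindex>\n' % (SITEMAP_NS, entries))
```

Grouping URLs by site section or page template rather than by position in the list may make the per-Sitemap indexing figures easier to interpret, since a low indexed count then points at a specific type of page.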

Google Will Rewrite Title Tags

September 26, 2014 Source

Sometimes Google will rewrite your title tags if they contain a lot of irrelevant keywords (keyword stuffing), if they are heavily duplicated, or if they are too long to be displayed.

Canonicalised Pages Stay in Google’s Index

August 29, 2014 Source

Canonicalised pages may continue to show as indexed in site: searches, depending on the ‘site structure’. A canonical tag is not treated as strictly as a redirect, and the canonicalised page can still surface for its unique content. Canonical URLs are also not crawled immediately, as a redirect target would be. John suggests that if you have a large number of incorrect canonical tags, such as many pages canonicalising to a single page, Google might ignore all canonical tags across the site. Google clearly recommends cleaning up broken canonical tags.
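One way to look for the pattern described above (many pages canonicalising to a single page) is to count how often each canonical target is used across a crawl. The sketch below assumes you already have a mapping of page URL to the canonical URL declared on that page, for example from a crawler export; the threshold of 50 is an arbitrary illustrative value, not a figure from Google.

```python
from collections import Counter


def suspicious_canonical_targets(canonicals, threshold=50):
    """Return {target_url: count} for targets that many other pages canonicalise to.

    `canonicals` maps each page URL to the canonical URL declared on it.
    Self-referencing canonicals are ignored. `threshold` is an assumed cut-off.
    """
    counts = Counter(
        target for page, target in canonicals.items() if target != page
    )
    return {target: n for target, n in counts.items() if n >= threshold}
```

A flagged target is only a starting point for review: a product page with many colour variants can legitimately receive a large number of canonicals.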
