DeepCrawl Integration: Google Analytics

Adam Gent
Adam Gent

On 11th July 2019 • 8 min read

DeepCrawl allows users to integrate their websites Google Analytics and web crawl data, to gain unique insights and identify issues with traffic driving pages.
 

 

What is Google Analytics?

Google Analytics is a web analytics service offered by Google that tracks and reports website traffic, goals and user behaviour. It is a free tool for websites owners but is also available as a paid for service called Google Analytics 360.
 

Why integrate Google Analytics with DeepCrawl?

Integrating Google Analytics data with DeepCrawl allows users to further enrich their reports. DeepCrawl users will be able to:

To get these special reports Google Analytics needs to be added as a crawl source to a user's DeepCrawl project.
 

How to set up Google Analytics in DeepCrawl

There are two ways a user can connect to a Google Account in DeepCrawl, through the Connected App page or the DeepCrawl property settings in a Project in the Sources step.

This guide mentions views, accounts and properties in Google Analytics. To find out more about the Google Analytics hierarchy read the following official guide.

Connect Apps method

1. Navigate to the Connected Apps page.

2. Click on the “ADD GOOGLE ACCOUNT” button.

3. Login to your Google Account which has the Google Analytics profiles you want to include.

4. In the Connected Apps page, users can manage what Google Accounts are connected in DeepCrawl.

5. Once a Google account is connected, navigate to the Sources settings (step 2) in the project you want to include Google Analytics data.

6. Choose the Google Analytics Source (a green tick will appear when selected).

7. Select the Google Analytics view you want to use in project crawls, you can use the internal search function to quickly find a view.

8. Once selected, the Google Analytics view will appear on the right of the list.

9. That’s it, DeepCrawl will now fetch URLs in the selected Google Analytics view during the crawl.

DeepCrawl property settings:

1. In the Sources settings in step 2 scroll down to the Google Analytics Source and choose the Source (a green tick will appear when selected).

2. Click on the “ADD GOOGLE ACCOUNT” button.

3. Login to your Google Account which has the Google Analytics views you want to include in crawls.

4. Select the Google Analytics profile you want to use in project crawls.

5. Once selected the Google Analytics view will appear on the right of the list.

6. That’s it, DeepCrawl will now fetch URLs found in Google Analytics during the crawl.
 

Configure Google Analytics settings

Once Google Analytics is integrated, DeepCrawl allows users to filter the requested landing page URLs from the API.

To filter the data, simply select the date range and minimum number of organic search visits to a URL.

The following filters can be used to alter the URLs which are extracted by DeepCrawl:

1. Date range: Select a date range (7, 14, 30, 60, 90, 180 and 365 days).

2. Minimum visits: Group data by minimum number of organic search visits to a page.

By default the date range for Google Analytics data is 90 days.
 

Google Analytics metrics extracted

The following metrics are extracted from Google Analytics:

Google Analytics allows you to create custom metrics (goals, revenue, etc.), however, DeepCrawl will only extract default metrics.
 

Frequently asked questions

How does DeepCrawl choose the protocol and hostname for the URLs in Google Analytics?

DeepCrawl uses the primary domain in the project settings (step 1) when fetching URLs from the Google Analytics API.

For example if the primary domain was https://example.com then DeepCrawl would append the primary domain to the relative URls fetched from the Google Analytics API:

If the primary domain inputted into the project settings is not the canonical domain, then DeepCrawl will still crawl these URLs and they will be reported in whatever state they are in when crawled by DeepCrawl (most likely redirected or canonicalised if not the preferred domain).

How does Google Analytics work with DeepCrawl?

DeepCrawl adds Google Analytics data to a project/crawl by:

1. A client connects to their Google account in DeepCrawl.

2. A user then chooses the Google Analytics profile they want to include in a crawl.

3. DeepCrawl sends a request to the Google Analytics API.

4. Google Analytics API accepts the DeepCrawl request.

5. DeepCrawl uses the primary domain in the project settings when fetching URL data.

6. DeepCrawl crawls URLs found in the Google Analytics data.

7. DeepCrawl saves certain metrics for each URL found in Google Analytics (see "Google Analytics metrics extracted").

8. All Google Analytics data is saved and pulled into a clients crawl and reported on.

The client can’t see the Google Analytics profile they want to use in a crawl?

This is usually due to the Google Account which has been connected. If the Google Analytics profile has not been set up in the connected Google account then it will not appear in the list.

How can the Google Analytics data be extracted?

The Google Analytics data can be extracted by visiting the Pages in Analytics report.

All the Google Analytics data crawled by DeepCrawl can be exported to a CSV with metadata and other on-page SEO signals.

The HTTP not HTTPS in the Google Analytics profile list is in DeepCrawl, does this matter?

DeepCrawl uses the protocol and domain name from the primary domain settings in the project. As long as the client is selecting the correct Google Analytics profile then this shouldn’t matter if they select the HTTP profile.

Is it possible to add multiple Google Accounts to DeepCrawl?

Yes, DeepCrawl allows multiple Google Accounts to be added. Also, any profiles in Google Analytics will be consolidated into one list in the Sources settings.

For example all the Google Analytics profiles under seo@gmail.com and marketing@gmail.com would be consolidated into one list in the Analytics Source settings in a project set up.

Can DeepCrawl help with Google Analytics line limits in 360?

No, DeepCrawl will not help in this situation. Google Analytics API will still use the same sampling thresholds as in the UI. For more information around line limits and sampling thresholds please read the following guide.
 

Any questions about Google Analytics and DeepCrawl?

If you have any further questions about Google Analytics and DeepCrawl don’t hesitate to get in touch.

Author

Adam Gent
Adam Gent

Search Engine Optimisation (SEO) professional with over 8 years’ experience in the search marketing industry. I have worked with a range of client campaigns over the years, from small and medium-sized enterprises to FTSE 100 global high-street brands.

Get the knowledge and inspiration you need to build a profitable business - straight to your inbox.

Subscribe today