Stitch Documentation
has moved!

Please update your bookmarks to https://www.stitchdata.com/docs

If you're not automatically redirected after 5 seconds, click here.

Expected Google Analytics Data

Now that Google Analytics is successfully connected to your Stitch account, what comes next? In this article, we'll cover:

Replicating Google Analytics Data

Historical Sync

By default, a historical sync of Google Analytics data goes back 1 year from the Stitch connection date.

Frequency

Stitch will replicate Google Analytics data based on the frequency you define. The default setting is every 30 minutes, but you can change it based on your needs.

Replication Method

While our Google Analytics integration uses Incremental Replication, the last 15 days' worth of data will be replicated every time a replication job runs. This is to account for any updates that may have occurred to existing rows during Google Analytics' 15 day attribution window and includes both updated and not-updated data.

Setting the Replication Frequency to a higher frequency - like 30 minutes - can result in re-replicating recent data and contribute to high row counts. We recommend keeping the 15 day attribution window in mind when setting the Replication Frequency.

Google Analytics Data in Your Data Warehouse

A single table - called report - will be created for each Google Analytics integration. The Metrics and Dimensions you selected during the setup process are the columns that will make up the table.

Let’s say we selected these Metrics and Dimensions during the setup:

  • Metrics: 
    • ga:sessions
    • ga:pageviews
  • Dimensions: 
    • ga:referralPath
    • ga:country

Our table’s schema would look like this:

sessions pageviews country referralpath start_date end_date
1 1 United Kingdom /strategic-investor 2016-07-06 00:00:00 2016-07-06 00:00:00
6 12 United States (not set) 2016-07-06 00:00:00 2016-07-06 00:00:00
1 1 Canada /serial-entrepreneur 2016-07-06 00:00:00 2016-07-06 00:00:00
2 5 Mexico /strategic-investor 2016-07-07 00:00:00 2016-07-07 00:00:00
10 23 United States /serial-entrepreneur 2016-07-07 00:00:00 2016-07-07 00:00:00

In addition to the columns for the Metrics and Dimensions, you’ll notice two additional columns in the table: start_date and end_date.  Google Analytics data is paginated on a daily basis, which means you’ll see one row in the table for each Metric/Dimension combination. These columns tell you the day that particular combination in the row pertains to.

Primary Key

The Primary Key for Google Analytics tables is a composite key made up of the Dimensions you selected and the start_date and end_date columns.

In the table above, the Primary Key would be referralpath:country:start_date:end_date.

Related:

Was this article helpful?
0 out of 0 found this helpful

Comments

Questions or suggestions? If something in our documentation is unclear, let us know in the comments!