BigQuery: Define Your Own Schema

Prerequisites

You must grant 'bigquery.dataViewer' access to Indicative's service account for your BigQuery project. 

In order to perform the following steps you must have administrative access to the BigQuery console as well as your BigQuery database.

If there are additional enrichments required, such as joining with user property tables or deriving custom user_ids, please contact us.

Instructions

Adding a Data Source In Indicative

  1. In Indicative, click on Settings and select Data Sources

    mceclip0.png

  2. Click on New Data Source

    mceclip1.png

  3. Select Connect via Data Warehouse or Lake
    mceclip0.png
  4. Select BigQuery as your data connection and Define your own schema as the connection schema and click Connect
    mceclip0.png
  5. You should see this BigQuery Overview screen. Click Next
    mceclip1.png

Connection Information

mceclip2.png

  1.  Open the BigQuery console on Google Cloud Platform and Select a project.
  2. Enter the GCP Project ID containing your data.
    mceclip2.png

  3. Enter the Dataset Name
    mceclip4.png
  4. Enter the Table Name and click Next in Indicative.mceclip5.png

Grant Permissions 

mceclip3.png

  1. This integration works by sharing the dataset with Indicative's service account and only requires read-only access to that dataset. Indicative takes on the cost of the query and caches this data in Indicative's proprietary analytics engine.

    1. Within the BigQuery Console, select your Project and your dataset from the previous section.
    2. Click on Share Dataset

    3. In the Dataset Permissions panel, in the Add Members field, place the user below.
      integrations@indicative-988.iam.gserviceaccount.com
    4. In the Select a Role dropdown, select BigQuery Data Viewer and click Add.
      mceclip5.png

Event Modeling

mceclip4.png

  1. In the Events Field section, input the field that should be used to derive Indicative event names.
  2. For Timestamp, input the field that represents the time that the event was performed.
  3. Click Next

User Identification (Aliasing)
mceclip5.png

For more information on User Identification (Aliasing), please refer to this article.

  1. If you choose to enable Aliasing:
    1. Unauthenticated ID - Input the field used to identify anonymous users.
    2. Authenticated ID - Input the field used to identify known users.
  2. I you choose to disable Aliasing, press Disabled:
    1. Unauthenticated ID - Enter the field used to identify your users. All users must have a value for this field.

  3.  Press Next

Scheduling

mceclip6.png

  1. Select the Schedule Interval to adjust the frequency at which new data is available in Indicative.
  2. Set the Schedule Time for when the data should be extracted from your BigQuery environment. It is critical that 100% of the data is available by this time to avoid loading partial data.
  3. Select Save

Waiting for Data


mceclip3.png

Advanced Settings

For additional advanced settings such as excluding certain events and properties, please refer to this page

1 users found this helpful