Google BigQuery is an industry-leading, fully-managed cloud data warehouse that allows you to store and analyze petabytes of data in no time.
RudderStack allows you to dump your customer event data into Google BigQuery by adding it as a destination to the source of your choice.
Follow these steps to set up Google BigQuery before adding it as a destination in RudderStack:
Log into the Google BigQuery console.
Create a new bucket for RudderStack to store the staging files
Create a new service account under IAM & Admin - Service Accounts
Add Service Account permissions as follows:
Grant storage admin access to the bucket
Grant admin access to BigQuery
Create a key for the service account with JSON as the type and store it.
In order to enable sending data to Google BigQuery, you will first need to add it as a destination to the source from which you are sending event data. Once the destination is enabled, events from RudderStack will start to flow to BigQuery.
To do so, please follow these steps:
Choose a source to which you would like to add Google BigQuery as a destination. You can also simply create the destination and connect it to a source later.
Once you have set up the source, select Google BigQuery from the list of destinations. Give your destination a name, and then click on Next. You will then see the following Connection Credentials screen:
Add your project ID and bucket name, as specified in the Setting up Google BigQuery section
Copy the contents of the credentials JSON file you created and stored, in the section above.
That's it! You have successfully added Google BigQuery as a destination in RudderStack. Once you start sending events from your source, RudderStack will dump them into the BigQuery bucket periodically.