Amazon Redshift is fully managed, cloud-based data warehouse. Redshift exhibits far better performance for large-scale analytical queries than do traditional, row-based relational databases like MySQL and PostgreSQL. You can learn more about transactional vs. analytic databases in our Data Strategy Guide.
In this tutorial, we'll walk you through how to spin up your own Redshift instance. You can use the links under Jump To to jump around, or if you're a visual learner, let our UX Director Kevin Jackson give you a quick walkthrough:
Before you can set up your Amazon Redshift cluster, you’ll need to complete the following:
To sign up for an AWS account, click here or enter this URL in your browser: http://aws.amazon.com/
Follow the instructions on the website to complete setting up your account.
A VPC, or Virtual Private Cloud, is a scalable virtual network that resembles a traditional network used in a data center. Any AWS account created after December 3, 2014, includes a default VPC in each AWS region. This means that you can use the default VPC to launch your Redshift cluster instead of spinning up your own.
If you want to use your own, however, use Amazon's Getting Started with VPC guide to create it after you sign up for your AWS account. When creating your VPC, note that the cluster you'll be using for Stitch must reside in a publicly accessible subnet.
You'll need to do this before continuing with this tutorial.
Most SQL client tools with Amazon Redshift JDBC or ODBC drivers can connect to Amazon Redshift. After you create and launch your cluster, you'll use an SQL client to test the connection. For this reason, we recommend that installation and setup for your SQL client are complete before beginning the cluster creation process.
We've put together a list of some compatible SQL clients. Click here to check it out.
For information regarding JDBC and ODBC drivers, refer to Amazon’s Configuring Connections in Redshift guide.
Note that this is optional for the tutorial - for the purposes of this tutorial, the SQL client is only used to test the connection. When connecting Redshift to Stitch, Stitch will check and monitor the connection for any potential issues. If you plan to query your data, you will need a compatible client.
After you've completed the prerequisites, you can move on to creating the cluster. Let's get started!