Steps to turn on partitioning

Enable BigQuery Partitioning

Artie

Welcome to Artie, the database replication platform.

Moving high-volume data is hard. Artie replicates operational data into warehouses and lakes — reliably, without the heavy engineering most pipelines require. On this page, we break down the latest features, what they solve, and why they matter.

Changelog

Discover the various ways that Artie is able to connect to your database.

Connection Options

Learn about Artie's split plane architecture that separates control and data planes for enhanced security and flexibility.

Architecture

Backfills

Advanced Settings

In addition to the source table columns, you can configure Artie to add additional columns to your tables.

Artie Columns

Analytics Portal

Stay up to date on schema changes in your source database.

Schema Changes

In this section, we will go over the metrics that Artie Transfer emits and the future roadmap.

Available Metrics

Inviting Your Team

Enabling Slack Notifications

Customize how your team can log into Artie

Single Sign-On (SSO)

Terms of Service

Data Processing Addendum

Privacy Policy

Subprocessors

ELv2 License Addendum

In this section, we will go over how to install and run Artie Transfer.

Overview

This page describes the available configuration settings for Artie Transfer.

Options

Examples

Learn how to use Artie to replicate data from DocumentDB via change streams.

DocumentDB

DynamoDB

Microsoft SQL Server

Learn how to use Artie to replicate data from MongoDB via change streams.

MongoDB

MySQL

BigQuery

Databricks

Redshift

Artie will write delta files in Parquet format to an S3 bucket.

Snowflake

Database migrations

In this document, we will discuss how to prevent WAL growth for a Postgres database running on AWS RDS.

Preventing WAL growth on Postgres running on AWS RDS

We will go over how we can add primary key(s) to tables that do not have them.

Tables without primary key(s)

Curious how Artie's typing library works? You've come to the right place! Here, we will discuss how Artie's internal typing library works and how we ensure source-data integrity.

Partitioning type	Description	Example
Time partitioning	Partitioning a particular column that is a TIMESTAMP. BigQuery allows hourly, daily, monthly, yearly and integer range partitioning intervals.	Column: timestamp Partitioning granularity: daily
Integer range or interval based partitions	Partitioning off of a range of values for a given column.	Say you have a column called customer_id and there are 100 values. You can specify to have values 0-9 go to one partition, 10-19 the next, etc.
Ingestion-based	This is when the row was inserted. This is not recommended, because it requires storing additional metadata to know when this row was inserted. If we don’t specify this upon a merge, we will end up creating duplicate copies.	NA

Guides

​Steps to turn on partitioning

Steps to turn on partitioning