hero-image

Easily migrate your Cassandra data into Aiven with sstableloader

July 31, 2019
By John Hammink

Aiven now supports sstableloader, an open source utility which allows you to easily migrate all of the data from your external Apache Cassandra instances into a managed Aiven Cassandra cluster. Sstableloader allows this seamless migration by bulk-loading data from remote SSTables into an Aiven Cassandra cluster via the target nodes’ internode ports.

How sstableloader works

An SStable is a persistent, immutable file format where your Cassandra data is stored. It is created when fast memory stores called memtables are flushed to disk. Sstableloader works by first connecting to the new Cassandra cluster to determine which nodes of the target cluster should receive which rows, then uploading the SSTable data to the target nodes.

migrating cassandra into aiven using sstableloader

This data upload - called streaming - is implemented as a command line client using the same Java code the Cassandra service uses internally to replicate data between cluster nodes, connecting to the Cassandra internode port to achieve a more efficient way to transfer large amounts of data.

Clients cannot normally connect to this internode port, and Aiven requires sstableloader to connect with SSL through client certificates to secure access. As a result, large clusters for which inserting the source cluster’s data row by row with CQL would take unfeasibly long can now be migrated.

Wrapping up

With sstableloader, you now have the industry standard for bulk data loading into Cassandra which makes migrating your full data pipeline to Aiven that much simpler.

You can find the complete procedure here.

We’re always adding more features to the Aiven platform, so stay up to date with our blog and changelog RSS feeds, or follow us on Twitter and LinkedIn.

Start your free 30 day trial today

Test the whole platform for 30 days with no ifs, ands, or buts.