Skip to main content

Diskless topics for Apache Kafka®

Diskless topics are a feature of Aiven's Inkless clusters that store Kafka topic data in cloud object storage, without writing to local disk.

Diskless topics are available in Inkless Kafka services on Aiven Cloud and Bring Your Own Cloud (BYOC). In Bring Your Own Cloud (BYOC) deployments, Aiven manages the Kafka service in your cloud account, while you retain control over your infrastructure and data.

note

Diskless topics are limited availability on Aiven Cloud.

About diskless topics

Diskless topics store topic data in cloud object storage, such as Amazon S3 or Google Cloud Storage (GCS), instead of on broker disks. This design simplifies operations, reduces cross-availability zone (AZ) traffic, and supports cost-effective scaling.

Data is batched and written to object storage. Partition metadata and message ordering are managed by an internal coordination layer that Aiven deploys and operates to support diskless topics. For details, see Batching and delivery.

Diskless topics work with standard Kafka APIs and clients, and most applications do not require any changes to use them. If you use the Kafka CLI (kafka-topics.sh) to create or configure diskless topics, use the script from the Inkless repository.

For architectural details, see Diskless topics architecture.

Benefits of using diskless topics

Diskless topics are well suited for workloads that require high throughput and rapid scaling. They provide:

  • Elastic scaling: Supports high throughput and scales in seconds.
  • No disk overruns: Shifting to object storage removes broker disk capacity limits.
  • Lower storage and network costs: Reduces cross-availability zone traffic by offloading data to cloud object storage.
  • Lower latency for hot data: Frequently accessed data is cached on brokers to improve fetch performance.
  • Simplified storage management: No need to manage broker disks, rebalance partitions, or manually provision storage.
  • Faster scaling and node replacement: Removing large local disks reduces data movement during scaling and node rotation.
  • Compliance and security: In BYOC deployments, the service runs entirely within your own cloud account.
note

Internal and ecosystem topics (such as consumer offsets, Kafka Connect, MirrorMaker 2, and Schema Registry topics) are managed by the service. You cannot change their storage type or configuration.

Diskless vs. classic Kafka topics

Diskless topics store data in cloud object storage and do not rely on broker-managed replication or partition leadership. In Classic Kafka services, classic topics store data on broker-local disks and use standard Kafka replication.

You can use both diskless and classic Kafka topics in the same Inkless Kafka service. This allows you to:

  • Adopt diskless topics gradually.
  • Continue running workloads that require features not yet supported by diskless topics.
  • Maintain flexibility in your deployment strategy.

For a detailed comparison, see Compare diskless and classic topics.

Related pages