Hot and cold data with Apache Kafka, Tiered Storage, and Iceberg

Utilizing the true potential of data streaming is key to business success.

In this Data (R)evolution episode, we're joined by Josep Prat and Filip Yonov to dive into the transformative features of Apache Kafka and its evolving role in data architecture. They discuss the critical importance of collaboration and feedback in enhancing Kafka's capabilities, the future of "lake house" technology, exciting updates from the Open Source Program Office (OSPO), and the importance of Kafka's readiness to support evolving data formats—making it a backbone for modern data ecosystems.

Key takeaways:

  • Community collaboration and contribution are essential for the continuous improvement and testing of Apache Kafka's capabilities.
  • The evolution of Apache Kafka into a more versatile platform, combined with object storage and open table formats, can significantly enhance real-time data streaming, analytics, and the future of "lake house" technology.
  • Tiered storage in Kafka facilitates more efficient and cost-effective data management by decoupling storage from computing.