Data
Implement a change data capture workflow in Apache Kafka®, a key component of any organization with high data integrity requirements.
Learn how to safely migrate a MySQL database to a new cloud provider, region and version without losing data on the Aiven platform
Vector embeddings are key to ML, and here we describe how to use OpenCV, OpenAI CLIP and pgvector to generate vectors and use them to perform image recognition on a corpus of photos.
Caching is used to speed up cloud applications, particularly for database reads. Read on to learn more, and find out how to build caching with Redis®* into a simple PostgreSQL® web app.
Learn how to validate your data as it goes into your databases to improve data quality
Learn how to get ClickHouse® data into the Metabase Business Intelligence tool, as a pathway to getting visualisation and insights into your data.
Mastodon → Apache Kafka® → OpenSearch® → knowledge
If you want to analyze Mastodon posts, getting them into Apache Kafka® is a sensible first step. Read on to find out how to do this with Typescript and NodeJS.
We all know we shouldn't use naughty words. Learn how to remove them from your streaming data using DataCater.
How can you test an empty data pipeline? Read on to discover how to create pretend streaming data using Python and Faker.
Find out how to use Apache Kafka® to migrate across database technologies while keeping the target continually in sync with the source.