Data

TensorFlow, PostgreSQL®, PGVector & Next.js: building a movie recommender
Leveraging TensorFlow, PostgreSQL®, PGVector, and Next.js for vector search with this step-by-step video guide.
Speed up PostgreSQL® pgvector queries with indexes
Learn the theory and the details of how to speed up PostgreSQL® pgvector queries using indexes IVFFlat, HNSW and traditional indexes
Develop and leverage AI models in ClickHouse®
Learn how to train AI models and perform live scoring with a set of SQL statements and Aiven for ClickHouse®
A guide to Apache Kafka® tiered storage with Aiven and Terraform
A guide to what tiered storage is, and how you can start using it with Aiven for Apache Kafka® and Terraform. We’ll set up a cluster, load the data and observe the metrics.
SQL query optimization: a comprehensive developer's guide
An SQL optimization guide for developers. With best practices, warnings, and pro tips to speed up your SQL query optimization.
Managing data drift with Apache Kafka® Connect and a schema registry
Use Karapace, an open source Apache Kafka® schema registry, to prevent data errors by managing the data model across databases
Change Data Capture from Azure SQL to Apache Kafka® with Debezium
Implement a real-time change data capture workflow from an Azure SQL database using Aiven for Apache Kafka® and Debezium
Serverless event driven architecture with AWS Lambda functions and Apache Kafka®
Build serverless Event Driven Architectures (EDA) by combining Apache Kafka® with AWS Lambda functions. Learn how to trigger Lambda functions based on events flowing in an Apache Kafka topic
Change Data Capture from Amazon RDS to Apache Kafka® with Debezium
Implement a real-time change data capture workflow from an Amazon Relational Database Service database using Aiven for Apache Kafka®
Enabling change data capture from MySQL to Apache Kafka® with Debezium
Implement a change data capture workflow in Apache Kafka®, a key component of any organization with high data integrity requirements.
Image recognition with Python, OpenCV, OpenAI CLIP and pgvector
Vector embeddings are key to ML, and here we describe how to use OpenCV, OpenAI CLIP and pgvector to generate vectors and use them to perform image recognition on a corpus of photos.
Use PostgreSQL® DOMAIN rules to validate columns of data
Learn how to validate your data as it goes into your databases to improve data quality
Social search in real time: Exploring Mastodon data with Apache Kafka® and OpenSearch®
Mastodon → Apache Kafka® → OpenSearch® → knowledge
Stream Mastodon data to Apache Kafka® using NodeJS and TypeScript
If you want to analyze Mastodon posts, getting them into Apache Kafka® is a sensible first step. Read on to find out how to do this with Typescript and NodeJS.
Remove naughty words from your data using DataCater
We all know we shouldn't use naughty words. Learn how to remove them from your streaming data using DataCater.
Database migration with Apache Kafka® and Apache Kafka® Connect
Find out how to use Apache Kafka® to migrate across database technologies while keeping the target continually in sync with the source.