Index your Valkey Cache and Start Searching

Aiven for Valkey includes the Valkey Search module setup and ready to go. Here's what that looks like in practice: a small online shop adding real search on top of the cache it's already running.

Needle & Yarn sells the yarn you crochet with (skeins) and the design patterns you crochet from. Like a lot of e-commerce backends, it already runs Valkey as a product cache, with each product stored as a Hash for hot-path performance.

The team wants to surface popular designs and skeins by recent traffic and saves. Adding a search index to the cache they're already running gets them all of this without a new datastore.

Note: the examples below use native Valkey command syntax, the same shape you'll see in the docs. A runnable Python tutorial that wraps everything in a flask app running with valkey-py lives in valkey-search-poc/.

The cache Needle & Yarn already runs

Needle & Yarn's commerce backend caches each design as a Hash under design:<id>.

HSET design:1001
  title       "Cabled Aran Cardigan"
  designer    "Liam O'Connor"
  yarn_weight worsted
  needle_size 5.0
  difficulty  advanced
  type        garment
  price       12.00
  rating      4.6
  favorites   842
  page_views  12480
Loading code...

Two signals live on every record. favorites counts customers who've saved the pattern, a measure of long-lived intent. page_views counts views in the recent window, short-lived intent. Only favorites come from PostgreSQL, the parent database.

Each Hash carries a TTL that's refreshed on every update.

HSET design:1001 favorites 843 page_views 12481
EXPIRE design:1001 86400
Loading code...

Records in active use stay in the cache and the index; records nobody touches for a day fall out automatically. No background sweep, no manual cleanup.

Skeins live in the same Valkey instance under skein:<id> with their own product fields and the same TTL pattern:

HSET skein:s101
  title       "Highland Wool DK"
  brand       "Northwind"
  yarn_weight DK
  fiber       wool
  color       oatmeal
  yardage     225
  needle_min  3.5
  needle_max  4.5
  in_stock    yes
  price       8.50
  favorites   612
  page_views  9220
Loading code...

Declare an index over the cache

A Valkey Search index describes which keys to watch and which fields to make queryable. The schemas use four field types: TEXT for full-text fields, TAG for exact-match filters, NUMERIC (with SORTABLE) for range and sort, and VECTOR, which we'll get to.

Once an index exists, every existing Hash matching the prefix is backfilled into the index, and every future HSET updates it automatically. There's no separate "insert into the index" call.

FT.CREATE designs_idx
  ON HASH
  PREFIX 1 design:
  SCHEMA
    title       TEXT WEIGHT 2.0
    designer    TEXT
    yarn_weight TAG
    needle_size NUMERIC SORTABLE
    difficulty  TAG
    type        TAG
    price       NUMERIC SORTABLE
    rating      NUMERIC SORTABLE
    favorites   NUMERIC SORTABLE
    page_views  NUMERIC SORTABLE
Loading code...

And the parallel index over the skein cache:

FT.CREATE skeins_idx
  ON HASH
  PREFIX 1 skein:
  SCHEMA
    title       TEXT WEIGHT 2.0
    brand       TEXT
    yarn_weight TAG
    fiber       TAG
    color       TAG
    in_stock    TAG
    yardage     NUMERIC SORTABLE
    needle_min  NUMERIC SORTABLE
    needle_max  NUMERIC SORTABLE
    price       NUMERIC SORTABLE
    favorites   NUMERIC SORTABLE
    page_views  NUMERIC SORTABLE
Loading code...

A few queries to set the scene

FT.SEARCH covers the obvious shapes: text, tag, numeric, and hybrid filters.

A customer searches the catalog for "cardigan":

FT.SEARCH designs_idx "cardigan"
Loading code...

The editorial team builds an "advanced picks" rail, high-difficulty patterns rated 4.5 or higher, sorted by saves:

FT.SEARCH designs_idx "@difficulty:{advanced} @rating:[4.5 +inf]" SORTBY favorites DESC
Loading code...

A "trending now" rail uses the short-window signal, top designs by recent traffic:

FT.SEARCH designs_idx "*" SORTBY page_views DESC LIMIT 0 10
Loading code...

Skeins follow the same pattern, and the reverse lookup falls out for free. Given a design that calls for worsted-weight yarn around a 5mm hook, find skeins that match: needle range covering 5mm, in stock, sorted by price:

FT.SEARCH skeins_idx "@yarn_weight:{worsted} @needle_min:[0 5.0] @needle_max:[5.0 +inf] @in_stock:{yes}" SORTBY price ASC
Loading code...

And the other direction: given a skein the customer just bought (worsted, oatmeal), find designs that work with it:

FT.SEARCH designs_idx "@yarn_weight:{worsted}" SORTBY favorites DESC
Loading code...

This isn't trying to replace Postgres FTS or a dedicated OpenSearch index; it gives you similar functionality over data you're already keeping hot in Valkey, with the structured fields you already index for caching purposes.

Vector search

The vector layer is the part of Valkey Search that's hard to replicate elsewhere without standing up a separate system. It runs similarity search next to those same structured fields, on data that's already in the cache.

This is what answers "what else looks like this?": similarity by meaning rather than exact match. Valkey Search treats a vector as another field type alongside TEXT, TAG, and NUMERIC, and lets you mix all of them in one query.

Two index algorithms are available. FLAT is brute-force exact K-Nearest Neighbor: perfect accuracy, slower as the dataset grows. Use it when the dataset is small or accuracy matters more than latency. HNSW is approximate, around 99% recall at milliseconds on millions of vectors. Use it when scale matters. Same query syntax either way; only the index changes. We'll use HNSW.

The example below shows the migration shape: an existing designs_idx is already serving traffic, and the team wants to add similarity search without dropping and rebuilding. Standing up a parallel vector index over the same key prefix means both indexes watch design:*, both see every HSET, and the basic one keeps serving while the new one backfills.

FT.CREATE designs_hnsw_idx
  ON HASH
  PREFIX 1 design:
  SCHEMA
    title       TEXT
    yarn_weight TAG
    difficulty  TAG
    embedding   VECTOR HNSW 6
      TYPE FLOAT32
      DIM 384
      DISTANCE_METRIC COSINE
Loading code...

The 6 after the algorithm name is the count of arguments that follow (not the vector dimension). We're declaring three attribute pairs (TYPE FLOAT32, DIM 384, DISTANCE_METRIC COSINE), which is six args total. The actual vector dimension is DIM 384, matching the output of sentence-transformers' all-MiniLM-L6-v2. Different models produce different sizes: OpenAI's text-embedding-3-small is 1536, Voyage's voyage-3-large is 1024, and so on. Set DIM to whatever your model produces.

Generating the embedding

The application has to produce the vector before it writes the design. With sentence-transformers in Python:

from sentence_transformers import SentenceTransformer
from valkey.commands.search.query import Query
import valkey

model = SentenceTransformer("all-MiniLM-L6-v2")     # 384-dim
client = valkey.Valkey()

def item_query(query_text: str, yarn_weight: str = None, limit: int = 5):
    vec = model.encode(query_text).astype("float32").tobytes()
    prefilter = f"@yarn_weight:{{{yarn_weight}}}" if yarn_weight else "*"
    q = (
        Query(f"{prefilter}=>[KNN {limit} @embedding $vec AS score]")
        .sort_by("score")
        .return_fields("title", "yarn_weight", "score")
        .paging(0, limit)
        .dialect(2)
    )
    return client.ft("designs_hnsw_idx").search(q, {"vec": vec}).docs
Loading code...

The query takes a prefilter before the =>[KNN ...] clause. * means no filter; replace it with any tag, numeric range, or text expression to narrow the candidate set first:

# No filter — search all designs
FT.SEARCH designs_hnsw_idx "*=>[KNN 5 @embedding $vec AS score]"
  PARAMS 2 vec <bytes>
  SORTBY score
  RETURN 3 title yarn_weight score
  DIALECT 2

# Narrow to worsted-weight designs first, then find the 5 nearest
FT.SEARCH designs_hnsw_idx "@yarn_weight:{worsted}=>[KNN 5 @embedding $vec AS score]"
  PARAMS 2 vec <bytes>
  SORTBY score
  RETURN 3 title yarn_weight score
  DIALECT 2
Loading code...

The rest of the syntax stays the same. PARAMS 2 vec <bytes> passes the query vector (the 2 is the argument count: one name, one value). DIALECT 2 enables the =>[KNN ...] syntax and must be set per query. That second query is how Needle & Yarn builds "designs like this one, in the yarn weight you're already shopping for" in a single round trip.

Where else this pattern fits

Needle & Yarn is a yarn shop, but the shape (cache plus index plus popularity counters plus similarity vectors) applies elsewhere. A few places it comes up regularly.

Internal AI assistants over private data work exactly this way: cache a corpus of docs as Hashes, add embeddings, expose FT.SEARCH as a tool. Customer support has two obvious fits: past tickets like this one, and KB articles relevant to this issue. Recency matters here too. When a new bug surfaces, an automated answer can resolve it everywhere at once rather than being handled ticket by ticket.

In-product "find similar" features follow the same structure regardless of domain: similar-document recommendations in a CMS, similar-account suggestions for a CSM, similar-incident lookup in an ops console. Hybrid filters over rapidly-changing fields are where a dedicated search engine tends to fall behind. Live inventory, dynamic pricing, current promo eligibility, in-flight order status: these all update too fast for an external index but stay searchable and accurate when the cache is also the index.

Anomaly detection and deduplication at ingestion are a less obvious fit but work well. Incoming user-generated content (reviews, comments, leads, abuse reports) checked against the existing corpus by semantic similarity catches near-duplicates that hand-tuned rules miss.

In each case, data that's already hot in Valkey for performance reasons gets indexed in place, with vector and structured fields combinable in one query.

If you're running Aiven for Valkey, the search module is available on every plan. Upgrade to Valkey 9.0, index your existing data fields, and start querying with vectors and hybrid filters today.

Stay updated with Aiven

Subscribe for the latest news and insights on open source, Aiven offerings, and more.

Subscribe to RSS

Table of contents

The cache Needle & Yarn already runs
Declare an index over the cache
A few queries to set the scene
Vector search
Generating the embedding
Where else this pattern fits

Index your Valkey Cache and Start Searching

The cache Needle & Yarn already runs

Declare an index over the cache

A few queries to set the scene

Vector search

Generating the embedding

Where else this pattern fits

Stay updated with Aiven

Related resources

Seamlessly Migrating 15k Redis servers to Valkey

The Aiven Free Tier Competition: Build, Share, and Win $1,000!

Right Size Your Model Usage with Valkey and Semantic Routing