What is Dense Retrieval?
Dense retrieval uses bi-encoders to map queries and documents into the same vector space. Similarity (dot product/cosine) retrieves top candidates efficiently.
How It Works (quick)
- Train: Contrastive learning on query–doc pairs.
- Encode: Precompute doc vectors; encode queries at runtime.
- ANN search: Return nearest neighbors; filter; pass to ranker.
- Cold-start: Use synthetic pairs if labels are scarce.
Why It Matters in E-commerce
- Great for long-tail and language variance.
- Low latency with precomputed vectors.
Best Practices
- Fine-tune on catalog + query logs.
- Refresh embeddings when inventory changes.
- Pair with neural re-rankers for quality.
Summary
Dense retrieval is fast semantic recall. Tune on your domain and combine with re-ranking.