Embeddings are pretty useful, but mathematically limited.
Great insights from Google DeepMind: On the Theoretical Limitations of Embedding-Based Retrieval (2508.21038)
What could the alternatives be? Cross-encoders (accurate, but too costly to run over a whole corpus), multi-vector models, sparse models... or something new.
Hybrid retrieval is the current quick fix, imo.
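For anyone wondering what "hybrid" can look like in practice, here is a minimal sketch using reciprocal rank fusion (RRF), one common way to merge a dense ranking with a sparse one. It is an illustration, not the method from the paper: the doc ids and the two input rankings are made up, and `k=60` is just a commonly used default.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of doc ids into a single ranking.

    Each document scores 1 / (k + rank) in every list it appears in,
    and the per-list scores are summed.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by doc id for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical rankings: one from an embedding model, one from a sparse model.
dense_ranking = ["d2", "d1", "d3"]
sparse_ranking = ["d1", "d4", "d2"]

fused = reciprocal_rank_fusion([dense_ranking, sparse_ranking])
print(fused)
```

RRF only looks at ranks, so the dense and sparse scores never need to be put on the same scale, which is a big part of why it works as a quick fix.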