Discussion about this post

User's avatar
Neural Foundry's avatar

This deep dive on multi-vector retrieval is exactly what the field needs right now. The token level granularity point really hits home becasue single-vector compression basically throws away all the detail that makes retreival actually useful. I've seen so many RAG systems fail on edge cases and this explains why, the semantic drift from compression is way worse than people realize and quantization making this practical changes everything.

Expand full comment

No posts

Ready for more?