There is no single "best" index. The right choice depends on your data size, latency budget, recall target, and memory constraints. Pick a dataset scale and hover the bubbles or cards to compare.
Bubble size shows memory footprint — smaller bubble = more memory used.
Brute force — compare against every vector.
Cluster the data; search only the nearest clusters.
IVF + 8-bit scalar quantization. ~4× less memory.
IVF + product quantization. Aggressive compression, ~16× smaller.
Hierarchical graph. Best speed-recall trade-off in memory.
Graph index that lives on disk. Trades latency for memory.
Decision shortcut: small data → FLAT or HNSW; medium data with RAM → HNSW; medium-to-large data with tight RAM → IVF_SQ8 or IVF_PQ; billion-scale → DiskANN. Always benchmark on your real data before committing.