I'd be curious how they implement updating. AFAICT this is the thorniest part of...

gujun720 · on Jan 11, 2020

Milvus allows users to append vectors. Vectors are stored in multiple file slices. When a file slice reaches the threshold, Milvus will build the index for that file slice, and new data will be inserted into a new file slice. For details, please refer https://medium.com/@milvusio/managing-data-in-massive-scale-...

We are now working on the vector deletion. Hopefully will be ready by the end of 1Q this year.

peterldowns · on Jan 11, 2020

If I append a single new vector, will it show up in search results without me needing to ask for the index to be rebuilt? Can i update an existing vector without having to ask for the index to be rebuilt?

EDIT: from reading the linked article, it seems like newly inserted vectors will be queried using brute force. Very interesting idea!

gujun720 · on Jan 11, 2020

Correct, new vectors will first be searched thru brute force until the index is created on that file slice.

pilooch · on Jan 11, 2020

See https://github.com/jolibrain/deepdetect/pull/641 that uses FAISS as a backend alternative to annoy (annoy supported as well). Deletion can be implemented by removing entries from the listing db while the vector remains within the index.

Tests show that FAISS is bit better than annoy on retrieval of both small and million items indexes. It also includes ind x compression techniques that in our tests do fair very well, with very low loss on mid size 500k image indexes.