
Why does it take more time to search the entire xq than to search sequentially for a subset of all the shards of the xq? #836

Closed
chenyihang1993 opened this issue May 23, 2019 · 8 comments

@chenyihang1993

There are 1B samples in xq. I get the search results in two different ways:

  1. Using an IVF65536_HNSW32,PQ64 index: train it, add the whole of xq, then search.
  2. Using the same IVF65536_HNSW32,PQ64 index type: train it, cut xq into 10 subsets (100M samples each), add each subset to its own index, search the 10 indexes separately, and finally merge the results.

The first way takes more time than the second. Please tell me the reason. THX.
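The merge step in way 2 can be sketched with plain NumPy, assuming each shard returns its own (distances, ids) arrays and the ids have already been offset to be globally unique (the function name `merge_knn_results` is a hypothetical helper, not a Faiss API):

```python
import numpy as np

def merge_knn_results(D_list, I_list, k):
    """Merge per-shard top-k results into a global top-k.
    Assumes L2-style distances (smaller is better) and globally unique ids."""
    D = np.concatenate(D_list, axis=1)    # (nq, k * num_shards) distances
    I = np.concatenate(I_list, axis=1)    # matching ids
    order = np.argsort(D, axis=1)[:, :k]  # indices of the k smallest distances per query
    rows = np.arange(D.shape[0])[:, None]
    return D[rows, order], I[rows, order]
```

For inner-product metrics the sort direction would have to be flipped, since larger scores are better there.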
@mdouze
Contributor

mdouze commented May 24, 2019

Taking case 1 as the baseline:

  • Searching a dataset of 100M that is a subset of the 1B should be faster, because the inverted lists are shorter.
  • Searching 10 datasets of 100M each should be slower, because it performs the same number of comparisons with a higher overhead.
  • Doing the same in parallel (e.g. via an IndexShards) could be faster if you are otherwise searching in a single thread.

If you are not in one of those cases, please comment.

@chenyihang1993
Author

chenyihang1993 commented May 27, 2019

The strange thing is that searching 10 datasets of 100M is faster than searching the dataset of 1B. This is my case.

@chenyihang1993
Author

Here is a demo. The length of xb is 1M, and the index is 'IVF100_HNSW32,PQ64'.

import numpy as np
import faiss
import time

d = 64        # dimension
nb = 1000000  # database size
nq = 1000     # number of queries
nt = 10000    # number of training vectors
np.random.seed(1234)  # make reproducible
xb = np.random.random((nb, d)).astype('float32')
xq = np.random.random((nq, d)).astype('float32')
xt = np.random.random((nt, d)).astype('float32')
k = 100

# Way 1: a single index over the full database
index_1 = faiss.index_factory(d, 'IVF100_HNSW32,PQ64')
index_1.train(xt)
index_1.add(xb)
faiss.ParameterSpace().set_index_parameter(index_1, 'nprobe', 64)
start = time.time()
index_1.search(xq, k)
print('time of searching: ' + str(time.time() - start))

# Reuse the trained (now emptied) index as a template for the subsets
index_1.reset()
faiss.write_index(index_1, 'base.index')

# Way 2: 10 indexes of 100k vectors each, searched sequentially
subset_len = 100000
sum_time = 0
for i in range(0, nb, subset_len):
    index = faiss.read_index('base.index')
    faiss.ParameterSpace().set_index_parameter(index, 'nprobe', 64)
    index.add(xb[i:i + subset_len])
    start = time.time()
    index.search(xq, k)
    sum_time += time.time() - start

print('time of searching separately: ' + str(sum_time))

The result is

time of searching: 18.017865657806396
time of searching separately:4.433526039123535

@mdouze
Contributor

mdouze commented May 27, 2019

Thanks for the demo. I can reproduce the issue.
It seems that this is a threading problem. When running in a single thread, the times are:

time of searching: 38.52920603752136
time of searching separately:42.64698934555054

The issue is in the inverted list scanning; the quantization time is constant.
Looking further...

@beauby beauby added bug and removed question labels May 30, 2019
@beauby
Contributor

beauby commented May 30, 2019

This will be fixed soon.

@chenyihang1993
Author

Thank you very much.

@beauby
Contributor

beauby commented Jun 7, 2019

This was fixed upstream, and the fix will be available in the next release.

@beauby beauby closed this as completed Jun 7, 2019
beauby pushed a commit that referenced this issue Jun 19, 2019
Bugfixes:
- slow scanning of inverted lists (#836).

Features:
- add basic support for 6 new metrics in CPU `IndexFlat` and `IndexHNSW` (#848);
- add support for `IndexIDMap`/`IndexIDMap2` with binary indexes (#780).

Misc:
- throw python exception for OOM (#758);
- make `DistanceComputer` available for all random access indexes;
- gradually moving from `long` to `int64_t` for portability.
@ucasiggcas

Hi, dear,
If I want to use xb = 100M or larger, how should I set the index? I see the code you set:
The length of xb is 1M, and the index is 'IVF100_HNSW32,PQ64'.

index_1 = faiss.index_factory(d, 'IVF100_HNSW32,PQ64')
index_1.train(xt)
index_1.add(xb)
faiss.ParameterSpace().set_index_parameter(index_1, 'nprobe', 64)

thx
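Not an official answer, but as a rough starting point the Faiss guidelines suggest scaling nlist with the dataset size, on the order of a few times sqrt(N) (hence index strings like 'IVF65536_HNSW32,PQ64' at the ~100M–1B scale, as in the original question). A hypothetical helper under that assumption, not a Faiss API:

```python
import math

def suggest_nlist(n):
    """Hypothetical heuristic: nlist ~ 4 * sqrt(n), rounded to a power of two.
    A starting point only; tune nlist/nprobe against your own recall/speed trade-off."""
    return 2 ** round(math.log2(4 * math.sqrt(n)))

print(suggest_nlist(10**6))  # 1M vectors
print(suggest_nlist(10**8))  # 100M vectors
```

The resulting nlist would go into the factory string, e.g. f'IVF{nlist}_HNSW32,PQ64'.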
