
HNSW BP reordering #14097

Merged 5 commits from hnsw-bpreorder-merge-policy into apache:main on Jan 7, 2025
Conversation

@msokolov (Contributor) commented Jan 2, 2025

Description

This is similar to the previous PR (#13683) on the same issue, but the differences here are:

  1. rebased on main (there was a 10.0 major release in between)
  2. refactored to make BpVectorReorderer compatible with BPIndexReorderer so that either can plug into BPReorderingMergePolicy (see the sketch below)
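
As an illustrative aside, here is a hypothetical sketch of the common shape this refactoring implies - the interface name and method signature below are made up for illustration, not the committed API:

```java
import java.io.IOException;
import org.apache.lucene.index.CodecReader;
import org.apache.lucene.index.Sorter;

// Illustrative only: a minimal common contract under which BPReorderingMergePolicy
// could drive either BPIndexReorderer (postings-based) or BpVectorReorderer
// (vector-based) interchangeably.
interface Reorderer {
  /** Returns a mapping from old doc IDs to new doc IDs, or null to keep the current order. */
  Sorter.DocMap computeDocMap(CodecReader reader) throws IOException;
}
```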

@msokolov (Contributor, Author) commented Jan 2, 2025

Note: this addresses issue #13565.

@msokolov (Contributor, Author) commented Jan 2, 2025

One interesting detail that came out while testing with the merge policy is that indexing actually becomes faster with this change, since the additional work we do in the BP reordering is outweighed by the savings from performing HNSW searches during merging. Indeed, the savings can be dramatic:

| condition | recall | latency (ms) | nDoc | topK | fanout | maxConn | beamWidth | quantized | index (s) | index (docs/s) | force merge (s) | num segments | index size (MB) | vec disk (MB) | vec RAM (MB) |
|-----------|--------|--------------|------|------|--------|---------|-----------|-----------|-----------|----------------|-----------------|--------------|-----------------|---------------|--------------|
| no BP | 0.794 | 0.996 | 2000000 | 10 | 64 | 64 | 250 | 7 bits | 1073.04 | 1863.87 | 271.95 | 1 | 2607.89 | 2441.406 | 488.281 |
| with BP | 0.797 | 0.887 | 2000000 | 10 | 64 | 64 | 250 | 7 bits | 987.76 | 2024.78 | 143.48 | 1 | 2558.39 | 2441.406 | 488.281 |

@mikemccand (Member) commented:

These results look amazing @msokolov!

@msokolov (Contributor, Author) commented Jan 6, 2025

Right, I'm excited about this! And pleased with how it worked out as a merge policy. I was kind of hoping @jpountz would review, but perhaps he's out on vacation. Most of this has already been seen, and he approved the earlier PR. The main new thing here that might be controversial is the refactoring so that this reorderer shares a common interface (and some implementation, for less code repetition) with the pre-existing BPIndexReorderer. I think I'll go ahead and commit, but not backport yet, so we can get some runs in on the test machines, and if we later decide we want to revise that approach we still can.

Hmm, I see it's only been a few days, including the weekend. I'll wait another day before pushing.

@mikemccand (Member) commented:

@msokolov do you have changes to luceneutil's knnPerfTest.py to enable this? Let's merge those upstream (to luceneutil) too ... I'm working on getting the nightly benchy to run knnPerfTest, so maybe we can cut over to this new merge policy there!

The nightly benchy already indexes KNN vectors (with and without quantization), so that's another place where we could turn on this new feature... I'll open a luceneutil issue to iterate! Exciting...

@benwtrent (Member) commented:

These are exciting numbers! It's interesting that improved search latency also drops the index build time.

Do we know why the searching times are so much better? Is it simply because during merge, the merger sees the docs in a specific order that aids better graph building?

I am a bit confused by how this all works.

@msokolov (Contributor, Author) commented Jan 6, 2025

I am not sure, but I surmise that search performance improves because of some combination of (1) graph ordinal decoding being faster (since we encode using VInts and these are now smaller), and (2) improved memory locality - in theory we are now comparing vectors that are nearer to each other in virtual memory, though I'm not sure how that could/would affect anything. I admit this is pretty hand-wavy, but it's the best I've got -- the only change here is that more similar vectors are now nearer to each other in docid (and thus vector ordinal) space, and a follow-on from that is that delta encoding in the HNSW graph is more compact/efficient.
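
To make the VInt intuition concrete, here is a standalone sketch (not the Lucene implementation) showing how the byte cost of a delta-encoded neighbor list shrinks once the ordinals of similar vectors sit close together; the neighbor lists are hypothetical:

```java
// Standalone demo: VInt stores 7 payload bits per byte, so smaller deltas
// between sorted neighbor ordinals need fewer bytes.
public class VIntDeltaDemo {

  static int vIntBytes(int v) {
    int bytes = 1;
    while ((v & ~0x7F) != 0) { // more than 7 bits remain: another byte needed
      bytes++;
      v >>>= 7;
    }
    return bytes;
  }

  static int deltaEncodedSize(int[] sortedNeighborOrds) {
    int size = 0, prev = 0;
    for (int ord : sortedNeighborOrds) {
      size += vIntBytes(ord - prev); // encode the gap from the previous ordinal
      prev = ord;
    }
    return size;
  }

  public static void main(String[] args) {
    // Hypothetical neighbor lists for one node, before and after BP reordering:
    int[] scattered = {190_000, 400_000, 1_200_000, 1_750_000}; // every gap needs 3 bytes
    int[] clustered = {190_000, 190_050, 190_400, 191_000};     // later gaps fit in 1-2 bytes
    System.out.println(deltaEncodedSize(scattered)); // 12 bytes
    System.out.println(deltaEncodedSize(clustered)); // 8 bytes
  }
}
```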

@msokolov (Contributor, Author) commented Jan 6, 2025

Regarding merging the luceneutil tooling - I will open a PR, but I suggest we hold off merging it until this change hits Lucene.

@msokolov (Contributor, Author) commented Jan 6, 2025

I will note that tests with smaller indexes don't show such dramatic improvements - more support for the theory that graph decoding is what is helped, because there are no real compression savings on small graphs where all the deltas are two bytes already. In the previous round of this, Adrien had suggested looking into different encodings for the graph node ids, as we have for postings - I suspect there is room for improvement there.

@mikemccand (Member) commented:

I like the more efficient delta encoding theory.

Decoding vInt is a hotspot for HNSW graph traversal ... so if we can use 2 bytes instead of 3, or 1 byte instead of 2, thanks to the improved locality, that might be needle moving!

@benwtrent (Member) commented:

If we really think VInt is the cause, I wonder if we should switch the encoding to the readGroupVInts stuff? #12871
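
For context, group-varint (the idea behind readGroupVInts) packs the byte lengths of four values into one control byte so the decoder avoids VInt's per-byte continuation checks; a minimal conceptual sketch, not Lucene's actual implementation:

```java
import java.io.ByteArrayOutputStream;

// Conceptual group-varint encoder: one control byte carries four 2-bit length
// tags, followed by each value in 1-4 little-endian bytes. A decoder can then
// read a whole group without VInt's per-byte continuation-bit branching.
public class GroupVarintSketch {

  static int byteLen(int v) {
    if ((v >>> 8) == 0) return 1;
    if ((v >>> 16) == 0) return 2;
    if ((v >>> 24) == 0) return 3;
    return 4;
  }

  static void encodeGroup(int[] four, ByteArrayOutputStream out) {
    int control = 0;
    for (int i = 0; i < 4; i++) {
      control |= (byteLen(four[i]) - 1) << (i * 2); // 2-bit length tag per value
    }
    out.write(control);
    for (int v : four) {
      for (int b = 0; b < byteLen(v); b++) {
        out.write((v >>> (b * 8)) & 0xFF); // little-endian payload bytes
      }
    }
  }

  public static void main(String[] args) {
    ByteArrayOutputStream out = new ByteArrayOutputStream();
    encodeGroup(new int[] {3, 300, 70_000, 20}, out);
    System.out.println(out.size()); // 1 control byte + 1 + 2 + 3 + 1 payload bytes = 8
  }
}
```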

My thought on why BP reordering helps was that the order in which the vectors are read actually helps the vector indexing, because clusters of vectors get indexed together. Honestly, bootstrapping HNSW via clusters is a really nice idea (and could help solve the "merge cost problem" by reusing clusters during merge...).

The numbers here are really nice. I just want to understand why they were better, especially as recall changes, which seems to indicate that the graph building itself is being changed.

@msokolov (Contributor, Author) commented Jan 6, 2025

> The numbers here are really nice. I just want to understand why they were better, especially as recall changes, which seems to indicate that the graph building itself is being changed.

I didn't see a big impact on recall beyond what is typical from noise -- even with the same graph settings, we see variance in recall due to randomness in the graph creation when there are multiple threads. I had `'numMergeWorker': (12,), 'numMergeThread': (4,)`, so that could be the explanation for the variance?

@benwtrent (Member) commented:

> I didn't see a big impact on recall beyond what is typical from noise -- even with the same graph settings, we see variance in recall due to randomness in the graph creation when there are multiple threads. I had `'numMergeWorker': (12,), 'numMergeThread': (4,)`, so that could be the explanation for the variance?

Yeah, that could explain the variance. For repeatable tests, I generally merge with only one thread, but that's because I am usually doing things that affect recall.

But that force merge time difference is 🤯 - 143.48 s vs. 271.95 s is HUGE. It seems strange that we would get an almost 2x improvement just from faster VInt reading?

@jpountz (Contributor) left a comment:

This looks pretty cool! I left a few questions, nothing major.

```java
KnnVectorsReader currKnnVectorsReader = reader;
if (reader instanceof PerFieldKnnVectorsFormat.FieldsReader candidateReader) {
  currKnnVectorsReader = candidateReader.getFieldReader(fieldInfo.name);
if (!noDeletes(liveDocs) || !(reader instanceof HnswGraphProvider)) {
```
@jpountz (Contributor):

It took me a couple of seconds to make sure I got the double negation of !noDeletes right. I wonder if this could be turned into a single negation?

@msokolov (Contributor, Author):

Hah, true - not sure where this came from; I will change it to hasDeletes. Q: is it truly necessary to check every bit of liveDocs? If we have non-null liveDocs, doesn't that imply one of the bits will be zero?

@jpountz (Contributor):

> Q: is it truly necessary to check every bit of liveDocs? If we have non-null liveDocs, doesn't that imply one of the bits will be zero?

I'd say yes. IMO null is only an optimization for the case when all docs are live.
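
A minimal sketch of the rename under discussion (illustrative, not the committed code) - the exhaustive bit scan reflects the point above that null liveDocs is only an optimization for the all-live case:

```java
import org.apache.lucene.util.Bits;

final class LiveDocsUtil {
  // Sketch of the hasDeletes helper mentioned above: null means all docs are
  // live, but a non-null Bits must be scanned, since null is only an optimization.
  static boolean hasDeletes(Bits liveDocs) {
    if (liveDocs == null) {
      return false; // all docs live
    }
    for (int i = 0; i < liveDocs.length(); i++) {
      if (!liveDocs.get(i)) {
        return true; // found at least one deleted doc
      }
    }
    return false;
  }
}
```

The condition then reads `if (hasDeletes(liveDocs) || !(reader instanceof HnswGraphProvider))`, with the double negation gone.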

```java
/**
 * Implementation of "recursive graph bisection", also called "bipartite graph partitioning" and
 * often abbreviated BP, an approach to doc ID assignment that aims at reducing the sum of the log
 * gap between consecutive neighbor node ids. See BPIndexReorderer in misc module.
```
@jpountz (Contributor):

It should be able to link to this class since both are in the misc module now?
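
For reference, the objective that javadoc describes can be written as: find the ordinal permutation $\pi$ minimizing

$$\sum_{(u,v) \in E} \log_2 \bigl(\lvert \pi(u) - \pi(v) \rvert\bigr),$$

where $E$ is the set of neighbor pairs in the graph.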

```java
vectorScalarMul(
    1 / (float) Math.sqrt(VectorUtil.dotProduct(centroid, centroid)), centroid);
}
;
```
@jpountz (Contributor):

looks like it could be removed?


```java
// Computing biases is typically a bottleneck, because each iteration needs to iterate over
// all postings to recompute biases, and the total number of postings is usually one order of
// magnitude or more than the number of docs. So we try to parallelize it.
```
@jpountz (Contributor):

This comment is probably not true for vectors.

@msokolov (Contributor, Author):

Yes, but something analogous is true (the cost is like N(docs) * D(dimension)) - I'll update the comment.

@jpountz (Contributor):

I had overlooked the number of dimensions, good point.
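
To spell out the analogy: where the postings-based reorderer's per-iteration bias cost scales with the total number of postings $P$, the vector variant scales roughly as

$$O(N_{\text{docs}} \cdot D)$$

with $D$ the vector dimension, so parallelizing the bias computation remains worthwhile.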

```java
float gain = maxLeftBias - minRightBias;
// This compares the gain of swapping the doc from the left side that is most attracted to the
// right and the doc from the right side that is most attracted to the left against the
// average vector length (/500) rather than zero.
```
@jpountz (Contributor):

Where does this 500 come from - is it a heuristic?

@msokolov (Contributor, Author):

Yes, I'll update the comment.

@msokolov (Contributor, Author) commented Jan 7, 2025

Thanks for the reviews. I agree the measurements are not well explained. I have other runs that show no or less impact on search times, unchanged index sizes, and sometimes a negative impact on indexing times -- I believe we expect indexing times to increase due to the additional work done when merging. But I don't think any of that should block merging this, since it doesn't impact any existing use case, and testing to get to the bottom of what's going on is laborious (it requires building large indexes), so it can be done over time.

@jpountz (Contributor) commented Jan 7, 2025

> I don't think any of that should block merging

Agreed, especially for the misc module.

@msokolov modified the milestones: 11.0.0, 10.2.0 (Jan 7, 2025)
@benwtrent (Member) commented:

@msokolov I agree, we shouldn't block merging as it doesn't adjust default behavior.

I was just surprised at the numbers and curious about the cause.

@msokolov merged commit 02a09f6 into apache:main on Jan 7, 2025 - 5 checks passed
@msokolov deleted the hnsw-bpreorder-merge-policy branch January 7, 2025 14:41