Apply default k for knn query eagerly #118774

benwtrent · 2024-12-16T15:05:26Z

When originally added, the knn query didn't apply top-k restrictions to the query. Instead it would allow the resulting num_candidate to be combined with sibling queries without restricting to top-size results ahead of time.

This honestly is confusing behavior and leads to some bugs in understand how it all works.

This commit addresses this by eagerly gathering only size results when k==null before combining with other queries.

To achieve the previous behavior, this can be done directly by setting k==num_candidates in the query.

elasticsearchmachine · 2024-12-16T15:05:51Z

Pinging @elastic/es-search-relevance (Team:Search Relevance)

elasticsearchmachine · 2024-12-16T15:05:51Z

Hi @benwtrent, I've created a changelog YAML for you.

john-wagster

LGTM

carlosdelest

Looks good!

I think we need to change the docs as well, to something like:

k
(Optional, integer) The number of nearest neighbors to return from each shard. Elasticsearch collects k results from each shard, then merges them to find the global top results. This value must be less than or equal to num_candidates. Defaults to the search request size.

…t-k-knn-query

elasticsearchmachine · 2025-01-07T20:41:47Z

💔 Backport failed

Status	Branch	Result
❌	8.x	Commit could not be cherrypicked due to conflicts

You can use sqren/backport to manually backport by running backport --upstream elastic/elasticsearch --pr 118774

benwtrent · 2025-01-07T21:05:34Z

💚 All backports created successfully

Status	Branch	Result
✅	8.x

Questions ?

Please refer to the Backport tool documentation

When originally added, the knn query didn't apply `top-k` restrictions to the query. Instead it would allow the resulting `num_candidate` to be combined with sibling queries without restricting to `top-size` results ahead of time. This honestly is confusing behavior and leads to some bugs in understand how it all works. This commit addresses this by eagerly gathering only `size` results when `k==null` before combining with other queries. To achieve the previous behavior, this can be done directly by setting `k==num_candidates` in the query. (cherry picked from commit c18b48d)

Apply default k for knn query eagerly

2ab40fd

benwtrent added >bug :Search Relevance/Vectors Vector search v8.18.0 labels Dec 16, 2024

elasticsearchmachine added Team:Search Relevance Meta label for the Search Relevance team in Elasticsearch v9.0.0 labels Dec 16, 2024

Update docs/changelog/118774.yaml

d3d1635

john-wagster approved these changes Dec 16, 2024

View reviewed changes

carlosdelest approved these changes Dec 16, 2024

View reviewed changes

mayya-sharipova approved these changes Dec 16, 2024

View reviewed changes

benwtrent added 3 commits December 16, 2024 18:34

fixing tests

e1c89fa

adjusting testing

b4a1450

Merge remote-tracking branch 'upstream/main' into bugfix/apply-defaul…

83be105

…t-k-knn-query

benwtrent added the auto-backport Automatically create backport pull requests when merged label Dec 17, 2024

fixing tests

3c85c94

benwtrent added the auto-merge-without-approval Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!) label Dec 17, 2024

benwtrent and others added 3 commits December 17, 2024 14:49

Merge branch 'main' into bugfix/apply-default-k-knn-query

cf97f83

restricting dynamic k to be less than num candidates

e0c0f86

Merge remote-tracking branch 'upstream/main' into bugfix/apply-defaul…

208fecc

…t-k-knn-query

benwtrent removed the auto-merge-without-approval Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!) label Dec 18, 2024

benwtrent added 2 commits January 6, 2025 07:32

Merge remote-tracking branch 'upstream/main' into bugfix/apply-defaul…

65aaae9

…t-k-knn-query

update docs

bb551a9

benwtrent added the auto-merge-without-approval Automatically merge pull request when CI checks pass (NB doesn't wait for reviews!) label Jan 6, 2025

benwtrent and others added 5 commits January 6, 2025 14:32

Merge branch 'main' into bugfix/apply-default-k-knn-query

8d5f14d

fixing tests

72f6f64

Merge branch 'main' into bugfix/apply-default-k-knn-query

b1674b6

Merge branch 'main' into bugfix/apply-default-k-knn-query

8d69eee

attempting to fix tests again

8de8b34

benwtrent and others added 2 commits January 7, 2025 12:45

fixing tests

4b3f148

Merge branch 'main' into bugfix/apply-default-k-knn-query

1d757dc

elasticsearchmachine merged commit c18b48d into elastic:main Jan 7, 2025
16 checks passed

benwtrent deleted the bugfix/apply-default-k-knn-query branch January 7, 2025 20:40

elasticsearchmachine added the backport pending label Jan 7, 2025

benwtrent mentioned this pull request Jan 7, 2025

[8.x] Apply default k for knn query eagerly (#118774) #119700

Merged

carlosdelest mentioned this pull request Jan 8, 2025

Vector rescoring - Simplify code for k == null #118997

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Apply default k for knn query eagerly #118774

Apply default k for knn query eagerly #118774

Uh oh!

benwtrent commented Dec 16, 2024

Uh oh!

elasticsearchmachine commented Dec 16, 2024

Uh oh!

elasticsearchmachine commented Dec 16, 2024

Uh oh!

john-wagster left a comment

Uh oh!

carlosdelest left a comment

Uh oh!

Uh oh!

elasticsearchmachine commented Jan 7, 2025

Uh oh!

benwtrent commented Jan 7, 2025

Uh oh!

Uh oh!

Apply default k for knn query eagerly #118774

Apply default k for knn query eagerly #118774

Uh oh!

Conversation

benwtrent commented Dec 16, 2024

Uh oh!

elasticsearchmachine commented Dec 16, 2024

Uh oh!

elasticsearchmachine commented Dec 16, 2024

Uh oh!

john-wagster left a comment

Choose a reason for hiding this comment

Uh oh!

carlosdelest left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

elasticsearchmachine commented Jan 7, 2025

💔 Backport failed

Uh oh!

benwtrent commented Jan 7, 2025

💚 All backports created successfully

Questions ?

Uh oh!

Uh oh!