This repository has been archived by the owner on Oct 30, 2024. It is now read-only.

fix: do not min/max-normalize already normalized similarity scores to not distort them #107

Merged on Sep 6, 2024 (1 commit)


iwilltry42
Collaborator

In our default flows, we use a merging retriever that combines similarity search and BM25 search.
The similarityScores from the similarity search are already normalized to the 0-1 range, while the BM25 scores are not.
Before this change, we still applied min/max normalization to the observed similarityScores. That stretched them across the full 0-1 range and distorted them so much that it significantly affected the merged document score.
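
To illustrate the distortion, here is a minimal Python sketch, not the repository's actual code. The function name `min_max_normalize` and all score values are hypothetical and chosen only for illustration. It shows how min/max normalization spreads a set of close, already normalized similarity scores over the whole 0-1 range, while the same rescaling is useful for unbounded BM25 scores.

```python
def min_max_normalize(scores):
    """Rescale scores linearly so the smallest maps to 0.0 and the largest to 1.0."""
    if not scores:
        return []
    lo, hi = min(scores), max(scores)
    if hi == lo:
        # Degenerate case: all scores are equal. Returning 0.0 is an arbitrary
        # choice made for this sketch.
        return [0.0 for _ in scores]
    return [(s - lo) / (hi - lo) for s in scores]


# Hypothetical similarity scores, already in the 0-1 range and close together.
similarity_scores = [0.875, 0.8125, 0.75]
# Hypothetical, unbounded BM25 scores.
bm25_scores = [12.0, 7.0, 2.0]

# Old behavior: a spread of only 0.125 is stretched across the full range,
# so a document with similarity 0.75 ends up with a score of 0.0.
distorted = min_max_normalize(similarity_scores)

# New behavior: only the unnormalized BM25 scores are rescaled;
# the similarity scores are used as they are.
bm25_normalized = min_max_normalize(bm25_scores)

print(distorted)        # [1.0, 0.5, 0.0]
print(bm25_normalized)  # [1.0, 0.5, 0.0]
```

In this example a document that matched with a similarity of 0.75 would contribute 0.0 to a merged score after min/max normalization, even though it is nearly as relevant as the top hit.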

In the future, we may consider using zScore normalization instead (#106).
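
For reference, a minimal Python sketch of textbook z-score normalization follows. It is an assumption about what such a change could look like, not what #106 specifies; the function name `z_score_normalize`, the use of the population standard deviation, and the sample values are all hypothetical.

```python
import statistics


def z_score_normalize(scores):
    """Express each score as its distance from the mean in standard deviations."""
    if not scores:
        return []
    mean = statistics.mean(scores)
    # Population standard deviation; whether to use this or the sample
    # standard deviation is an open choice in this sketch.
    stdev = statistics.pstdev(scores)
    if stdev == 0:
        # All scores are equal, so none deviates from the mean.
        return [0.0 for _ in scores]
    return [(s - mean) / stdev for s in scores]


# Hypothetical BM25 scores.
z = z_score_normalize([2.0, 4.0, 6.0])
print(z)
```

Unlike min/max normalization, the result is not confined to the 0-1 range: values are centered on 0 and can be negative, so a merge step would have to account for that.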

@iwilltry42 iwilltry42 merged commit 8d35da0 into main Sep 6, 2024
1 check passed
@iwilltry42 iwilltry42 deleted the fix/score-normalization branch September 6, 2024 12:25