Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

boost_phrase should split by sentence of various length if we want to catch synonyms #250

Open
alexgarel opened this issue Oct 11, 2024 · 0 comments

Comments

@alexgarel
Copy link
Member

while search?q=organically grown&boost_phrase=1 finds items with label en:organic (which has organically grown as synonym),
q=organically grown plants&boost_phrase=1 does not work.

This is because The PhraseBoostTransformer groups all the words in a phrase. If we want synonyms to be detected, we would need to group terms 2 by 2, 3 by 3, 4 by 4 etc. so that organically grown plants,
results in adding "organically grown" OR "grown plants" OR "organically grown plants"

See the deactivated test in test_search.py

@alexgarel alexgarel added the 🐛 Bug Something isn't working label Oct 11, 2024
@github-project-automation github-project-automation bot moved this to Backlog (ready for dev) in 🔎 Search-a-licious Oct 11, 2024
@teolemon teolemon removed the 🐛 Bug Something isn't working label Oct 19, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Backlog (ready for dev)
Development

No branches or pull requests

2 participants