-
-
Notifications
You must be signed in to change notification settings - Fork 387
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix: improve ingredients extraction #8942
Conversation
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #8942 +/- ##
==========================================
+ Coverage 46.03% 46.07% +0.03%
==========================================
Files 64 64
Lines 19795 19824 +29
Branches 4791 4798 +7
==========================================
+ Hits 9113 9133 +20
- Misses 9496 9512 +16
+ Partials 1186 1179 -7 ☔ View full report in Codecov by Sentry. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you @benbenben2 , it looks great, but could you add a test for those in unit/ingredients.t so that we don't break it in the future?
Tests
@stephanegigandet, yes, that was my question (is there a way to test image ingredients list extraction from image). So, I created tests for the cut_ingredients_text_for_lang subroutine (related to picture extraction). They are in a new file called ingredients_extract.t. %ignore_phrases bug fix%ignore_phrases were NOT removed from the text. I assume that it was a bug. It is fixed now. Additionally, I reviewed and removed 'na|n/a|not applicable' from the list: I kept those for FR but text of the image is often unrelated to ingredients: I kept DE phrases as they seem pretty coherent: |
Kudos, SonarCloud Quality Gate passed! |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you!
Amazing @benbenben2 ! 💪 |
What
0850003875088
0099482485375
0856500004013, 0856500004037, 0029737700144
0029737700144
0029737210070
0073872746109
8006013990644
0008005958876
0008005985179
0008005958043
0008005959101
2252559300126
Question
Is there a way to test the image extraction locally?
Related issue(s) and discussion
Comment
Thanks @bredowmax to have collected so many examples. That is of great help.