Mousquetaires import parsing issues #2098

Open
aleene opened this issue Jul 13, 2019 · 1 comment
Labels
Data import 🧽 Data quality https://wiki.openfoodfacts.org/Quality 🥗🔍 Ingredients analysis https://wiki.openfoodfacts.org/Ingredients_Extraction_and_Analysis 🥗 Ingredients

Comments

@aleene
Contributor

aleene commented Jul 13, 2019

What

Some parsing issues I saw in the Mousquetaires ingredient lists:

  • céleri - rave : is parsed as rave
  • Semoule de blé dur de qualité supérieure, précuite à la vapeur : the comma can be deleted
  • Lait pasteurisé à 1,1% de Mat. Gr. : the ingredient is broken after Mat.
  • Eau minérale naturelle de Luchon (96%) Sucre Acide citrique Arôme naturel Acidifiant : E330 Conservateurs : E202 E242 : missing commas
  • matière grasse végétale (palme) raffinée : raffinée becomes a separate ingredient
  • Huile de colza Huile de tournesol à haute teneur en acide oléique Huile de tournesol Huile de lin Vitamine D : missing commas
  • mono - et diglycérides d'acides gras d'origine végétale : is interpreted as Mono-diglycerides-d-acides-gras-d
  • lactosérum _ (lait) _ en poudre : en poudre becomes a separate ingredient
  • mono et di-glycérides d'acides gras *dont _Lait : is parsed as one ingredient
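
To make a few of these cases concrete, here is a minimal Python sketch of the kind of pre-processing that would avoid the spaced-hyphen and decimal-comma problems. It is only an illustration and not the Open Food Facts parser: the function names `normalize_hyphens` and `split_ingredients` are hypothetical, and it assumes that " - " inside a list is never a genuine separator, which will not hold for every product.

```python
import re


def normalize_hyphens(text: str) -> str:
    """Repair spaced hyphens (hypothetical helper, not the real parser)."""
    # Suspended hyphen: 'mono - et diglycérides' -> 'mono- et diglycérides'
    text = re.sub(r"(\w) - (et|ou)\b", r"\1- \2", text)
    # Compound word: 'céleri - rave' -> 'céleri-rave'
    text = re.sub(r"(\w) - (\w)", r"\1-\2", text)
    return text


def split_ingredients(text: str) -> list[str]:
    """Split on commas, except decimal commas such as the one in '1,1%'."""
    text = normalize_hyphens(text)
    # A comma is a separator unless it has a digit on both sides.
    # Periods are never separators here, so 'Mat. Gr.' stays in one piece.
    parts = re.split(r",(?!\d)|(?<!\d),", text)
    return [part.strip() for part in parts if part.strip()]


if __name__ == "__main__":
    for sample in (
        "céleri - rave, sel",
        "Lait pasteurisé à 1,1% de Mat. Gr., sel",
        "mono - et diglycérides d'acides gras d'origine végétale",
    ):
        print(split_ingredients(sample))
```

The order of the two substitutions matters: the suspended-hyphen case is handled first so that "mono - et" is not glued into "mono-et" by the more general rule.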

Part of

@aleene aleene added the 🐛 bug This is a bug, not a feature request. label Jul 13, 2019
@teolemon teolemon changed the title Parsing issues mousquetaires céleri - rave, is parsed as rave (Mousquetaires import) Jul 13, 2019
@aleene aleene changed the title céleri - rave, is parsed as rave (Mousquetaires import) Mousquetaires import parsing issues Jul 13, 2019
@stephanegigandet
Contributor

céleri - rave has been fixed, and I am fixing a few of the others.
Missing commas probably won't be fixed, as it would be very difficult to get right without introducing errors.
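
As an illustration of why restoring missing commas is risky, the sketch below applies a deliberately naive heuristic (start a new ingredient at every capitalised word) to two of the lists from the report. The function `naive_split_on_capitals` is hypothetical and is not something the server does; it is only meant to show the kind of errors such a rule would introduce.

```python
def naive_split_on_capitals(text: str) -> list[str]:
    """Start a new ingredient at every word that begins with an uppercase letter.

    Hypothetical heuristic, shown only to demonstrate its failure modes.
    """
    parts: list[str] = []
    current: list[str] = []
    for word in text.split():
        if word[0].isupper() and current:
            parts.append(" ".join(current))
            current = []
        current.append(word)
    if current:
        parts.append(" ".join(current))
    return parts


if __name__ == "__main__":
    water = (
        "Eau minérale naturelle de Luchon (96%) Sucre Acide citrique "
        "Arôme naturel Acidifiant : E330 Conservateurs : E202 E242"
    )
    oils = (
        "Huile de colza Huile de tournesol à haute teneur en acide oléique "
        "Huile de tournesol Huile de lin Vitamine D"
    )
    print(naive_split_on_capitals(water))
    print(naive_split_on_capitals(oils))
```

On these inputs the heuristic separates "Luchon (96%)" from the water it belongs to, detaches "E330" from "Acidifiant :", and breaks "Vitamine D" into "Vitamine" and "D", so it would create new errors while fixing others.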

@teolemon teolemon added 🧽 Data quality https://wiki.openfoodfacts.org/Quality 🥗🔍 Ingredients analysis https://wiki.openfoodfacts.org/Ingredients_Extraction_and_Analysis labels Aug 24, 2023
@teolemon teolemon moved this to To discuss and validate in 🍊 Open Food Facts Server issues Apr 23, 2024
@teolemon teolemon removed the 🐛 bug This is a bug, not a feature request. label Oct 18, 2024
Projects
Status: To do
Status: To discuss and validate
Status: To do
Development

No branches or pull requests

3 participants