Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Numeric-String Text Matching #62

Open
ganesh-morsu opened this issue Aug 22, 2023 · 1 comment
Open

Numeric-String Text Matching #62

ganesh-morsu opened this issue Aug 22, 2023 · 1 comment

Comments

@ganesh-morsu
Copy link

Hi

I am using the Text matching using Poly fuzz

import polyfuzz
model = polyfuzz.PolyFuzz()

model_fit=model.fit(["CIPLAR LA 40 TABLET", "CIPLAR LA 80 TABLET"])

model_fit.transform(['CIPLAR LA 40 TABLET'])

output :- {'TF-IDF':                   From                   To  Similarity
 0  CIPLAR LA 40 TABLET  CIPLAR LA 80 TABLET         1.0}

The matching is coming CIPLAR LA 80 TABLET but it should be CIPLAR LA 40 TABLET

It is not considering numeric, Do we have any option to not ignore numeric

@MaartenGr
Copy link
Owner

I would advise checking out the list of models you can choose from. More specifically, you can choose to use TF-IDF together with numeric values for updating its parameters for tokenizations and preprocessing.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants