Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Spelling variations #17

Closed
joeflack4 opened this issue Jul 28, 2022 · 2 comments · Fixed by #20
Closed

Spelling variations #17

joeflack4 opened this issue Jul 28, 2022 · 2 comments · Fixed by #20
Assignees

Comments

@joeflack4
Copy link
Member

joeflack4 commented Jul 28, 2022

Summary

Need to utilize new spelling_variations column in the Google Sheet.

Additional info

At some point, we should probably refactor to use the NLP library that Stephanie recommended.

I don't think it's important to break down the results by which spelling variations were used. Can just aggregate all the results for all variations by and label it by what's in the keyword column.

@stephanieshong
Copy link

one word or two words are supported

@joeflack4 joeflack4 assigned joeflack4 and unassigned rohaher Jul 28, 2022
@joeflack4
Copy link
Member Author

I'm working on this now but my current understanding is that we will / should not use nltk for the spelling variations part of things. The reason is that we basically have 2 options starting out when it comes to getting raw text from zulip:
a. Get all the data from Zulip, then parse it.
b. Use Zulip's endpoints and query features to query certain keywords, on certain streams, etc.

We've went with (b). I think this is actually the better option simply because Zulip has a rate limit for how much we can query.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
3 participants