This repository has been archived by the owner on Mar 3, 2023. It is now read-only.

SQL DPSU demo notebook #17

Open
wants to merge 5 commits into
base: master
from

anujdesai510

No description provided.

" from urllib.request import urlretrieve\n",
" urlretrieve(url, local_file)\n",
"\n",
"root_url = subprocess.check_output(\"git rev-parse --show-toplevel\".split(\" \")).decode(\"utf-8\").strip()\n",
Contributor

We should remove this; it should no longer be needed. You can set root_url to "." (the current directory).
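
A minimal sketch of that change, assuming the notebook is run from the directory that holds its data. The helper name `download_if_missing` is hypothetical and not taken from the notebook; only `urlretrieve`, `url`, `local_file`, and `root_url` appear in the diff.

```python
import os
from urllib.request import urlretrieve

# Use the current directory instead of shelling out to
# `git rev-parse --show-toplevel`, so the notebook no longer
# depends on git or on being run inside a checkout.
root_url = "."


def download_if_missing(url, local_file):
    """Download url to local_file unless the file already exists.

    Hypothetical helper name, used here only for illustration.
    """
    if not os.path.exists(local_file):
        # Create the target directory first so urlretrieve can write to it.
        os.makedirs(os.path.dirname(local_file) or ".", exist_ok=True)
        urlretrieve(url, local_file)
    return local_file
```

Paths built with `os.path.join(root_url, ...)` then resolve relative to wherever the notebook is launched, so this only works if that matches the layout the notebook expects.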

Contributor

@eedeleon eedeleon left a comment

Nice notebook!

Contributor

@joshua-oss joshua-oss left a comment

Looks great; awesome to have the flag to control behavior!

I would also recommend showing len(result) and len(korolova_result), since those are the counts of distinct ngrams. That makes a good comparison with the earlier statement where you show len(exact), and should show the difference between the two methods even more clearly than sum(n) does. The difference should become more pronounced as max_contrib gets larger.
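
A rough sketch of the comparison, assuming each result is a list of `(ngram, n)` rows with no header row; the notebook's actual row layout may differ. The `summarize` helper and every number below are made up for illustration and are not output from the notebook.

```python
def summarize(rows):
    """Return the distinct-ngram count and the summed counts for a result.

    Assumes rows is an iterable of (ngram, n) pairs (an assumption about
    the result layout, not something confirmed by the notebook).
    """
    rows = list(rows)
    return {
        "distinct_ngrams": len(rows),            # what len(result) reports
        "total_count": sum(n for _, n in rows),  # what sum(n) reports
    }


# Hypothetical placeholder data, NOT real query results.
result = [("a b", 12), ("b c", 7), ("c d", 5)]
korolova_result = [("a b", 13)]

for name, rows in [("result", result), ("korolova_result", korolova_result)]:
    print(name, summarize(rows))
```

Printing both numbers side by side, next to len(exact), lets a reader see how many distinct ngrams each method releases as well as the totals.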

3 participants