-
Notifications
You must be signed in to change notification settings - Fork 49
SQL DPSU demo notebook #17
base: master
Are you sure you want to change the base?
Conversation
service/demo/DP_SQL_demo.ipynb
Outdated
" from urllib.request import urlretrieve\n", | ||
" urlretrieve(url, local_file)\n", | ||
"\n", | ||
"root_url = subprocess.check_output(\"git rev-parse --show-toplevel\".split(\" \")).decode(\"utf-8\").strip()\n", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We should remove this, it should no longer be needed. You can make root_url -> "." The current directory
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice notebook!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks great; awesome to have the flag to control behavior!
I would also recommend showing len(result) and len(korolova_result), since those will be the count of distinct ngrams. That will be a good comparison with the statement earlier where you show len(exact), and should show the difference between the two methods even more clearly than the sum(n). The difference should be more pronounced as max_contrib gets larger.
No description provided.