Skip to content

Commit

Permalink
Added some baselines
Browse files Browse the repository at this point in the history
  • Loading branch information
Pringled committed Oct 11, 2024
1 parent c3a5143 commit ada8869
Showing 1 changed file with 11 additions and 2 deletions.
13 changes: 11 additions & 2 deletions tutorials/semantic_deduplication.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -71,7 +71,7 @@
" deduplicated_text_indices.append(i)\n",
" seen.add(text)\n",
"\n",
"len(deduplicated_text_indices)"
"print(\"Number of deduplicated docs:\", len(deduplicated_text_indices))"
]
},
{
Expand Down Expand Up @@ -260,7 +260,16 @@
"cell_type": "code",
"execution_count": 82,
"metadata": {},
"outputs": [],
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"Number of deduplicated docs: 119128\n",
"Time taken: 42.20003666699631\n"
]
}
],
"source": [
"wl = WordLlama.load()\n",
"\n",
Expand Down

0 comments on commit ada8869

Please sign in to comment.