Skip to content

Commit

Permalink
Merge branch 'grouping' of github.com:ArturoAmorQ/scikit-learn-mooc i…
Browse files Browse the repository at this point in the history
…nto grouping
  • Loading branch information
ArturoAmorQ committed Apr 29, 2024
2 parents 5db388f + af0b33f commit 2a06b85
Showing 1 changed file with 3 additions and 3 deletions.
6 changes: 3 additions & 3 deletions python_scripts/cross_validation_grouping.py
Original file line number Diff line number Diff line change
Expand Up @@ -94,7 +94,7 @@
_ = plt.title("Distribution of the test scores")

# %% [markdown]
# Shuffling the data results in higher cross-validated test accuracy with less
# Shuffling the data results in a higher cross-validated test accuracy with less
# variance compared to when the data is not shuffled. It means that some
# specific fold leads to a low score in this case.

Expand All @@ -104,7 +104,7 @@
# %% [markdown]
# Thus, shuffling the data breaks the underlying structure and thus makes the
# classification task easier to our model. To get a better understanding, we can
# read the dataset description:
# read the dataset description in more detail:

# %%
print(digits.DESCR)
Expand Down Expand Up @@ -188,7 +188,7 @@
# %% [markdown]
# Once we group the digits by writer, we can incorporate this information into
# the cross-validation process by using group-aware variations of the strategies
# we have explored in this course, for example, the class `GroupKFold`.
# we have explored in this course, for example, the `GroupKFold` strategy.

# %%
from sklearn.model_selection import GroupKFold
Expand Down

0 comments on commit 2a06b85

Please sign in to comment.