Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

consider a more sklearn like, pipeline approach #1207

Open
fgregg opened this issue Sep 22, 2024 · 1 comment
Open

consider a more sklearn like, pipeline approach #1207

fgregg opened this issue Sep 22, 2024 · 1 comment

Comments

@fgregg
Copy link
Contributor

fgregg commented Sep 22, 2024

  1. break out all the active learning bits into a separate class or multiple separate classes

  2. train a blocking model, using the familiar fit_transform syntax. this is a separate class that emits a stream of pairs. (is this something that could really fit into the sklearn pattern)

  3. train a classification model using fit_transform., this takes in a stream of pairs and emits a stream of classification decisions

actually, this all would work quite well.

https://scikit-learn.org/stable/modules/generated/sklearn.pipeline.Pipeline.html

@fgregg
Copy link
Contributor Author

fgregg commented Sep 22, 2024

we can think of blocking as related to clustering, and use that as inspo.

https://scikit-learn.org/stable/modules/generated/sklearn.cluster.KMeans.html#sklearn.cluster.KMeans

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant