Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Allow dataset preprocessing to automatically handle (ignore) mismatched files. #8

Closed
harrykeightley opened this issue Sep 19, 2023 · 1 comment
Assignees
Labels
enhancement New feature or request

Comments

@harrykeightley
Copy link
Contributor

When running on the timit dataset, I had a lot of file copies in the dataset without matched audio/transcriptions.
I would expect these to be handled by the library, but it was not... and failed.

I think we should move the handling of file mis-matches to the library, to free that up from the frontend. This would also allow us to not store all the dataset files in the browser itself.

@harrykeightley harrykeightley added the enhancement New feature or request label Sep 19, 2023
@harrykeightley harrykeightley self-assigned this Sep 19, 2023
@harrykeightley
Copy link
Contributor Author

Closed in #10

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant