Use smart_open for jsonl loading #119

Merged: 2 commits merged into main from smart_open on Jan 9, 2025

Conversation

ryokan0123 (Contributor) commented Jan 9, 2025

With this change, Jsonl*Dataset can read compressed files such as .gz and .zst.
Compressed files are rare for evaluation data, but they come up when the module is reused for data loading in other contexts.

Since this isn't a primary use case and feels more like a hidden feature, I’m not adding tests for it at the moment.
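
For readers unfamiliar with the approach, here is a minimal sketch of extension-based transparent decompression when reading JSONL. It is not the code from this PR: `iter_jsonl`, `open_any`, and `demo` are hypothetical names, and the stdlib fallback exists only so the sketch runs when smart_open is not installed. Depending on the smart_open version, .zst may need an additional dependency.

```python
import gzip
import json
import tempfile
from pathlib import Path
from typing import Any, Iterator

try:
    # smart_open.open infers the decompressor from the file extension.
    from smart_open import open as open_any
except ImportError:
    # Assumption: fallback for environments without smart_open.
    # It covers plain text and .gz only.
    def open_any(path, mode="r", encoding="utf-8"):
        if str(path).endswith(".gz"):
            return gzip.open(path, mode + "t", encoding=encoding)
        return open(path, mode, encoding=encoding)


def iter_jsonl(path: str) -> Iterator[dict[str, Any]]:
    """Yield one parsed object per non-empty line (hypothetical helper)."""
    with open_any(path, "r", encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if line:
                yield json.loads(line)


def demo() -> list[dict[str, Any]]:
    """Write a small gzip-compressed JSONL file and read it back."""
    with tempfile.TemporaryDirectory() as tmp:
        path = str(Path(tmp) / "sample.jsonl.gz")
        with gzip.open(path, "wt", encoding="utf-8") as f:
            f.write('{"text": "a"}\n{"text": "b"}\n')
        return list(iter_jsonl(path))
```

The caller passes a path and never branches on the compression format, which is what lets a loader accept `.jsonl` and `.jsonl.gz` through the same code path.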

ryokan0123 changed the title Use smart_open for jsonl loading Use smart_open for jsonl loading Jan 9, 2025
ryokan0123 requested a review from a team January 9, 2025 06:14
ryokan0123 merged commit 77d6a07 into main Jan 9, 2025
7 checks passed
ryokan0123 deleted the smart_open branch January 9, 2025 06:52
2 participants