Commit
Add British Library books dataset (#3603)
* loading script draft
* improve config naming
* move parsing code into function
* fix type hints
* fix default config name
* fix typo

Co-authored-by: Quentin Lhoest <[email protected]>

* add header

Co-authored-by: Quentin Lhoest <[email protected]>

* remove readlines call

Co-authored-by: Quentin Lhoest <[email protected]>

* update copyright date
* add citation to README
* update citation key
* update citation key
* add contact details
* add URLs to configs
* add url
* black formatting
* add config options to readme
* generate dataset_infos
* add dummy data
* fix tags
* Update datasets/blbooks/README.md

Co-authored-by: Quentin Lhoest <[email protected]>

Co-authored-by: Quentin Lhoest <[email protected]>
4c417d5
Show benchmarks
PyArrow==3.0.0
Show updated benchmarks!
Benchmark: benchmark_array_xd.json
Benchmark: benchmark_getitem_100B.json
Benchmark: benchmark_indices_mapping.json
Benchmark: benchmark_iterating.json
Benchmark: benchmark_map_filter.json