Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Do not hardcode chunk size in send_sample_id_and_label #361

Open
MaxiBoether opened this issue Jan 11, 2024 · 0 comments
Open

Do not hardcode chunk size in send_sample_id_and_label #361

MaxiBoether opened this issue Jan 11, 2024 · 0 comments
Labels

Comments

@MaxiBoether
Copy link
Contributor

Currently, we hard code a chunk size of 1000000 in the storage send_sample_id_and_label function. We need to chunk the number of files since potentially the number of files we get as input is huge. However, we do not really have a good parameter in the configuration (I think) to reflect the number here. Should we add one? Can we use some other parameter like sample_dbinsertion_batchsize?

@MaxiBoether MaxiBoether changed the title Do not hardware chunk size in send_sample_id_and_label Do not hardcode chunk size in send_sample_id_and_label Apr 30, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant