We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Currently I am using a bag that reduces to a local data frame. See my SO question/answer https://stackoverflow.com/questions/64512040/how-to-aggregate-large-number-of-small-csv-files-50k-efficiently-code-size/64517641
With a partitioning strategy it should be possible to build a distributed data frame (needed if the data is not that heavily reduced)
The text was updated successfully, but these errors were encountered:
No branches or pull requests
Currently I am using a bag that reduces to a local data frame. See my SO question/answer https://stackoverflow.com/questions/64512040/how-to-aggregate-large-number-of-small-csv-files-50k-efficiently-code-size/64517641
With a partitioning strategy it should be possible to build a distributed data frame (needed if the data is not that heavily reduced)
The text was updated successfully, but these errors were encountered: