Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Data loading and handling #11

Open
Sophon96 opened this issue Sep 21, 2023 · 2 comments
Open

Data loading and handling #11

Sophon96 opened this issue Sep 21, 2023 · 2 comments
Labels

Comments

@Sophon96
Copy link
Member

We need some way to load data formats, such as CSV, Apache Parquet, HDF. There's libraries for these in C++, but I'm not sure how well they integrate. I don't know if there's a ubiquitous data science data library like pandas for C++, which might be preferable to use.

@adamhutchings
Copy link
Contributor

Even if we write our own, I think all functions of this sort that are not directly related to machine learning should go in a top-level folder separate from core/. Any good name ideas or thoughts?

@Sophon96
Copy link
Member Author

I think we shouldn't roll our own. Apache Arrow looks like a pretty cool library. There's also this repo I found https://github.com/hosseinmoein/DataFrame/

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants