Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat!: accept pd.DataFrame in session.createDataFrame #247

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Conversation

Thomzoy
Copy link

@Thomzoy Thomzoy commented Jan 17, 2025

This PR adds the possibility to create a DataFrame from a pd.DataFrame using session.createDataFrame, further mimicking pyspark's behaviour"

@Thomzoy Thomzoy requested a review from eakmanrq as a code owner January 17, 2025 13:31
@Thomzoy Thomzoy changed the title feat: accept pd.DataFrame in session.createDataFrame feat!: accept pd.DataFrame in session.createDataFrame Jan 17, 2025
sqlframe/base/session.py Outdated Show resolved Hide resolved
@eakmanrq
Copy link
Owner

Great addition @Thomzoy! Can you add a test for this showing the conversion from pandas to PySpark? Might be good to add one to this file and show the same behavior between SQLFrame and PySpark when getting pandas dataframes as an input: https://github.com/eakmanrq/sqlframe/blob/250d86ea56c0e7fa375f2a84daef0a646411292a/tests/integration/test_int_dataframe.py

@Thomzoy Thomzoy force-pushed the createDataFrame_from_pd_DataFrame branch from 2d1dee0 to 4e3abe4 Compare January 21, 2025 10:52
@Thomzoy
Copy link
Author

Thomzoy commented Jan 21, 2025

Testing is done !

@Thomzoy Thomzoy requested a review from eakmanrq January 21, 2025 14:31
@eakmanrq eakmanrq enabled auto-merge (squash) January 22, 2025 05:16
@eakmanrq
Copy link
Owner

@Thomzoy looks like the CI test failed

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants