feat: support partition writer #153

ZENOTME · 2023-08-17T10:29:31Z

No description provided.

liurenjie1024

Generally LGTM, good job!

icelake/src/io/task_writer.rs

liurenjie1024 · 2023-08-18T03:17:07Z

icelake/src/io/task_writer.rs

+                .iter()
+                .map(|field| {
+                    let array: ArrayRef = batch
+                        .column_by_name(&field.name)


We don't need to search by name for each partition, this should happen in initialization of writer.

I'm not quite understand. Seems we can't do in initialization. For every batch coming in, we need to extract its related column to compute according partition field. And it's not gurantee that batch comes in is always have the same column order so we need to search it by name. (I'm not sure whether the function name is missleading.

I think we should do it in initialization, and the column index should be found by schema. It's required that the record batch's schema should match table schema, otherwise the parquet file's schema doesn't match table schema.

Oh, I see. Yes we should do it.

icelake/src/io/task_writer.rs

tests/integration/rust/src/append.rs

liurenjie1024

LGTM, Thanks

ZENOTME · 2023-08-18T07:25:29Z

We need to let all test case to use a same docker env.

support partition writer

bcf08f7

ZENOTME requested a review from liurenjie1024 August 17, 2023 10:29

add test for partition table

1fca978

ZENOTME force-pushed the partition branch from 7bf9266 to 1fca978 Compare August 17, 2023 10:31

ZENOTME requested a review from Xuanwo August 17, 2023 10:32

ZENOTME mentioned this pull request Aug 17, 2023

Tracking: Support partition write #114

Closed

3 tasks

liurenjie1024 reviewed Aug 18, 2023

View reviewed changes

liurenjie1024 approved these changes Aug 18, 2023

View reviewed changes

ZENOTME force-pushed the partition branch from d4d3735 to a4c0cde Compare August 18, 2023 06:53

fix

a070d9d

ZENOTME force-pushed the partition branch from a4c0cde to a070d9d Compare August 18, 2023 07:13

liurenjie1024 merged commit 393d000 into icelake-io:main Aug 18, 2023
3 checks passed

ZENOTME deleted the partition branch August 18, 2023 11:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: support partition writer #153

feat: support partition writer #153

ZENOTME commented Aug 17, 2023

liurenjie1024 left a comment

liurenjie1024 Aug 18, 2023

ZENOTME Aug 18, 2023 •

edited

Loading

liurenjie1024 Aug 18, 2023

ZENOTME Aug 18, 2023

liurenjie1024 left a comment

ZENOTME commented Aug 18, 2023

feat: support partition writer #153

feat: support partition writer #153

Conversation

ZENOTME commented Aug 17, 2023

liurenjie1024 left a comment

Choose a reason for hiding this comment

liurenjie1024 Aug 18, 2023

Choose a reason for hiding this comment

ZENOTME Aug 18, 2023 • edited Loading

Choose a reason for hiding this comment

liurenjie1024 Aug 18, 2023

Choose a reason for hiding this comment

ZENOTME Aug 18, 2023

Choose a reason for hiding this comment

liurenjie1024 left a comment

Choose a reason for hiding this comment

ZENOTME commented Aug 18, 2023

ZENOTME Aug 18, 2023 •

edited

Loading