
feat(sink): introduce file sink in PARQUET format #17311

Merged · 86 commits · Aug 22, 2024
Conversation

@wcy-fdu (Contributor) commented Jun 18, 2024

I hereby agree to the terms of the RisingWave Labs, Inc. Contributor License Agreement.

What's changed and what's your intention?

close #15431

This PR introduces a basic file sink, which allows RisingWave to sink data into S3 or other file systems (currently S3 and GCS) in Parquet format.
As sink decoupling is not enabled yet, we force a file write every time a checkpoint barrier arrives; that is, between two checkpoint barriers, parallelism files are written (one per parallel writer). To distinguish the written files, the current naming convention is epoch_executor_id.suffix.

After this PR is merged, we will introduce a file sink batching strategy according to specific requirements, that is, enable sink decoupling. In addition, more sink file types will be introduced.
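
As an illustration of the naming convention above, here is a minimal sketch; the helper name and parameter types are hypothetical, not the PR's actual code:

fn parquet_file_name(epoch: u64, executor_id: u64) -> String {
    // One file per parallel writer per checkpoint: <epoch>_<executor_id>.parquet
    format!("{}_{}.parquet", epoch, executor_id)
}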

Checklist

  • I have written necessary rustdoc comments
  • I have added necessary unit tests and integration tests
  • I have added test labels as necessary. See details.
  • I have added fuzzing tests or opened an issue to track them. (Optional, recommended for new SQL features; see Sqlsmith: Sql feature generation #7934.)
  • My PR contains breaking changes. (If it deprecates some features, please create a tracking issue to remove them in the future).
  • All checks passed in ./risedev check (or alias, ./risedev c)
  • My PR changes performance-critical code. (Please run macro/micro-benchmarks and show the results.)
  • My PR contains critical fixes that are necessary to be merged into the latest release. (Please check out the details)

Documentation

To be edited (after all comments are resolved)

  • My PR needs documentation updates. (Please use the Release note section below to summarize the impact on users)

Release note

If this PR includes changes that directly affect users or other significant modifications relevant to the community, kindly draft a release note to provide a concise summary of these changes. Please prioritize highlighting the impact these changes will have on users.

@wcy-fdu wcy-fdu requested a review from a team as a code owner June 18, 2024 08:33
@wcy-fdu wcy-fdu marked this pull request as draft June 18, 2024 08:33
@wcy-fdu wcy-fdu marked this pull request as ready for review July 3, 2024 10:19
@wcy-fdu wcy-fdu requested review from xxhZs, hzxa21 and wenym1 July 3, 2024 10:19
@hzxa21 (Collaborator) left a comment:


Also, is the TODO specified in the PR description done?

(Outdated review threads, all resolved: proto/connector_service.proto, src/connector/src/sink/encoder/mod.rs, src/connector/src/sink/formatter/mod.rs, src/connector/src/sink/mod.rs, src/connector/Cargo.toml, src/connector/src/sink/file_sink/mod.rs.)
Comment on lines 189 to 191:

let filters =
    chunk.visibility() & ops.iter().map(|op| *op == Op::Insert).collect::<Bitmap>();
// Mask out non-Insert (update/delete) rows via the visibility bitmap.
chunk.set_visibility(filters);
Collaborator:

Hmmm, this seems to implicitly convert a retractable stream chunk into an append-only stream chunk. Is it possible to receive a stream chunk with op != Op::Insert here? If yes, I feel we should raise an error asking the user to set force_append_only, and only apply this filtering when that option is set. If not, we should add an assertion here instead of the filter.

@wcy-fdu (Contributor Author) replied:

Currently, for the file sink, the user needs to explicitly set force_append_only=true.
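
A minimal sketch of the guard being discussed; the function and parameter names are hypothetical, not the PR's actual code:

fn check_append_only(input_append_only: bool, force_append_only: bool) -> Result<(), String> {
    // Reject retractable input unless the user explicitly opted in to
    // dropping UPDATE/DELETE rows.
    if !input_append_only && !force_append_only {
        return Err(
            "file sink only supports append-only mode; set force_append_only='true' \
             to silently drop non-Insert rows"
                .to_string(),
        );
    }
    Ok(())
}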

let mut schema_fields = HashMap::new();
rw_schema.fields.iter().for_each(|field| {
    let res = schema_fields.insert(&field.name, &field.data_type);
    // This assert is to make sure there is no duplicate field name in the schema.
    assert!(res.is_none());
});
Collaborator:

Should we ensure this in validate?

@wcy-fdu (Contributor Author) replied on Jul 29, 2024:

I think validate is only used to verify whether a connection can be established with the downstream sink; for example, for a file sink, it verifies whether a file can be written normally.
The rw_schema here is defined by the user's CREATE SINK statement and should be verified on the frontend to some extent, and I notice that other sinks also just take the schema directly from the param without doing any validation, so I'd like to keep the current implementation. Feel free to push back if you have stronger reasons.
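
For illustration, a connectivity-only validate along those lines might look like the following sketch (hypothetical, assuming the opendal Operator the file sink is built on; not the PR's actual code):

async fn validate(op: &opendal::Operator) -> anyhow::Result<()> {
    // Probe that the target path is writable; schema checks stay on the frontend.
    op.write("_rw_sink_probe", vec![b'o', b'k']).await?;
    op.delete("_rw_sink_probe").await?;
    Ok(())
}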

(Another outdated review thread on src/connector/src/sink/file_sink/mod.rs, resolved.)
@hzxa21 (Collaborator) commented Jul 4, 2024:

It seems that there are many conflicts. I suggest we resolve the conflicts first.

@wcy-fdu (Contributor Author) commented Jul 4, 2024:

Also, is the TODO specified in the PR description done?

Not yet; I will add CI/e2e tests after the parquet source is merged.

@tabVersion tabVersion self-requested a review July 8, 2024 06:11
@wcy-fdu (Contributor Author) commented Aug 15, 2024:

debug logging:

I think the CN exiting with code 139 is related to the newly added code:

                { S3, $crate::sink::file_sink::opendal_sink::FileSink<$crate::sink::file_sink::s3::S3Sink>},
                { Gcs, $crate::sink::file_sink::opendal_sink::FileSink<$crate::sink::file_sink::gcs::GcsSink>  },
                { Fs, $crate::sink::file_sink::opendal_sink::FileSink<FsSink>  },

If any one of these three sinks appears alone, there is no problem, but with two or more of them the 139 exit code appears.
I have reason to suspect that the problem is caused by the FileSink struct and its associated type OpendalSinkBackend, but I have no clue at the moment.

Maybe there is shared mutable state between the multiple sinks, or maybe it is an initialization problem.
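
For context, here is a minimal, self-contained sketch of the dispatch pattern involved (hypothetical and heavily simplified relative to RisingWave's dispatch_sink!): each match arm monomorphizes the body for one concrete sink type, so the expanded code grows with the number of variants.

struct S3Sink;
struct GcsSink;
struct FsSink;

enum SinkImpl {
    S3(S3Sink),
    Gcs(GcsSink),
    Fs(FsSink),
}

macro_rules! dispatch_sink {
    ($impl:expr, $sink:ident, $body:expr) => {
        match $impl {
            // The body is duplicated (and monomorphized) once per arm.
            SinkImpl::S3($sink) => $body,
            SinkImpl::Gcs($sink) => $body,
            SinkImpl::Fs($sink) => $body,
        }
    };
}

fn main() {
    let sink = SinkImpl::Gcs(GcsSink);
    let label = dispatch_sink!(sink, s, {
        let _ = s; // each arm sees `s` at its own concrete type
        "dispatched"
    });
    println!("{label}");
}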

@wcy-fdu (Contributor Author) commented Aug 21, 2024:

Many thanks to @wenym1; we finally resolved the mysterious 139 (segmentation fault) issue. Let me briefly summarize what we did and what we suspect.

The issue mainly occurred around the dispatch_sink! macro. Although this PR only adds three file sink types, I suspect the macro expansion has become quite large, putting it on the edge of a stack overflow. I suspect this because even when I individually boxed one or more fields of the newly added FileSink struct, the issue was not resolved, so it may not be due to FileSink itself.

Here, let's take a look at the key changes.

  • This code will generate a 139 segmentation fault:
dispatch_sink!(self.sink, sink, {
    let consume_log_stream = Self::execute_consume_log(
        *sink,
        log_reader,
        self.input_columns,
        self.sink_param,
        self.sink_writer_param,
        self.actor_context,
    )
    .instrument_await(format!("consume_log (sink_id {sink_id})"))
    .map_ok(|never| match never {}); // unify return type to `Message`

    select(consume_log_stream.into_stream(), write_log_stream).boxed()
})
  • This code will not generate a 139 segmentation fault:
let consume_log_stream_future = dispatch_sink!(self.sink, sink, {
    let consume_log_stream = Self::execute_consume_log(
        *sink,
        log_reader,
        self.input_columns,
        self.sink_param,
        self.sink_writer_param,
        self.actor_context,
    )
    .instrument_await(format!("consume_log (sink_id {sink_id})"))
    .map_ok(|never| match never {}); // unify return type to `Message`

    consume_log_stream.boxed()
});
select(consume_log_stream_future.into_stream(), write_log_stream)

As you can see, the difference essentially lies between select(A, B).boxed() and select(A.boxed(), B). Initially, I thought the performance impact of the two would be similar. However, since A is a macro expansion, one possible explanation is that the former generates n monomorphized selects, one per macro arm, while boxing inside the macro first results in only one select. As the content of the macro (n) grows larger, the former leads to a segmentation fault.
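
A minimal, self-contained sketch of the underlying effect (not the actual RisingWave code): an async block's state machine is sized by what it holds across await points, and boxing it first shrinks what any outer combinator must embed inline.

fn main() {
    let big = async {
        // 1 MiB held across an await point becomes part of the future's state.
        let buf = [0u8; 1 << 20];
        std::future::ready(()).await;
        buf.len()
    };
    // Any combinator built around `big` by value embeds its ~1 MiB state
    // inline, wherever that combinator is constructed (often the stack).
    println!("inline: {} bytes", std::mem::size_of_val(&big));

    // Boxing first moves the state to the heap; only a pointer-sized handle
    // remains for outer combinators like `select` to hold.
    let boxed = Box::pin(big);
    println!("boxed: {} bytes", std::mem::size_of_val(&boxed));
}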

Note that this is just a suspicion; despite it, I still don't fully understand:

  1. why the segmentation fault occurs during jdbc.load (perhaps the communication with JNI at that point causes a surge in memory usage)
  2. why a single file sink does not cause a segmentation fault, but two or more do. Is it really a coincidence that adding a second one hits the critical point?

Anyway, we found a solution and developed a methodology. The common causes of segmentation faults typically include:

  • Stack Overflow:
    Due to deep recursive calls or allocating large data structures on the stack.
  • Uninitialized Memory:
    Accessing uninitialized variables or memory, leading to reading invalid addresses.
  • Unsafe Code:
    Errors in dereferencing pointers or accessing invalid memory within unsafe blocks.
  • Data Races:
    Improper synchronization of shared data access in a multithreaded environment.
  • External Library Issues:
    Bugs in third-party libraries that cause memory access errors.
  • Circular References:
    Circular references in data structures may lead to memory leaks and stack overflow.

That said, if a Rust system encounters a 139 segmentation fault, it is likely because some allocation on the stack is too large. In such cases, wrapping potentially oversized fields in Box may be a way to address the issue.
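
A small, hypothetical illustration of that remedy (not code from this PR): boxing a large field keeps only a pointer inline.

#[allow(dead_code)]
struct Inline {
    // 1 MiB lives wherever an `Inline` value is placed (e.g., the stack).
    buf: [u8; 1 << 20],
}

#[allow(dead_code)]
struct Boxed {
    // Only a pointer lives inline; the 1 MiB goes to the heap.
    buf: Box<[u8; 1 << 20]>,
}

fn main() {
    println!("Inline: {} bytes", std::mem::size_of::<Inline>()); // 1048576
    println!("Boxed:  {} bytes", std::mem::size_of::<Boxed>()); // 8 on 64-bit targets
}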

@BugenZhao (Member) commented:

While if a rust system encounters a 139 segmentation fault, it is likely due to some allocations on the stack being too large.

You can attach a debugger to the process and it will show the reason for any exceptions (like segmentation fault). Typically it's stack overflow, and by checking the backtrace we can find where it happens.

@wcy-fdu (Contributor Author) commented Aug 22, 2024:

While if a rust system encounters a 139 segmentation fault, it is likely due to some allocations on the stack being too large.

You can attach a debugger to the process and it will show the reason for any exceptions (like segmentation fault). Typically it's stack overflow, and by checking the backtrace we can find where it happens.

Thank you for your explanation! Can we attach a debugger in our CI? This issue cannot be reproduced locally; it only happens in CI.

Development

Successfully merging this pull request may close these issues.

feat: support S3 sink
5 participants