[Structured Streaming] Exactly-once guarantee with ILM/Rollover #1386

Open
1 task done
danielyahn opened this issue Nov 7, 2019 · 2 comments

Comments

@danielyahn

What kind of issue is this?

  • Feature Request.

Feature description

When you use ILM (index lifecycle management) with its rollover API, your ingest job needs to point at the write alias. ILM, especially with its ability to roll over by size, has real operational benefits.

However, it's not possible to get an exactly-once guarantee when writing through a write alias. The current checkpoint implementation for the ES sink doesn't capture the actual index that the write alias points to.

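Therefore, if you replay some batches (whether one batch that failed halfway or multiple batches for any operational reason), you can't guarantee that your records go to the same index. If the alias rolls over between the original write and the replay, the same record can end up in both the old and the new index.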
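
To make the failure mode concrete, here is a minimal simulation in plain Python. It does not use Elasticsearch or elasticsearch-hadoop; `FakeCluster`, `write_batch`, the alias name, and the index names are all hypothetical stand-ins invented for illustration. It assumes documents carry deterministic IDs, so that replaying a batch into the same index would overwrite existing documents rather than duplicate them.

```python
class FakeCluster:
    """Toy stand-in for a cluster with one write alias (hypothetical, illustration only)."""

    def __init__(self, alias, first_index):
        self.alias = alias
        self.write_index = first_index        # index the alias currently points to
        self.indices = {first_index: {}}      # index name -> {doc_id: doc}

    def rollover(self, new_index):
        # Mimics a rollover: the alias now targets a fresh, empty index.
        self.indices[new_index] = {}
        self.write_index = new_index

    def index_via_alias(self, doc_id, doc):
        # The alias is resolved at write time; the same ID in the same index overwrites.
        self.indices[self.write_index][doc_id] = doc

    def count(self, doc_id):
        # Number of indices that hold a copy of doc_id.
        return sum(1 for docs in self.indices.values() if doc_id in docs)


def write_batch(cluster, batch, fail_after=None):
    """Write a batch; stop after `fail_after` records to mimic a mid-batch failure."""
    for i, (doc_id, doc) in enumerate(batch):
        if fail_after is not None and i >= fail_after:
            return False
        cluster.index_via_alias(doc_id, doc)
    return True


cluster = FakeCluster("logs-write", "logs-000001")
batch = [("batch7-0", {"v": 0}), ("batch7-1", {"v": 1}), ("batch7-2", {"v": 2})]

write_batch(cluster, batch, fail_after=2)  # batch fails halfway, two docs already written
cluster.rollover("logs-000002")            # rollover happens before the retry
write_batch(cluster, batch)                # replay of the whole batch goes to the new index

duplicates = [doc_id for doc_id, _ in batch if cluster.count(doc_id) > 1]
print(duplicates)  # ['batch7-0', 'batch7-1']
```

In this sketch the deterministic IDs do not help, because the replay resolves the alias to a different index. If the checkpoint also recorded the concrete index a batch was originally written to, the replay could target `logs-000001` directly and the overwrite-by-ID behavior would make it idempotent.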

@jbaiera jbaiera added the discuss label Nov 8, 2019
@jbaiera
Member

jbaiera commented Nov 8, 2019

I'm marking this as discuss for the team since this is a general problem across multiple streaming write workloads that are using ILM for managing indices.

@jakelandis
Contributor

Related: elastic/elasticsearch#44794

@jbaiera jbaiera added the :Core label Jun 29, 2023