[HUDI-6504] [DOCS] Updated roadmap #9142
website/src/pages/roadmap.md
Outdated
| Release | Timeline |
|--------------------------------------------------------------------------|----------|
| [0.12.2](https://issues.apache.org/jira/projects/HUDI/versions/12352249) | Dec 2022 |
| [0.13.0](https://issues.apache.org/jira/projects/HUDI/versions/12352101) | Jan 2023 |
| [0.14.0](https://issues.apache.org/jira/projects/HUDI/versions/12352700) | July 2023 |
| [1.0.0](https://issues.apache.org/jira/projects/HUDI/versions/12351210) | TBD |
Let's make Oct the timeline for 1.x.
updated
website/src/pages/roadmap.md
Outdated
| Eager conflict detection for Optimistic Concurrency Control | 0.13.0 | [RFC-56](https:/apache/hudi/pull/6003), [HUDI-1575](https://issues.apache.org/jira/browse/HUDI-1575) |
| Index Function for Optimizing Query Performance | 0.13.0 | [RFC-63](https:/apache/hudi/pull/7235), [HUDI-512](https://issues.apache.org/jira/browse/HUDI-512) |
| Support for primary key-less table | 0.14.0 | [HUDI-4699](https://issues.apache.org/jira/browse/HUDI-4699) |
| Metaserver for all metadata | 1.0.0 | [Strawman design](https://docs.google.com/presentation/d/1QBgLw11TM2Qf1KUESofGrQDb63EuggNCpPaxc82Kldo/edit#slide=id.ged084e5bf6_0_278), [RFC-36](https:/apache/hudi/pull/4718), [HUDI-3345](https://issues.apache.org/jira/browse/HUDI-3345) |
move all of 0.14.0 to the beginning under each section
A major part of the metaserver is already implemented and released in 0.13.0. We should remove it.
fixed
fixed @yihua
website/src/pages/roadmap.md
Outdated
| Support of verification with multiple event_time fields | 1.0.0 onward | [RFC-59](https:/apache/hudi/pull/6382), [HUDI-4569](https://issues.apache.org/jira/browse/HUDI-4569) |
| New Hudi Table Spec API for Query Integrations | 1.0.0 | [RFC-64](https:/apache/hudi/pull/7080) |
| Snapshot view management | 1.0.0 | [RFC-61](https:/apache/hudi/pull/6576), [HUDI-4678](https://issues.apache.org/jira/browse/HUDI-4678) |
| Engine specific record representation for better performance | 1.0.0 | [RFC-46](https:/apache/hudi/blob/master/rfc/rfc-46/rfc-46.md), [HUDI-3217](https://issues.apache.org/jira/browse/HUDI-3217) |
Sorry, I only gave this title.
Maybe we can consider this as well:
"Optimizing performance w/ engine based record representation"
fixed
This is released in 0.13.0.
website/src/pages/roadmap.md
Outdated
| Redesign and optimization of record payload abstraction | 0.13.0 | [RFC-46](https:/apache/hudi/blob/master/rfc/rfc-46/rfc-46.md), [HUDI-3217](https://issues.apache.org/jira/browse/HUDI-3217) |
| Optimized storage layout for cloud object stores | 0.13.0 | [RFC-60](https:/apache/hudi/pull/5113), [HUDI-3625](https://issues.apache.org/jira/browse/HUDI-3625) |
| Support of verification with multiple event_time fields | 1.0.0 onward | [RFC-59](https:/apache/hudi/pull/6382), [HUDI-4569](https://issues.apache.org/jira/browse/HUDI-4569) |
| New Hudi Table Spec API for Query Integrations | 1.0.0 | [RFC-64](https:/apache/hudi/pull/7080) |
We landed this in 0.14.0: MOR snapshot reader to integrate with query engines without using Hadoop APIs (#9066). Is it worth mentioning?
website/src/pages/roadmap.md
Outdated
| Mutable, Transactional caching for Hudi Tables (could be accelerated based on community feedback) | 1.0.0 onward | [Strawman design](https://docs.google.com/presentation/d/1QBgLw11TM2Qf1KUESofGrQDb63EuggNCpPaxc82Kldo/edit#slide=id.gf7e0551254_0_5) |
| Hudi Reverse streamer | 1.0.0 onward | [RFC-70](https:/apache/hudi/pull/9040) |
We also discussed including perf improvements for Spark SQL writes as a line item. Can we add that in?
It's under execution engine. Should I move it elsewhere?
Yes, under execution engine.
website/src/pages/roadmap.md
Outdated
| Supervised table service planning and execution | 1.0.0 | [RFC-43](https:/apache/hudi/pull/4309), [HUDI-4147](https://issues.apache.org/jira/browse/HUDI-4147) |
| Record-level index to speed up UUID-based upserts and deletes | 0.14.0 | [RFC-08](https://cwiki.apache.org/confluence/display/HUDI/RFC-08++Record+level+indexing+mechanisms+for+Hudi+datasets), [HUDI-53](https://issues.apache.org/jira/browse/HUDI-53) |
| Secondary index to improve query performance | 1.0.0 | [RFC-52](https:/apache/hudi/pull/5370), [HUDI-3907](https://issues.apache.org/jira/browse/HUDI-3907) |
| Index Function for Optimizing Query Performance | 1.0.0 | [RFC-63](https:/apache/hudi/pull/7235), [HUDI-512](https://issues.apache.org/jira/browse/HUDI-512) |
| Lock-Free message queue to improve writing efficiency | 0.13.0 | [RFC-53](https:/apache/hudi/blob/master/rfc/rfc-53/rfc-53.md), [HUDI-3963](https://issues.apache.org/jira/browse/HUDI-3963) |
This is done in 0.13.0.
website/src/pages/roadmap.md
Outdated
| Lock free concurrency control | 1.0.0 onward | [HUDI-3187](https://issues.apache.org/jira/browse/HUDI-3187) |
| Non-blocking/Lock-free updates during clustering | 1.0.0 onward | [HUDI-1042](https://issues.apache.org/jira/browse/HUDI-1042) |
| Time Travel updates, deletes | 1.0.0 onward | |
| General purpose support for multi-table transactions | 1.0.0 onward | |
| Lockless multi-writer support | 1.0.0 | [RFC-66](https:/apache/hudi/pull/7907) |
| A more effective HoodieMergeHandler for COW table with parque | 1.0.0 | [RFC-68](https:/apache/hudi/blob/f1afb1bf04abdc94a26d61dc302f36ec2bbeb15b/rfc/rfc-68/rfc-68.md) |
| A more effective HoodieMergeHandler for COW table with parque | 1.0.0 | [RFC-68](https:/apache/hudi/blob/f1afb1bf04abdc94a26d61dc302f36ec2bbeb15b/rfc/rfc-68/rfc-68.md) |
| A more effective HoodieMergeHandler for COW table with parquet | 1.0.0 | [RFC-68](https:/apache/hudi/blob/f1afb1bf04abdc94a26d61dc302f36ec2bbeb15b/rfc/rfc-68/rfc-68.md) |
website/src/pages/roadmap.md
Outdated
| Support for Spark 3.4.0 | 0.14.0 | [HUDI-6198](https://issues.apache.org/jira/browse/HUDI-6198) |
| Integrate column stats index with all query engines | 0.14.0 | [RFC-58](https:/apache/hudi/pull/6345), [HUDI-4552](https://issues.apache.org/jira/browse/HUDI-4552) |
| Upgrade to Spark 3 as the default profile | 0.14.0 | [HUDI-3431](https://issues.apache.org/jira/browse/HUDI-3431) |
| Performance for Spark-SQL writes | 0.14.0 | [HUDI-6315](https://issues.apache.org/jira/browse/HUDI-6315), [HUDI-6376](https://issues.apache.org/jira/browse/HUDI-6376) |
| Spark datasource V2 read | 1.0.0 onward | [HUDI-4449](https://issues.apache.org/jira/browse/HUDI-4449) |
| Materialized Views with incremental updates using Flink | 1.0.0 onward | |
| SQL DML support for Presto/Trino connectors (could be accelerated based on community feedback) | 1.0.0 onward | |
We don't plan to do this (SQL DML support in Presto/Trino) in the near future. We should remove this entry.
website/src/pages/roadmap.md
Outdated
@@ -61,8 +64,10 @@ Next major release : [0.13.0](https://issues.apache.org/jira/projects/HUDI/versi

| Feature | Target Release | Tracking |
|-----------------------------------------------------------------------------------------------------|--------------|-----------------------------------------------------------------------------------------------------------------------------------------|
| Protobuf source for DeltaStreamer | 0.13.0 | [RFC-57](https:/apache/hudi/blob/master/rfc/rfc-57/rfc-57.md), [HUDI-4399](https://issues.apache.org/jira/browse/HUDI-4399) |
| Improving Hudi CLI features and usability | 0.13.0 | [HUDI-1388](https://issues.apache.org/jira/browse/HUDI-1388) |
| Improving Hudi CLI features and usability | 0.14.0 | [HUDI-1388](https://issues.apache.org/jira/browse/HUDI-1388) |
This is done in 0.13.0.
Updated roadmap based on Siva and Ethan's feedback.
cc @yihua: updated with items from RFC-69.
Can you add a Jira and link it in the title, please?
LGTM with a few edits
few minor comments
Change Logs
updated the roadmap for 0.14.0 and 1.x.
cc @nsivabalan @yihua @bhasudha