Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

aws_glue : CfnCrawler missing data source HudiTarget both in CFN and CDK. HudiTarget only available in GUI #31133

Open
1 of 2 tasks
lorenzo-necto opened this issue Aug 17, 2024 · 1 comment
Labels
@aws-cdk/aws-glue Related to AWS Glue effort/small Small work item – less than a day of effort feature-request A feature should be added or improved. needs-cfn This issue is waiting on changes to CloudFormation before it can be addressed. p2

Comments

@lorenzo-necto
Copy link

Describe the feature

Hello
If I set up my crawler from GUI console I can choose Hudi S3 Table as a data source to be crawled. This is not yet possible neither on CFN or CDK. CDK already has iceberg and delta lake as sources for the crawler but no Hudi : https://docs.aws.amazon.com/cdk/api/v2/docs/aws-cdk-lib.aws_glue.CfnCrawler.html

Use Case

I would like to use the serde Hudi and not parquet when crawling my hudi S3 folders, although available via GUI it is still not present in CDK or CFN

Proposed Solution

P1. Add at least to CloudFormation HudiTarget to complete the data lake frameworks formats (given that Iceberg and Delta Target are present in CFN)

P2. Add to CDK the target type

Other Information

No response

Acknowledgements

  • I may be able to implement this feature request
  • This feature might incur a breaking change

CDK version used

"aws-cdk-lib": "^2.115.0",

Environment details (OS name and version, etc.)

MACBookPro M1

@lorenzo-necto lorenzo-necto added feature-request A feature should be added or improved. needs-triage This issue or PR still needs to be triaged. labels Aug 17, 2024
@github-actions github-actions bot added the @aws-cdk/aws-glue Related to AWS Glue label Aug 17, 2024
@ashishdhingra ashishdhingra self-assigned this Aug 19, 2024
@ashishdhingra ashishdhingra added p2 investigating This issue is being investigated and/or work is in progress to resolve the issue. and removed needs-triage This issue or PR still needs to be triaged. labels Aug 19, 2024
@pahud
Copy link
Contributor

pahud commented Aug 19, 2024

Add at least to CloudFormation HudiTarget to complete the data lake frameworks formats (given that Iceberg and Delta Target are present in CFN)

Looks like it's still missing in CloudFormation? Pleae submit the feature request to cloudformation-coverage-roadmap to help the CFN team prioritize this support. I am making this a p2 feat request pending on CFN support. As soon as we have that support, CDK should be able to support that immediately.

@ashishdhingra ashishdhingra added effort/small Small work item – less than a day of effort needs-cfn This issue is waiting on changes to CloudFormation before it can be addressed. and removed investigating This issue is being investigated and/or work is in progress to resolve the issue. labels Aug 19, 2024
@ashishdhingra ashishdhingra removed their assignment Aug 19, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
@aws-cdk/aws-glue Related to AWS Glue effort/small Small work item – less than a day of effort feature-request A feature should be added or improved. needs-cfn This issue is waiting on changes to CloudFormation before it can be addressed. p2
Projects
None yet
Development

No branches or pull requests

3 participants