-
Notifications
You must be signed in to change notification settings - Fork 9.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
EMR spot/on demand instance group creation fails after 10 minutes #14093
Comments
I had a similar issue when I used nat instances instead of nat GWs in my legacy VPCs. The point is that EMR nodes are mutable. All necessary components will be installed on it after the launch. Unfortunately, I can't say for sure this is the cause of your issue. But perhaps this will help someone. |
Thanks @admssa. I do have VPC with nat GW & S3 endpoint as well setup. However, it's still failing. In any case the time out is too low if I see the EMR cluster timeout which is set up to 75 minutes, that's why I raised a pull request with 30 Minutes of timeout (I have seen some of my instances gets up in 20 min in worst condition ) |
Yeah, saw it and added+1. Unfortunately, this won't solve the issue with autoscaling which is the only advantage of instance groups. (IG may stuck in resizing state b/c of timeout on aws side). |
The timeout increases have been merged and will release with version 3.4.0 of the Terraform AWS Provider, likely later today. Thanks to @pasalkarsachin1 for the implementation. 👍 |
This has been released in version 3.4.0 of the Terraform AWS provider. Please see the Terraform documentation on provider versioning or reach out if you need any assistance upgrading. For further feature requests or bug reports with this functionality, please create a new GitHub issue following the template for triage. Thanks! |
I'm going to lock this issue because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active issues. If you feel this issue should be reopened, we encourage creating a new issue linking back to this one for added context. Thanks! |
My Terraform is failing as
aws_emr_instance_group
is not able to get my spot/on demand instance inRUNNING
state before 10 minutes, causing the Terraform job to fail. Every time I re-execute the job it creates a new instance group with same name but fails again. This creates un-necessary groups as the old group came up after sometime but as TF was failed it didn't updated the group in state file. It has hardcoded wait time on 10 minutesTerraform Version
0.12.20
hashicorp/aws 2.69.0
Affected Resource(s)
Terraform Configuration Files
Debug Output
Panic Output
Expected Behavior
Atleast Instance creation wait time should be configurable, so my terraform will not fail.
Actual Behavior
error waiting for EMR Instance Group (ig-XXXXXX) creation: timeout while waiting for state to become 'RUNNING' (last state: 'RESIZING', timeout: 10m0s)
Steps to Reproduce
terraform apply
Important Factoids
References
The text was updated successfully, but these errors were encountered: