Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] Docker login/logout in Jenkins cause gradle check to fail on Windows #3816

Closed
peterzhuamazon opened this issue Jul 26, 2023 · 10 comments
Closed
Assignees
Labels
bug Something isn't working cicd jenkins Jenkins related issue

Comments

@peterzhuamazon
Copy link
Member

The windows runner does not have docker installed by default due to various reasons mentioned in opensearch-project/opensearch-ci#281.

When we add docker login for gradle check rate limit in #3367 we only make it run on linux.

We need to ignore this for windows for now. Thanks.

@peterzhuamazon peterzhuamazon added bug Something isn't working cicd jenkins Jenkins related issue labels Jul 26, 2023
@peterzhuamazon peterzhuamazon self-assigned this Jul 26, 2023
@github-actions github-actions bot added the untriaged Issues that have not yet been triaged label Jul 26, 2023
@peterzhuamazon
Copy link
Member Author

@rishabh6788
Copy link
Collaborator

Waiting on opensearch-project/opensearch-ci#281 to be resolved.

@peterzhuamazon
Copy link
Member Author

Keep seeing this:


ERROR: Timed out after 1210 seconds of waiting for winrm to be connected
com.amazonaws.AmazonClientException: Timed out after 1210 seconds of waiting for winrm to be connected
	at hudson.plugins.ec2.win.EC2WindowsLauncher.connectToWinRM(EC2WindowsLauncher.java:142)
	at hudson.plugins.ec2.win.EC2WindowsLauncher.launchScript(EC2WindowsLauncher.java:52)
	at hudson.plugins.ec2.EC2ComputerLauncher.launch(EC2ComputerLauncher.java:48)
	at hudson.slaves.SlaveComputer.lambda$_connect$0(SlaveComputer.java:298)
	at jenkins.util.ContextResettingExecutorService$2.call(ContextResettingExecutorService.java:46)
	at jenkins.security.ImpersonatingExecutorService$2.call(ImpersonatingExecutorService.java:80)
	at java.base/java.util.concurrent.FutureTask.run(FutureTask.java:264)
	at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
	at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
	at java.base/java.lang.Thread.run(Thread.java:829)


Will switch to c5 instance for windows as m5 is not stable at all for it.

@peterzhuamazon
Copy link
Member Author

It is fixed now after several other PRs.

@peterzhuamazon
Copy link
Member Author

We are moving to docker container run gradle check on both linux and windows now.
It does not make sense to maintain both docker image and ami version of the same setups.
Thanks.

@peterzhuamazon
Copy link
Member Author

The container running gradle check is not returning stable runs.
We also have an issue with our Windows 2019 instance that does not support wsl2, which means we cannot run linux containers natively on Windows for now.

Per @mch2 discussion, we should either remove those docker-compose tasks for Windows, or put this task to backlog until we re-visit with update to Windows 2022 later.

As of now, it will still run windows but fail the docker-compose tasks, but at least it will not get blocked on the start because docker does not exist. Thanks.

@peterzhuamazon
Copy link
Member Author

This has been fixed for now for docker commands not having errors on Windows agent.
But still, core team needs to disable linux containers on Windows runs.

@mch2 Thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working cicd jenkins Jenkins related issue
Projects
Development

No branches or pull requests

2 participants