Nacos 2.4.1 frequently fails to perform Raft leader election #12526

Closed
wuqtao opened this issue Aug 20, 2024 · 9 comments · Fixed by #12573
Labels
area/Nacos Core kind/bug Category issues or prs related to bug.
Milestone

Comments

@wuqtao

wuqtao commented Aug 20, 2024

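When starting a 3-node cluster in a k8s environment, one node often stays stuck in the "Nacos is starting…" state for a long time. Two or three minutes after the pod starts, the alipay-jraft.log file is still empty and contains no leader-election logs. After several pod restarts, the node sometimes starts successfully.
On the other, healthy nodes, alipay-jraft.log usually has output about 5s after startup.
Version 2.4.0.1 and earlier versions do not have this problem. Please fix it as soon as possible, thanks.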

@KomachiSion
Collaborator

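I cannot reproduce this problem. When it happens again, could you dump the thread stacks with the jstack command and check where the main method's stack is stuck?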

@wuqtao
Author

wuqtao commented Aug 23, 2024

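> I cannot reproduce this problem. When it happens again, could you dump the thread stacks with the jstack command and check where the main method's stack is stuck?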

On my side it reproduces most of the time (over 90% of attempts). I'll post the jstack output below; it shows a deadlock.
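
For readers who want a comparable check from inside a JVM, here is a minimal Java sketch. It is not part of Nacos. It asks the standard `ThreadMXBean` for deadlocked threads, which is similar in spirit to the deadlock section of a jstack dump. The class name `DeadlockCheck` and the wording of the output are made up for illustration, and the check only sees the JVM it runs in, so jstack remains the tool for an already running Nacos process.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.ThreadInfo;
import java.lang.management.ThreadMXBean;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Nacos: reports deadlocks in the current JVM.
final class DeadlockCheck {

    /** Returns one description per deadlocked thread; empty if none is detected. */
    static List<String> describeDeadlocks() {
        ThreadMXBean mx = ManagementFactory.getThreadMXBean();
        long[] ids = mx.findDeadlockedThreads(); // null when no deadlock is detected
        List<String> out = new ArrayList<>();
        if (ids == null) {
            return out;
        }
        // Request locked monitors and synchronizers so lock owners are resolved.
        for (ThreadInfo info : mx.getThreadInfo(ids, true, true)) {
            if (info != null) { // a thread may have exited in the meantime
                out.add(info.getThreadName() + " waits for " + info.getLockName()
                        + " held by " + info.getLockOwnerName());
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<String> found = describeDeadlocks();
        System.out.println(found.isEmpty() ? "no deadlock detected" : String.join("\n", found));
    }
}
```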

@wuqtao
Author

wuqtao commented Aug 23, 2024

IMG_20240823_091414
IMG_20240823_091425
IMG_20240823_091453
IMG_20240823_091505
IMG_20240823_091512
IMG_20240823_091521

@wuqtao
Author

wuqtao commented Aug 23, 2024

IMG_20240823_101229
IMG_20240823_101308

@KomachiSion
Collaborator

Photos are hard to read and cannot be searched. Could you upload the stack dump as a file?

@KomachiSion
Collaborator

Also, is the deployment mode cluster + Derby?

@wuqtao
Author

wuqtao commented Aug 28, 2024

> Also, is the deployment mode cluster + Derby?

Yes, cluster + Derby.

@wuqtao
Author

wuqtao commented Aug 28, 2024

> Photos are hard to read and cannot be searched. Could you upload the stack dump as a file?

nacos-jstack.log
Here is the jstack log.

@fuxiuzhan

Initializing beans from multiple threads caused a deadlock.
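
To illustrate the general pattern only: a minimal Java sketch of how initialization spread across threads can lock up. It assumes a single container-wide creation lock, and the names `BeanInitDeadlockSketch` and `CREATION_LOCK` are hypothetical. It is not taken from the attached jstack log or from Nacos code. One thread holds the lock while its initializer waits for a worker, and the worker needs the same lock to look up its own bean. A timed `tryLock` is used so the sketch terminates instead of hanging.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.locks.ReentrantLock;

// Hypothetical illustration, not Nacos or Spring code.
final class BeanInitDeadlockSketch {

    // Stand-in for a container-wide bean creation lock (an assumption).
    private static final ReentrantLock CREATION_LOCK = new ReentrantLock();

    /**
     * Returns true if the worker obtained its bean, false if it timed out.
     * With an untimed lock, the false case would be a permanent deadlock.
     */
    static boolean initOnMainWhileWorkerLooksUp(long timeoutMillis) {
        final boolean[] workerGotBean = {false};
        CountDownLatch workerDone = new CountDownLatch(1);

        CREATION_LOCK.lock(); // the calling thread starts creating bean A
        try {
            Thread worker = new Thread(() -> {
                try {
                    // The worker needs bean B, whose lookup takes the same lock.
                    if (CREATION_LOCK.tryLock(timeoutMillis, TimeUnit.MILLISECONDS)) {
                        try {
                            workerGotBean[0] = true;
                        } finally {
                            CREATION_LOCK.unlock();
                        }
                    }
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                } finally {
                    workerDone.countDown();
                }
            }, "worker");
            worker.start();
            try {
                // Bean A's initializer waits for the worker while still holding the lock.
                workerDone.await();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        } finally {
            CREATION_LOCK.unlock();
        }
        return workerGotBean[0];
    }
}
```

In this sketch the worker can never get the lock while the caller is waiting for it, so the method returns false once the timeout expires.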

@KomachiSion KomachiSion added kind/bug Category issues or prs related to bug. and removed status/need feedback labels Sep 2, 2024
@KomachiSion KomachiSion added this to the 2.4.2 milestone Sep 2, 2024
KomachiSion added a commit to KomachiSion/nacos that referenced this issue Sep 2, 2024
KomachiSion added a commit that referenced this issue Sep 2, 2024
* Enhance health check readiness and server status logic.

* Enhance health check readiness and server status logic.

* For #12526, judge whether cp protocol has init to avoid dead lock to get cp protocol.
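
A rough sketch of the idea in that last commit line, written in Java with hypothetical names (`ProtocolHolderSketch`, `readiness`, and the `STARTING`/`UP` strings are all assumptions). This is not the code from the actual fix. The point is that a readiness check first asks whether the CP protocol has been initialized and reports "not ready" if it has not, instead of calling a getter that might block on initialization.

```java
import java.util.concurrent.atomic.AtomicReference;

// Hypothetical illustration of an "is it initialized yet?" guard.
final class ProtocolHolderSketch {

    // Holds the protocol once initialization has finished; null until then.
    private final AtomicReference<String> cpProtocol = new AtomicReference<>();

    /** Called by the startup thread when initialization completes; later calls are ignored. */
    void init(String protocol) {
        cpProtocol.compareAndSet(null, protocol);
    }

    /** Non-blocking check: never waits for initialization. */
    boolean isInitialized() {
        return cpProtocol.get() != null;
    }

    /** Readiness probe: answers immediately rather than waiting on the protocol. */
    String readiness() {
        if (!isInitialized()) {
            return "STARTING";
        }
        return "UP";
    }
}
```

A health check built this way can be called from any thread during startup without taking part in initialization itself.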