issue of zk push-empty protection #6162

laywin · 2023-12-15T14:06:05Z

I have searched the issues of this repository and believe that this is not a duplicate.

Ⅰ. Issue Description

The ZooKeeper (zk) client implementation does not take push-empty protection into account.



funky-eyes · 2023-12-15T14:21:43Z

I suspect this is a bug. The original author probably reasoned that when a node is deleted, the corresponding cluster should be deleted too, to avoid wasting memory. But the else-if branch also checks whether instances is empty in order to prevent an empty push, so the two behaviours contradict each other. I suggest the following fix:

When the cluster mapped to a transaction group is switched, delete the old cluster with a delay. If the cluster is switched back within a short time, it can start working immediately without reading from ZooKeeper again. This still meets the original author's goal, and the delayed deletion can clear the corresponding listener at the same time.
Change the listener logic: if the cluster mapped to the current group is still the node this listener watches, apply push-empty protection; otherwise allow the empty push.
If the community has better suggestions, feel free to discuss them here.
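
To make the delayed-deletion idea concrete, here is a minimal sketch. It is not taken from the Seata code base: the class `DelayedClusterCleaner`, its methods, and the `cleanup` callback (which would drop the cached addresses and the listener) are all hypothetical names used for illustration only.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;

// Hypothetical sketch; class and method names are not taken from Seata.
class DelayedClusterCleaner {
    // Daemon thread so a pending cleanup never keeps the JVM alive.
    private final ScheduledExecutorService scheduler =
        Executors.newSingleThreadScheduledExecutor(r -> {
            Thread t = new Thread(r, "cluster-cleaner");
            t.setDaemon(true);
            return t;
        });
    // cluster -> token of the cleanup that is currently pending for it
    private final Map<String, Object> pending = new ConcurrentHashMap<>();
    private final long delayMillis;

    DelayedClusterCleaner(long delayMillis) {
        this.delayMillis = delayMillis;
    }

    /** The group switched away from {@code cluster}: clean up later, not now. */
    void scheduleCleanup(String cluster, Runnable cleanup) {
        Object token = new Object();
        pending.put(cluster, token); // supersedes any earlier pending cleanup
        scheduler.schedule(() -> {
            // Run only if this cleanup was neither cancelled nor superseded.
            if (pending.remove(cluster, token)) {
                cleanup.run();
            }
        }, delayMillis, TimeUnit.MILLISECONDS);
    }

    /** The group switched back in time: keep the cached addresses and the listener. */
    boolean cancelCleanup(String cluster) {
        return pending.remove(cluster) != null;
    }
}
```

In this sketch a cluster switch would call `scheduleCleanup(oldCluster, ...)`, and a switch back within the delay would call `cancelCleanup(oldCluster)`, so the old cluster can be reused without another read from ZooKeeper.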

slievrly · 2023-12-16T10:27:56Z

I agree with this plan. Regarding the original cluster list, I think we can consider not deleting it for the time being, as the actual data volume it occupies is quite small.

laywin · 2023-12-17T02:37:50Z

As I understand it, the handling is similar to the Redis handling in #6164: on removal, check the cluster in the callback. Only if it differs from the cluster currently in use may the empty push remove it, and in that case the listener must also be removed and the subscription cancelled.
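
A rough sketch of that check, for discussion only. `ClusterWatchRegistry` and all of its members are hypothetical names and do not come from Seata or from the ZooKeeper client API; the `unsubscribe` callback stands in for whatever call removes the real listener, and a single transaction group is assumed to keep the example short.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical sketch; none of these names come from Seata.
class ClusterWatchRegistry {
    // Cluster the transaction group is currently mapped to.
    private volatile String currentCluster;
    // cluster -> last known server addresses
    private final Map<String, List<String>> addresses = new ConcurrentHashMap<>();
    // cluster -> action that removes the listener registered for it
    private final Map<String, Runnable> unsubscribers = new ConcurrentHashMap<>();

    void switchTo(String cluster) {
        currentCluster = cluster;
    }

    void subscribe(String cluster, Runnable unsubscribe) {
        unsubscribers.put(cluster, unsubscribe);
    }

    /** Invoked by the listener of {@code callbackCluster} with its current instances. */
    void onCallback(String callbackCluster, List<String> instances) {
        if (!instances.isEmpty()) {
            addresses.put(callbackCluster, new ArrayList<>(instances));
            return;
        }
        if (callbackCluster.equals(currentCluster)) {
            // Push-empty protection: the cluster is in use, keep the old addresses.
            return;
        }
        // The callback cluster is no longer in use: allow the empty push,
        // then remove the listener so the subscription is cancelled.
        addresses.remove(callbackCluster);
        Runnable unsubscribe = unsubscribers.remove(callbackCluster);
        if (unsubscribe != null) {
            unsubscribe.run();
        }
    }

    List<String> addressesOf(String cluster) {
        return addresses.getOrDefault(cluster, Collections.emptyList());
    }
}
```

The point of the design is that an empty instance list is only trusted when it comes from a cluster the group has already left; for the cluster in use, an empty list is treated as a transient registry problem and ignored.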

funky-eyes assigned laywin Dec 22, 2023

laywin linked a pull request Dec 25, 2023 that will close this issue

optimize: zk push empty protect #6206
