Seems that readIndex() may return inconsistent value (e.g. 0) after node restart, is this a known issue? #1049

sanpwc · 2023-12-11T07:26:31Z

Within node restart lastCommittedIndex initialized as 0

public class BallotBox implements Lifecycle<BallotBoxOptions>, Describer {
    ...
    private long lastCommittedIndex = 0;
    ...
}

Taking into the consideration that local log application triggered by NodeImpl#init is asynchronous, it's possible to handle readIndex request before local log re-play is finished and thus see inconsistent readIndex value. In other words, seems that there's a race between readIndex evaluation and lastCommittedIndex restoration on node restart.

The text was updated successfully, but these errors were encountered:

killme2008 · 2023-12-11T07:57:21Z

I think it's impossible, the handleReadIndexRequest will check the state of the node before reading the lastCommittedIndex, and if its state is not the leader or the current leader doesn't commit an entry before, the readIndex will fail.

sofa-jraft/jraft-core/src/main/java/com/alipay/sofa/jraft/core/NodeImpl.java

Line 1511 in 2ea82d6

switch (this.state) {

sofa-jraft/jraft-core/src/main/java/com/alipay/sofa/jraft/core/NodeImpl.java

Line 1566 in 2ea82d6

if (this.logManager.getTerm(lastCommittedIndex) != this.currTerm) {

And when a node becomes a leader, it will update the lastCommittedIndex before releasing the write lock:

sofa-jraft/jraft-core/src/main/java/com/alipay/sofa/jraft/core/NodeImpl.java

Line 1264 in 2ea82d6

this.confCtx.flush(this.conf.getConf(), this.conf.getOldConf());

sofa-jraft/jraft-core/src/main/java/com/alipay/sofa/jraft/core/NodeImpl.java

Line 502 in 2ea82d6

    
           this.node.unsafeApplyConfiguration(conf, oldConf == null || oldConf.isEmpty() ? null : oldConf, true);

sanpwc · 2023-12-11T08:53:18Z

Hmm, seems that if (this.logManager.getTerm(lastCommittedIndex) != this.currTerm) will be skipped in case of quorum <= 1 and this is the case when the issue is reproduced.

killme2008 · 2023-12-11T09:02:55Z

Hmm, seems that if (this.logManager.getTerm(lastCommittedIndex) != this.currTerm) will be skipped in case of quorum <= 1 and this is the case when the issue is reproduced.

If so, that would be possible, good catch! But I really doubt if is this a common case(just one node) for the production usage of jraft.

We can fix it, but maybe it is not urgent.

sanpwc · 2023-12-11T09:30:43Z

We can fix it

Well, that'll be nice))

BTW, seems that it's not the difficult to fix it. In case of single noded cluster (and only single noded cluster). We may consider pendingIndex on restart as committed one, just because in case of single node no-one will discard log entries.
So basically, it should be possible to do following to solve the issue. In case of single noded clusters only.

    public boolean resetPendingIndex(final long newPendingIndex) {
            ...
            this.lastCommittedIndex = newPendingIndex - 1;
            ...
        }

What do you think?

killme2008 · 2023-12-12T02:03:27Z

We can fix it

Well, that'll be nice))

BTW, seems that it's not the difficult to fix it. In case of single noded cluster (and only single noded cluster). We may consider pendingIndex on restart as committed one, just because in case of single node no-one will discard log entries. So basically, it should be possible to do following to solve the issue. In case of single noded clusters only.
    public boolean resetPendingIndex(final long newPendingIndex) {
            ...
            this.lastCommittedIndex = newPendingIndex - 1;
            ...
        }
What do you think?

It seems like the fix will work as expected. Would you be able to create a pull request for this change?

sanpwc · 2023-12-12T06:00:08Z

Sure

sanpwc mentioned this issue Dec 12, 2023

IGNITE-21000 Fix read commands reordering on raft node restart apache/ignite-3#2908

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Seems that readIndex() may return inconsistent value (e.g. 0) after node restart, is this a known issue? #1049

Seems that readIndex() may return inconsistent value (e.g. 0) after node restart, is this a known issue? #1049

sanpwc commented Dec 11, 2023

killme2008 commented Dec 11, 2023

sanpwc commented Dec 11, 2023

killme2008 commented Dec 11, 2023

sanpwc commented Dec 11, 2023 •

edited

killme2008 commented Dec 12, 2023

sanpwc commented Dec 12, 2023

Seems that readIndex() may return inconsistent value (e.g. 0) after node restart, is this a known issue? #1049

Seems that readIndex() may return inconsistent value (e.g. 0) after node restart, is this a known issue? #1049

Comments

sanpwc commented Dec 11, 2023

killme2008 commented Dec 11, 2023

sanpwc commented Dec 11, 2023

killme2008 commented Dec 11, 2023

sanpwc commented Dec 11, 2023 • edited

killme2008 commented Dec 12, 2023

sanpwc commented Dec 12, 2023

sanpwc commented Dec 11, 2023 •

edited