SELECT ... AS OF TIMESTAMP statement with explicit time may break linearizability when TSO is drifting #56809

Closed
MyonKeminta opened this issue Oct 23, 2024 · 0 comments · Fixed by #57050
Assignees
Labels
affects-6.1 This bug affects the 6.1.x(LTS) versions. affects-6.5 This bug affects the 6.5.x(LTS) versions. affects-7.1 This bug affects the 7.1.x(LTS) versions. affects-7.5 This bug affects the 7.5.x(LTS) versions. affects-8.1 This bug affects the 8.1.x(LTS) versions. affects-8.4 affects-8.5 This bug affects the 8.5.x(LTS) versions. report/customer Customers have encountered this bug. severity/critical sig/transaction SIG:Transaction type/bug The issue is confirmed as a bug.

Comments

@MyonKeminta
Contributor

MyonKeminta commented Oct 23, 2024

Bug Report

Summary

When a user explicitly specifies a time in a SELECT ... AS OF TIMESTAMP statement to perform a stale read, the specified time may be much larger than the latest timestamp that PD has allocated. Users usually don't actually intend to read at a future time, but in some cases PD's TSO lags significantly behind the actual physical time. Note that TSO lagging is not always caused by system time drift; it can also be caused by abnormal latency when PD writes to etcd.
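
The amount of TSO lag can be observed from any client. The following is a minimal sketch (not part of the original report), assuming a Go connection to TiDB via go-sql-driver/mysql and relying on TiDB's TSO layout (physical milliseconds in the bits above the 18-bit logical part):

    package main

    import (
        "database/sql"
        "fmt"
        "time"

        _ "github.com/go-sql-driver/mysql"
    )

    // tsoLag fetches a freshly allocated start_ts and compares its physical part
    // with the local wall clock. @@tidb_current_ts is only non-zero inside an
    // explicit transaction, so one is opened just to obtain the timestamp.
    func tsoLag(db *sql.DB) (time.Duration, error) {
        tx, err := db.Begin()
        if err != nil {
            return 0, err
        }
        defer tx.Rollback()

        var ts uint64
        if err := tx.QueryRow("select @@tidb_current_ts").Scan(&ts); err != nil {
            return 0, err
        }
        physicalMs := int64(ts >> 18) // TSO = physical(ms) << 18 | logical
        return time.Since(time.UnixMilli(physicalMs)), nil
    }

    func main() {
        db, err := sql.Open("mysql", "root@tcp(127.0.0.1:4000)/test")
        if err != nil {
            panic(err)
        }
        lag, err := tsoLag(db)
        if err != nil {
            panic(err)
        }
        fmt.Printf("PD TSO physical time lags wall clock by %v\n", lag)
    }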

TiKV doesn't use the timestamp from stale-read requests to push its max_ts (which is used for commit_ts calculation in Async Commit / 1PC transactions). However, when a stale-read request is retried, TiDB (client-go) usually falls back to normal leader-read mode without resetting the read_ts. The retried request then carries the manually specified read ts but not the stale-read flag, so TiKV uses it to push max_ts, which can end up larger than any timestamp PD will allocate for some time afterwards. This breaks the linearizability of Async Commit / 1PC transactions.
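
The sketch below only illustrates the retry path described above; the identifiers (readRequest, retryAfterRegionErrorOrLockConflict) are hypothetical and are not actual client-go code.

    // Illustrative sketch only: shows the shape of the bug. On retry the stale-read
    // flag is dropped, but the user-specified read timestamp is kept.
    package sketch

    type readRequest struct {
        readTS    uint64 // from "AS OF TIMESTAMP"; may exceed PD's latest TSO when TSO lags
        staleRead bool   // while true, TiKV does not use readTS to advance max_ts
    }

    func retryAfterRegionErrorOrLockConflict(req *readRequest) {
        // Fall back to a normal leader read...
        req.staleRead = false
        // ...but readTS is not re-fetched from PD. TiKV now uses this possibly-future
        // timestamp to push max_ts, so a later Async Commit / 1PC transaction can get
        // a commit_ts ahead of anything PD has allocated, breaking linearizability.
    }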

Minimal reproduce step

  1. Create a cluster where PD's TSO is lagging. A simple way is to set tso-update-physical-interval="2s" on PD, which makes PD update the physical part of TSO only every 2 seconds.
  2. Continuously run SELECT ... AS OF TIMESTAMP statements, specifying a recent past time as a string. Let the stale-read requests sent to TiKV retry due to region errors or lock conflicts. At the same time, let some transactions commit in Async Commit / 1PC mode.
    • Case 1: Prepare the table like this:
      create table t (id int primary key, v int);
      insert into t values (1, 1);
      Then run the following procedure concurrently (some error handling omitted; c1 and c2 are two different connections):
      for {
          for {
              readTime := time.Now().Add(-time.Second * 1)
              readTimeStr := readTime.Format("2006-01-02 15:04:05.999")
              _, err := c2.ExecContext(ctx, "select * from t as of timestamp '"+readTimeStr+"' where id = 1")
              if err != nil {
                  // Ignore the case that the time is rejected
                  if strings.Contains(err.Error(), "cannot set read timestamp to a future time") {
                      time.Sleep(time.Millisecond * 10)
                      continue
                  }
                  panic(err)
              }
              break
          }
      
          // Concurrently commit short read-write transactions on the same row
          // (these commit via Async Commit / 1PC with TiDB's default settings).
          tx, _ := c1.BeginTx(ctx, &sql.TxOptions{})
          tx.ExecContext(ctx, "update t set v = v + 1 where id = 1")
          tx.Commit()
      }
      Then the following error may occur on the update statement:
      Error 1105 (HY000): Txn 453423637808807957 Retrying aggressive locking with ForUpdateTS (453423637808807963) less than previous LockedWithConflictTS (453423637831352321)
      
      The reason is that when the above procedure runs concurrently, the stale read statement may meet a lock left by a transaction on another thread, which makes it fall back to leader read mode. The statement update t set v = v + 1 where id = 1 runs in fair locking mode, and when it encounters a write conflict (another transaction committed on this key), the current transaction gets a new forUpdateTS from PD to retry the statement, asserting that the new ts from PD must be larger than the commit_ts of the previously met conflicting commit record. However, this assertion is violated.
    • Case 2:
      1. Hack the code to make all stale reads return the DataIsNotReady error, which is a normal and common error in stale-read scenarios. This forces stale-read requests to always fall back to leader read mode.
      2. Run the same procedure as in case 1, but it doesn't need to run concurrently; alternatively, run it on a different table for each thread. Then the same error as in case 1 will appear.
    • Case 3: Run the following procedure repeatedly. You might need to set pessimistic-txn.max-retry-count=100000 in TiDB's config file to avoid the "pessimistic lock retry limit reached" error.
      /* t1 */ set @@tidb_pessimistic_txn_fair_locking = 0;
      /* t1 */ begin;
      /* t1 */ update t set v = v + 1 where id = 1;
      -- Calculate the time the same way as in case 1.
      -- Retry if it reports "cannot set read timestamp to a future time"
      /* t2 */ select * from t as of timestamp '...' where id = 1; 
      -- Save this result as `value1`
      /* t1 */ select v from t where id = 1;
      /* t1 */ commit;
      -- Save the commit_ts of this transaction as `ts1`
      /* t1 */ select @@tidb_last_txn_info;
      -- Save this result as `value2`. Do not use a `where` clause, to avoid the point-get using max_uint64 as its ts.
      /* t1 */ select v from t;
      -- Save the start_ts of the query above (reported by @@tidb_last_query_info) as `ts2`
      /* t1 */ select @@tidb_last_query_info;
      -- Assert `value1` == `value2` and `ts1` <= `ts2`.
      In this procedure, the later query in t1 should read the result committed by the preceding transaction, and the timestamps should be monotonic, no matter what t2 does. However, the assertion can sometimes fail; a sketch of this check is given after this list. This is a sample output of such a failure in our test program:
      t2 read at 2024-10-23 17:48:28.822
      res1 128, res2 127, txnInfo {"txn_scope":"global","start_ts":453424423584662150,"commit_ts":453424423586234369,"txn_commit_mode":"1pc","async_commit_fallback":false,"one_pc_fallback":false}, lastQueryInfo {"txn_scope":"global","start_ts":453424423584662153,"for_update_ts":453424423584662153,"ru_consumption":0.5033112418619792}
      
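For case 3, the value and timestamp assertions can be checked programmatically. The following is a rough sketch only (the helper checkCase3 and its argument names are assumptions, not part of the original test program); it runs the tail of the case-3 procedure on the t1 connection and parses the JSON fields shown in the sample output above:

    package repro

    import (
        "context"
        "database/sql"
        "encoding/json"
        "fmt"
    )

    type lastTxnInfo struct {
        StartTS  uint64 `json:"start_ts"`
        CommitTS uint64 `json:"commit_ts"`
    }

    type lastQueryInfo struct {
        StartTS uint64 `json:"start_ts"`
    }

    // checkCase3 is called right after `commit` on the t1 connection (c1).
    // value1 is the result of t2's stale read.
    func checkCase3(ctx context.Context, c1 *sql.Conn, value1 int) error {
        // ts1: commit_ts of the transaction t1 just committed.
        var raw string
        if err := c1.QueryRowContext(ctx, "select @@tidb_last_txn_info").Scan(&raw); err != nil {
            return err
        }
        var txnInfo lastTxnInfo
        if err := json.Unmarshal([]byte(raw), &txnInfo); err != nil {
            return err
        }
        ts1 := txnInfo.CommitTS

        // value2: read back without a `where` clause, as in the procedure above.
        var value2 int
        if err := c1.QueryRowContext(ctx, "select v from t").Scan(&value2); err != nil {
            return err
        }

        // ts2: start_ts of that last query, taken from @@tidb_last_query_info.
        if err := c1.QueryRowContext(ctx, "select @@tidb_last_query_info").Scan(&raw); err != nil {
            return err
        }
        var queryInfo lastQueryInfo
        if err := json.Unmarshal([]byte(raw), &queryInfo); err != nil {
            return err
        }
        ts2 := queryInfo.StartTS

        if value1 != value2 {
            return fmt.Errorf("value mismatch: stale read saw %d, later read saw %d", value1, value2)
        }
        if ts1 > ts2 {
            return fmt.Errorf("timestamp regression: commit_ts %d > later start_ts %d", ts1, ts2)
        }
        return nil
    }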

What is your TiDB version?

v7.5.1

@MyonKeminta MyonKeminta added the type/bug The issue is confirmed as a bug. label Oct 23, 2024
@cfzjywxk cfzjywxk added affects-6.5 This bug affects the 6.5.x(LTS) versions. affects-7.1 This bug affects the 7.1.x(LTS) versions. affects-7.5 This bug affects the 7.5.x(LTS) versions. affects-8.1 This bug affects the 8.1.x(LTS) versions. affects-8.4 report/customer Customers have encountered this bug. and removed may-affects-5.4 This bug maybe affects 5.4.x versions. may-affects-6.1 may-affects-6.5 may-affects-7.1 may-affects-7.5 may-affects-8.1 labels Oct 23, 2024
@ti-chi-bot ti-chi-bot added the affects-8.5 This bug affects the 8.5.x(LTS) versions. label Nov 1, 2024
@ti-chi-bot ti-chi-bot bot closed this as completed in 3578b1d Nov 12, 2024
@MyonKeminta MyonKeminta added the affects-6.1 This bug affects the 6.1.x(LTS) versions. label Dec 19, 2024