Configurable Replica Read Timeout with Retry #44771

Open
Tema opened this issue Jun 19, 2023 · 2 comments
Labels
type/feature-request Categorizes issue or PR as related to a new feature.

Comments

Tema (Contributor) commented Jun 19, 2023

Configurable Replica Read Timeout with Retry Feature Request

Is your feature request related to a problem? Please describe:
One of the common problems when running TiDB in the cloud on network-attached disks (Amazon EBS, Google Persistent Disk, or Azure managed disks) is temporarily elevated disk I/O latency. This can happen when a cloud provider storage node fails and goes through a repair procedure. During the repair phase, a network-attached disk can exhibit 100 ms or even single-digit-second latency, versus single-digit-millisecond latency under normal conditions.

Describe the feature you'd like:
If a TiDB customer uses the Follower Read or Stale Read feature, it is possible to retry a request that initially landed on a TiKV node whose network-attached disk exhibits elevated latency on another TiKV replica. While a retry policy already exists in the tikv go-client, the default network timeout is tens of seconds.

OLTP workloads on TiDB could benefit from the introduction of a system variable, tidb_tikv_read_timeout, which would be passed as a context timeout on TiKV requests made by the TiDB layer, relying on the existing replica selector logic to retry requests on other replicas. The implementation of this feature also needs to take care of the following:

Describe alternatives you've considered:
TiDB already has a max_execution_time system variable, but it is not used as a context deadline in the go-client for network calls from TiDB to TiKV. Moreover, if a TiKV request takes longer than max_execution_time, the session is marked as killed and a retry won't happen.

Teachability, Documentation, Adoption, Migration Strategy:
The feature would be fully controlled by the session variable tidb_tikv_read_timeout.

Tema added the type/feature-request label on Jun 19, 2023
ti-chi-bot bot pushed a commit that referenced this issue Jul 10, 2023
easonn7 commented Aug 4, 2023

Controlling the timeout behavior of the tikv-client is reasonable and requires such a parameter. However, the newly added variable overlaps with the existing variable tidb_load_based_replica_read_threshold. I personally suggest keeping only tidb_tikv_read_timeout and gradually deprecating tidb_load_based_replica_read_threshold in the future.
