Transaction
Last updated
Last updated
The transaction engine in TiDB is responsible to provide ACID guarantees for all the read and write requests. It consists of the client/coordinator part in the TiDB repository and the server/participant part in the TiKV repository.
This document is mainly about the TiDB part.
In TiDB the transaction write flow is like this:
After the transaction starts in a session, all the reads and writes will use a snapshot to fetch data, and the written content will be buffered in the memory of the transaction. When the commit
statement is received from the client, the Percolator protocol will be used to commit these changes to the storage system.
In TiDB, the Transaction
interface defines the commonly used transaction operations.
These are common interfaces the transaction will provide.
For example, Commit
will be used to commit the current ongoing transaction. The transaction is considered ongoing before the Commit
operation is triggered. The two-phase commit processing will be used to commit a transaction and it will finally become committed or aborted.
LazyTxn
is a wrapper of the transaction implementations, when the SQL statements are executed using a standalone session context, LazyTxn
will be used to do things like:
Return the memory buffer for write.
Set specific operations or flags for the current transaction.
Return the snapshot of this transaction.
Commit the current transaction.
Lock specific keys.
Usually, the first thing that will be done executing a statement is to activate
the related transaction
By default, TiDB provides the snapshot isolation level. Thus, in each new transaction, a new global strong snapshot will be fetched first, before executing statements.
In TiDB, the snapshot is represented by a global TSO which is fetched from the PD server, and it acts as the unique identifier for this transaction. After this operation, a transaction is regarded as activated
.
For the read SQL statements, the snapshot will be used to provide a global strong consistent snapshot, all the reads will check data visibility using this snapshot. Most executors will set the timestamp doing the build, and the transaction could be activated by the building process. Some commonly used snapshot APIs are as below:
For the write SQL statements, they will write data into the transaction memory buffer temporarily until the commit
operation is triggered. There are 3 main interfaces which will write query data into the memory buffer. Here is the table API:
Every statement will use a staging
buffer during its execution. If it's successful, the staging content will be merged into the transaction memory buffer.
For example, AddRecord
will try to write a row into the current statement staging buffer, and the RemoveRecord
will try to remove a row from the staging statement buffer. The existing transaction memory buffer will not be affected if the statement has failed.
The memory buffer implementation is wrapped in memBuffer. The internal implementation is MemDB
struct.
The memory buffer is an ordered map and it provides the staging
and discard
utilities. For example, the memory content generated by a statement will be discarded if its execution has failed.
After the statement execution phase, the commit
statement will trigger the commit execution for the current transaction. In TiDB, the Percolator protocol is used as the distributed transaction protocol, it's a two-phase protocol.
In the first stage, the transaction coordinator (TiDB server) will try to prewrite all the related keys. If all of them are successful, the transaction coordinator will then commit the primary key. After that the transaction is considered committed successfully, and a success message will be responded to the client. All the other keys will be committed asynchronously.
All the changes in the transaction memory buffer will be converted into mutations which will be used by the two-phase committer. These mutations will be grouped by their region locations, and prewrite requests will be sent to their region leaders.
If all the prewrite requests are processed successfully, the commit request for the primary key will be sent to TiKV first. If the primary key commit is successful, the transaction is considered committed and will respond to the client with successful results.