Title :
KuaFu: Closing the parallelism gap in database replication
Author :
Chuntao Hong ; Dong Zhou ; Mao Yang ; Carbo Kuo ; Lintao Zhang ; Lidong Zhou
Author_Institution :
Microsoft Research Asia, Beijing, China
Abstract :
Database systems are increasingly deployed on multi-core commodity servers, with replication to guard against failures. A database engine is best designed to scale with the number of cores, offering a high degree of parallelism on a modern multi-core architecture. Replication, on the other hand, traditionally resorts to some form of serialization to keep data consistent among replicas. In the widely used primary/backup replication with log shipping, concurrent execution on the primary and serialized log replay on a backup create a serious parallelism gap. Our experiment on MySQL with a 16-core configuration shows that serial replay on a backup can sustain less than one third of the throughput achievable on the primary under an OLTP workload. This paper proposes KuaFu to close the parallelism gap in replicated database systems by enabling concurrent replay of transactions on a backup. KuaFu maintains write consistency on backups by tracking transaction dependencies. Concurrent replay on a backup does, however, introduce read inconsistency between the primary and backups; KuaFu therefore leverages multi-version concurrency control to produce snapshots that restore the consistency semantics. We have implemented KuaFu on MySQL; our evaluations show that KuaFu allows a backup to keep up with the primary while preserving replication consistency.
Keywords :
SQL; concurrency control; data mining; multiprocessing systems; replicated databases; KuaFu; MySQL; OLTP workload; backup replication; concurrent executions; concurrent replay; data consistency; database engine; database replication; log shipping; modern multicore architecture; multicore commodity servers; multiversion concurrency control; parallelism gap; preserving replication consistency; primary replication; replicated database systems; serialized log replay; Database systems; Engines; Parallel processing; Semantics; Servers; Throughput;
Conference_Title :
Data Engineering (ICDE), 2013 IEEE 29th International Conference on
Conference_Location :
Brisbane, QLD
Print_ISBN :
978-1-4673-4909-3
Electronic_ISBN :
1063-6382
DOI :
10.1109/ICDE.2013.6544908