TY - GEN
T1 - An active data-aware cache consistency protocol for highly-scalable data-shipping DBMS architectures
AU - Wu, Keqiang
AU - Chuang, Peng Fei
AU - Lilja, David J.
PY - 2004
Y1 - 2004
N2 - In a data-shipping database system, data items are retrieved from the server machines, cached and processed at the client machines, and then shipped back to the server. Current cache consistency approaches typically rely on a centralized server or servers to enforce the necessary concurrency control actions. This centralized server imposes a limitation on the scalability and performance of these systems. This paper presents a new consistency protocol, Active Data-aware Cache Consistency (ADCC), that allows clients to be aware of the global state of their cached data via a two-tier directory. Using parallel communication with simultaneous clientserver and client-client messages, ADCC reduces the network latency for detecting data conflicts by 50%, while increasing message overhead by about 8% only. In addition, ADCC improves scalability by partially offloading the concurrency control function from the server to the clients. An optimization, Lazy Update, is introduced to reduce the message overhead for maintaining client directory consistency. We implement ADCC in a page server DBMS architecture and compare it with the leading cache consistency algorithm, Callback Locking (CBL), which is the most widely implemented algorithm in commercial DBMSs. Our performance study shows that ADCC has a similar or lower abort rate, higher throughput, and better scalability for important workloads and system configurations. Both the simulation results and the analytic study indicate that the message overhead is low and that ADCC produces better behavior compared to the traditional server-based communication under high contention workloads.
AB - In a data-shipping database system, data items are retrieved from the server machines, cached and processed at the client machines, and then shipped back to the server. Current cache consistency approaches typically rely on a centralized server or servers to enforce the necessary concurrency control actions. This centralized server imposes a limitation on the scalability and performance of these systems. This paper presents a new consistency protocol, Active Data-aware Cache Consistency (ADCC), that allows clients to be aware of the global state of their cached data via a two-tier directory. Using parallel communication with simultaneous clientserver and client-client messages, ADCC reduces the network latency for detecting data conflicts by 50%, while increasing message overhead by about 8% only. In addition, ADCC improves scalability by partially offloading the concurrency control function from the server to the clients. An optimization, Lazy Update, is introduced to reduce the message overhead for maintaining client directory consistency. We implement ADCC in a page server DBMS architecture and compare it with the leading cache consistency algorithm, Callback Locking (CBL), which is the most widely implemented algorithm in commercial DBMSs. Our performance study shows that ADCC has a similar or lower abort rate, higher throughput, and better scalability for important workloads and system configurations. Both the simulation results and the analytic study indicate that the message overhead is low and that ADCC produces better behavior compared to the traditional server-based communication under high contention workloads.
KW - Active control
KW - Cache consistency
KW - DBMS
KW - Data-shipping
KW - Parallel communication
UR - http://www.scopus.com/inward/record.url?scp=4143120161&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=4143120161&partnerID=8YFLogxK
U2 - 10.1145/977091.977122
DO - 10.1145/977091.977122
M3 - Conference contribution
AN - SCOPUS:4143120161
SN - 1581137419
SN - 9781581137415
T3 - 2004 Computing Frontiers Conference
SP - 222
EP - 234
BT - 2004 Computing Frontiers Conference
PB - Association for Computing Machinery (ACM)
T2 - 2004 Computing Frontiers Conference
Y2 - 14 April 2004 through 16 April 2004
ER -