Optimized Durable Commitlog for Apache Cassandra Using CAPI-Flash
Citations Over TimeTop 10% of 2016 papers
Abstract
High-velocity data imposes high durability overheads on Big Data technology components such as NoSQL data stores. In Apache Cassandra, a widely used NoSQL solution with high scalability and availability, write-ahead logging is used to support Commitlog operations, which in turn provides fault tolerance to applications. However, current write-ahead logging techniques are limited by the excessive overhead in the I/O subsystem. To address this performance gap, we have designed a novel CAPI-Flash based high performance durable Commitlog for Apache Cassandra. We take advantage of the high throughput, low latency path to flash storage provided by the Coherent Accelerator Processor Interface (CAPI) on IBM POWER8 Systems. Our experimental results show that for write-intensive workloads CAPI-Flash logging provides up to 107% improvement in throughput compared to Cassandra's durable alternative. We also provide 77% better throughput in update-mostly workloads.
Related Papers
- → Flash memory cells data loss caused by total ionizing dose and heavy ions(2014)26 cited
- → Secondary Electron flash-a high performance, low power flash technology for 0.35 μm and below(2002)60 cited
- Effective Way to Handling Big Data Problems using NoSQL Database (MongoDB)(2015)
- → On the capacity of flash memories(2008)6 cited
- → A 0.18 μm flash source side erasing improvement(2005)4 cited