LAST-HDFS: Location-Aware Storage Technique for Hadoop Distributed File System
Citations Over TimeTop 10% of 2016 papers
Abstract
Enabled by the state-of-the-art cloud computing technologies, cloud storage has gained increasing popularity in recent years. Despite of the benefit of flexible and reliable data access offered by such services, users have to bear with the fact of not actually knowing the whereabouts of their data. The lack of knowledge and control of the physical locations of data could raise legal and regulatory issues, especially for certain sensitive data that are governed by laws to remain within certain geographic boundaries and borders. In this paper, we study the problem of data placement control within distributed file systems supporting cloud storage. Particularly, we consider the open source Hadoop file system (HDFS) as the underlying architecture, and propose a location-aware cloud storage system, named LAST-HDFS, to support and enforce location-aware storage in HDFS-based clusters. In addition, it also includes a monitoring system deployed at individual hosts to oversee and detect potential data placement violations due to the existence of malicious datanodes. We carried out an extensive experimental evaluation in a real cloud environment that demonstrates the effectiveness and efficiency of our proposed system.
Related Papers
- → Availability in the Flexible and Adaptable Distributed File System(2015)2 cited
- Distributed Storage System Surveyed(2011)
- → Create Cloud Table Storage(2009)
- → DFS Response Time Prediction Using the Techniques of “Deep Learning”(2020)
- An Adaptive Approach to implement an Object Storage for Private Cloud(2021)