Scalable multi-access flash store for big data analytics

For many "Big Data" applications, the limiting factor in performance is often the transportation of large amount of data from hard disks to where it can be processed, i.e. DRAM. In this paper we examine an architecture for a scalable distributed flash store which aims to overcome this limi...

Full description

Bibliographic Details
Main Authors: Jun, Sang-Woo (Author), Liu, Ming (Author), Fleming, Kermin Elliott (Author), Arvind, Arvind (Author)
Other Authors: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science (Contributor), Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory (Contributor), Massachusetts Institute of Technology. Department of Materials Science and Engineering (Contributor)
Format: Article
Language:English
Published: Association for Computing Machinery (ACM), 2019-06-28T15:45:00Z.
Subjects:
Online Access:Get fulltext
LEADER 02311 am a22002533u 4500
001 121450
042 |a dc 
100 1 0 |a Jun, Sang-Woo  |e author 
100 1 0 |a Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science  |e contributor 
100 1 0 |a Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory  |e contributor 
100 1 0 |a Massachusetts Institute of Technology. Department of Materials Science and Engineering  |e contributor 
700 1 0 |a Liu, Ming  |e author 
700 1 0 |a Fleming, Kermin Elliott  |e author 
700 1 0 |a Arvind, Arvind  |e author 
245 0 0 |a Scalable multi-access flash store for big data analytics 
260 |b Association for Computing Machinery (ACM),   |c 2019-06-28T15:45:00Z. 
856 |z Get fulltext  |u https://hdl.handle.net/1721.1/121450 
520 |a For many "Big Data" applications, the limiting factor in performance is often the transportation of large amount of data from hard disks to where it can be processed, i.e. DRAM. In this paper we examine an architecture for a scalable distributed flash store which aims to overcome this limitation in two ways. First, the architecture provides a highperformance, high-capacity, scalable random-access storage. It achieves high-throughput by sharing large numbers of flash chips across a low-latency, chip-to-chip backplane network managed by the flash controllers. The additional latency for remote data access via this network is negligible as compared to flash access time. Second, it permits some computation near the data via a FPGA-based programmable flash controller. The controller is located in the datapath between the storage and the host, and provides hardware acceleration for applications without any additional latency. We have constructed a small-scale prototype whose network bandwidth scales directly with the number of nodes, and where average latency for user software to access flash store is less than 70?s, including 3.5?s of network overhead. 
520 |a Quanta Computer Incorporated (#6922986) 
520 |a Samsung Electronics Co. (#6925093) 
546 |a en 
655 7 |a Article 
773 |t 10.1145/2554688.2554789 
773 |t Proceeding FPGA '14 Proceedings of the 2014 ACM/SIGDA international symposium on Field-programmable gate arrays