SFM: Mitigating Read/Write Amplification Problem of LSM-Tree-Based Key-Value Stores

Persistent key-value stores have been widely adopted as storage engines for modern IT infrastructures because they provide high performance with simple design principles. Moreover, many key-value stores commonly employ LSM-tree as their index structure due to its attractive features such as high wri...

Full description

Bibliographic Details
Main Authors: Hoyoung Lee, Minho Lee, Young Ik Eom
Format: Article
Language:English
Published: IEEE 2021-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/9491041/
id doaj-d549a9d5566f4d0a922844c0087ea600
record_format Article
spelling doaj-d549a9d5566f4d0a922844c0087ea6002021-07-27T23:00:51ZengIEEEIEEE Access2169-35362021-01-01910315310316610.1109/ACCESS.2021.30987369491041SFM: Mitigating Read/Write Amplification Problem of LSM-Tree-Based Key-Value StoresHoyoung Lee0https://orcid.org/0000-0002-2984-0914Minho Lee1https://orcid.org/0000-0003-0677-4480Young Ik Eom2https://orcid.org/0000-0001-6141-8054Department of Computer Science and Engineering, Sungkyunkwan University, Suwon, South KoreaDepartment of Electrical and Computer Engineering, Sungkyunkwan University, Suwon, South KoreaDepartment of Electrical and Computer Engineering, College of Computing and Informatics, Sungkyunkwan University, Suwon, South KoreaPersistent key-value stores have been widely adopted as storage engines for modern IT infrastructures because they provide high performance with simple design principles. Moreover, many key-value stores commonly employ LSM-tree as their index structure due to its attractive features such as high write throughput and storage space efficiency. Unfortunately, LSM-tree has critical drawbacks in that it leads to write/read amplification problem. One of the prevalent solutions for remedying the write amplification problem is the tiering merge policy that reduces the number of rewrites by delaying merge operations. However, in spite of this advantage, the tiering merge policy may lead to a side-effect that induces high read amplification, increasing search/scan cost for upcoming read operations. In this paper, we concentrate on mitigating the high read amplification problem of the tiering merge policy, while maintaining its low write amplification. To achieve this, we propose a novel LSM-tree scheme, called Spatially Fragmented LSM-tree (SFM), which delays merge operations only for the non-read-intensive key-spaces. For this, SFM identifies the read intensity of each key-spaces by dynamically estimating their read/write hotness. We have implemented SFM based on PebblesDB and evaluated the performance benefits of our scheme under real-world workloads of Facebook. Experimental results clearly show that our scheme improves throughput by up to <inline-formula> <tex-math notation="LaTeX">$1.67\times $ </tex-math></inline-formula> compared with the conventional schemes while maintaining low write amplification, and also indicate that its latency is lowered by up to 41.41&#x0025; on average by mitigating the read amplification problem of the existing schemes, by up to 43.68&#x0025;.https://ieeexplore.ieee.org/document/9491041/Persistent key-value storesLSM-treestorage managementdatabase management systems (DBMS)compactionmerge policy
collection DOAJ
language English
format Article
sources DOAJ
author Hoyoung Lee
Minho Lee
Young Ik Eom
spellingShingle Hoyoung Lee
Minho Lee
Young Ik Eom
SFM: Mitigating Read/Write Amplification Problem of LSM-Tree-Based Key-Value Stores
IEEE Access
Persistent key-value stores
LSM-tree
storage management
database management systems (DBMS)
compaction
merge policy
author_facet Hoyoung Lee
Minho Lee
Young Ik Eom
author_sort Hoyoung Lee
title SFM: Mitigating Read/Write Amplification Problem of LSM-Tree-Based Key-Value Stores
title_short SFM: Mitigating Read/Write Amplification Problem of LSM-Tree-Based Key-Value Stores
title_full SFM: Mitigating Read/Write Amplification Problem of LSM-Tree-Based Key-Value Stores
title_fullStr SFM: Mitigating Read/Write Amplification Problem of LSM-Tree-Based Key-Value Stores
title_full_unstemmed SFM: Mitigating Read/Write Amplification Problem of LSM-Tree-Based Key-Value Stores
title_sort sfm: mitigating read/write amplification problem of lsm-tree-based key-value stores
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2021-01-01
description Persistent key-value stores have been widely adopted as storage engines for modern IT infrastructures because they provide high performance with simple design principles. Moreover, many key-value stores commonly employ LSM-tree as their index structure due to its attractive features such as high write throughput and storage space efficiency. Unfortunately, LSM-tree has critical drawbacks in that it leads to write/read amplification problem. One of the prevalent solutions for remedying the write amplification problem is the tiering merge policy that reduces the number of rewrites by delaying merge operations. However, in spite of this advantage, the tiering merge policy may lead to a side-effect that induces high read amplification, increasing search/scan cost for upcoming read operations. In this paper, we concentrate on mitigating the high read amplification problem of the tiering merge policy, while maintaining its low write amplification. To achieve this, we propose a novel LSM-tree scheme, called Spatially Fragmented LSM-tree (SFM), which delays merge operations only for the non-read-intensive key-spaces. For this, SFM identifies the read intensity of each key-spaces by dynamically estimating their read/write hotness. We have implemented SFM based on PebblesDB and evaluated the performance benefits of our scheme under real-world workloads of Facebook. Experimental results clearly show that our scheme improves throughput by up to <inline-formula> <tex-math notation="LaTeX">$1.67\times $ </tex-math></inline-formula> compared with the conventional schemes while maintaining low write amplification, and also indicate that its latency is lowered by up to 41.41&#x0025; on average by mitigating the read amplification problem of the existing schemes, by up to 43.68&#x0025;.
topic Persistent key-value stores
LSM-tree
storage management
database management systems (DBMS)
compaction
merge policy
url https://ieeexplore.ieee.org/document/9491041/
work_keys_str_mv AT hoyounglee sfmmitigatingreadwriteamplificationproblemoflsmtreebasedkeyvaluestores
AT minholee sfmmitigatingreadwriteamplificationproblemoflsmtreebasedkeyvaluestores
AT youngikeom sfmmitigatingreadwriteamplificationproblemoflsmtreebasedkeyvaluestores
_version_ 1721279311975022592