Balancing the Lifetime and Storage Overhead on Error Correction for Phase Change Memory.

As DRAM is facing the scaling difficulty in terms of energy cost and reliability, some nonvolatile storage materials were proposed to be the substitute or supplement of main memory. Phase Change Memory (PCM) is one of the most promising nonvolatile memory that could be put into use in the near futur...

Full description

Bibliographic Details
Main Authors: Ning An, Rui Wang, Yuan Gao, Hailong Yang, Depei Qian
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2015-01-01
Series:PLoS ONE
Online Access:http://europepmc.org/articles/PMC4497737?pdf=render
id doaj-e558ad35f5c34ba4ab781006cf52f29c
record_format Article
spelling doaj-e558ad35f5c34ba4ab781006cf52f29c2020-11-24T21:58:38ZengPublic Library of Science (PLoS)PLoS ONE1932-62032015-01-01107e013196410.1371/journal.pone.0131964Balancing the Lifetime and Storage Overhead on Error Correction for Phase Change Memory.Ning AnRui WangYuan GaoHailong YangDepei QianAs DRAM is facing the scaling difficulty in terms of energy cost and reliability, some nonvolatile storage materials were proposed to be the substitute or supplement of main memory. Phase Change Memory (PCM) is one of the most promising nonvolatile memory that could be put into use in the near future. However, before becoming a qualified main memory technology, PCM should be designed reliably so that it can ensure the computer system's stable running even when errors occur. The typical wear-out errors in PCM have been well studied, but the transient errors, that caused by high-energy particles striking on the complementary metal-oxide semiconductor (CMOS) circuit of PCM chips or by resistance drifting in multi-level cell PCM, have attracted little focus. In this paper, we propose an innovative mechanism, Local-ECC-Global-ECPs (LEGE), which addresses both soft errors and hard errors (wear-out errors) in PCM memory systems. Our idea is to deploy a local error correction code (ECC) section to every data line, which can detect and correct one-bit errors immediately, and a global error correction pointers (ECPs) buffer for the whole memory chip, which can be reloaded to correct more hard error bits. The local ECC is used to detect and correct the unknown one-bit errors, and the global ECPs buffer is used to store the corrected value of hard errors. In comparison to ECP-6, our method provides almost identical lifetimes, but reduces approximately 50% storage overhead. Moreover, our structure reduces approximately 3.55% access latency overhead by increasing 1.61% storage overhead compared to PAYG, a hard error only solution.http://europepmc.org/articles/PMC4497737?pdf=render
collection DOAJ
language English
format Article
sources DOAJ
author Ning An
Rui Wang
Yuan Gao
Hailong Yang
Depei Qian
spellingShingle Ning An
Rui Wang
Yuan Gao
Hailong Yang
Depei Qian
Balancing the Lifetime and Storage Overhead on Error Correction for Phase Change Memory.
PLoS ONE
author_facet Ning An
Rui Wang
Yuan Gao
Hailong Yang
Depei Qian
author_sort Ning An
title Balancing the Lifetime and Storage Overhead on Error Correction for Phase Change Memory.
title_short Balancing the Lifetime and Storage Overhead on Error Correction for Phase Change Memory.
title_full Balancing the Lifetime and Storage Overhead on Error Correction for Phase Change Memory.
title_fullStr Balancing the Lifetime and Storage Overhead on Error Correction for Phase Change Memory.
title_full_unstemmed Balancing the Lifetime and Storage Overhead on Error Correction for Phase Change Memory.
title_sort balancing the lifetime and storage overhead on error correction for phase change memory.
publisher Public Library of Science (PLoS)
series PLoS ONE
issn 1932-6203
publishDate 2015-01-01
description As DRAM is facing the scaling difficulty in terms of energy cost and reliability, some nonvolatile storage materials were proposed to be the substitute or supplement of main memory. Phase Change Memory (PCM) is one of the most promising nonvolatile memory that could be put into use in the near future. However, before becoming a qualified main memory technology, PCM should be designed reliably so that it can ensure the computer system's stable running even when errors occur. The typical wear-out errors in PCM have been well studied, but the transient errors, that caused by high-energy particles striking on the complementary metal-oxide semiconductor (CMOS) circuit of PCM chips or by resistance drifting in multi-level cell PCM, have attracted little focus. In this paper, we propose an innovative mechanism, Local-ECC-Global-ECPs (LEGE), which addresses both soft errors and hard errors (wear-out errors) in PCM memory systems. Our idea is to deploy a local error correction code (ECC) section to every data line, which can detect and correct one-bit errors immediately, and a global error correction pointers (ECPs) buffer for the whole memory chip, which can be reloaded to correct more hard error bits. The local ECC is used to detect and correct the unknown one-bit errors, and the global ECPs buffer is used to store the corrected value of hard errors. In comparison to ECP-6, our method provides almost identical lifetimes, but reduces approximately 50% storage overhead. Moreover, our structure reduces approximately 3.55% access latency overhead by increasing 1.61% storage overhead compared to PAYG, a hard error only solution.
url http://europepmc.org/articles/PMC4497737?pdf=render
work_keys_str_mv AT ningan balancingthelifetimeandstorageoverheadonerrorcorrectionforphasechangememory
AT ruiwang balancingthelifetimeandstorageoverheadonerrorcorrectionforphasechangememory
AT yuangao balancingthelifetimeandstorageoverheadonerrorcorrectionforphasechangememory
AT hailongyang balancingthelifetimeandstorageoverheadonerrorcorrectionforphasechangememory
AT depeiqian balancingthelifetimeandstorageoverheadonerrorcorrectionforphasechangememory
_version_ 1725850962170478592