Optimized Code Design for Constrained DNA Data Storage With Asymmetric Errors

  • Li Deng
  • , Yixin Wang
  • , M. D. Noor-A-Rahim
  • , Yong Liang Guan
  • , Zhiping Shi
  • , Erry Gunawan
  • , Chueh Loo Poh

Research output: Contribution to journalArticlepeer-review

Abstract

With ultra-high density and preservation longevity, deoxyribonucleic acid (DNA)-based data storage is becoming an emerging storage technology. Limited by the current biochemical techniques, data might be corrupted during the processes of DNA data storage. A hybrid coding architecture consisting of modified variable-length run-length limited (VL-RLL) codes and optimized protograph low-density parity-check (LDPC) codes is proposed in order to suppress error occurrence and correct asymmetric substitution errors. Based on the analyses of the different asymmetric DNA sequencer channel models, a series of the protograph LDPC codes are optimized using a modified extrinsic information transfer algorithm (EXIT). The simulation results show the better error performance of the proposed protograph LDPC codes over the conventional good codes and the codes used in the existing DNA data storage system. In addition, the theoretical analysis shows that the proposed hybrid coding scheme stores 1.98 bits per nucleotide (bits/nt) with only 1% gap from the upper boundary (2 bits/nt).

Original languageEnglish
Article number8746106
Pages (from-to)84107-84121
Number of pages15
JournalIEEE Access
Volume7
DOIs
Publication statusPublished - 2019

Keywords

  • asymmetric substitutions
  • constrained codes
  • DNA data storage
  • DNA sequencing
  • protograph LDPC codes

Fingerprint

Dive into the research topics of 'Optimized Code Design for Constrained DNA Data Storage With Asymmetric Errors'. Together they form a unique fingerprint.

Cite this