Efficient Constraining of Transcoding in DNA-Based Image Storage
October 07, 2025 Β· Declared Dead Β· π International Conference on Information Photonics
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Sara Al Sayyed, Aline Roumy, Thomas Maugey
arXiv ID
2511.14771
Category
q-bio.OT
Cross-listed
cs.MM
Citations
0
Venue
International Conference on Information Photonics
Last Checked
3 months ago
Abstract
DNA has emerged as a promising alternative for long-term data storage due to its high capacity, durability, and low-energy potential. However, storing data in DNA presents several challenges. First, it requires complex and costly biochemical processes, making efficient compression crucial to reducing DNA synthesis time and cost. Second, these processes are prone to errors that must be avoided and/or corrected. In particular, homopolymers (repetitions of the same nucleotide) are a wellknown source of errors during the sequencing step. Avoiding such repetitions helps mitigate errors but introduces a constraint that may increase the data compression rate. In this paper, we propose two transcoding methods that address these two key challenges: reducing data rate and minimizing errors. The first method strictly enforces the error-minimization constraint by eliminating homopolymers of a certain length, at the cost of an increased data rate. In contrast, the second method accepts a slight increase in homopolymers. However, we show that these increases remain limited (2.14% increase in compression rate for the first method and 0.39% homopolymer rate for the second). These two approaches demonstrate that it is possible to efficiently constrain transcoding while balancing error minimization and compression performance.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β q-bio.OT
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
A Probabilistic Framework for Quantifying Biological Complexity
R.I.P.
π»
Ghosted
An Introduction to Programming for Bioscientists: A Python-based Primer
R.I.P.
π»
Ghosted
Ultrametrics in the genetic code and the genome
R.I.P.
π»
Ghosted
Nutritionally recommended food for semi- to strict vegetarian diets based on large-scale nutrient composition data
R.I.P.
π»
Ghosted
Comparing Machine Learning Algorithms with or without Feature Extraction for DNA Classification
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted