Encoding binary data into DNA sequence

post, Jan 3, 2019 on Mitja Felicijan's blog

Initial thoughts

Imagine a world where you could go outside and take a leaf from a tree and put @@ -177,21 +177,18 @@ like.

Sample binary file 1KB

We create following binary files:

  • 1KB.bin
  • 10KB.bin
  • 100KB.bin
  • 1MB.bin
  • 10MB.bin
  • 100MB.bin

After this we create FASTA files for all the binary files by encoding them into DNA sequence.

./dnae-encode -i 100MB.bin -o 100MB.fa
 

Then we GZIP all the FASTA files to see how much the can be compressed.

gzip -9 < 10MB.fa > 10MB.fa.gz
-
Encode to FASTA

The speed increase that occurs when encoding to FASTA format.

File sizes

Size of the out file after encoding.

Download CSV file with benchmarks.

References