From f59672679db271f6b24a41e215401ab5001ecd96 Mon Sep 17 00:00:00 2001 From: Mitja Felicijan Date: Tue, 31 Oct 2023 08:38:25 +0100 Subject: Theme updates --- .../2019-01-03-encoding-binary-data-into-dna-sequence.md | 16 +++++++--------- 1 file changed, 7 insertions(+), 9 deletions(-) (limited to 'content/posts/2019-01-03-encoding-binary-data-into-dna-sequence.md') diff --git a/content/posts/2019-01-03-encoding-binary-data-into-dna-sequence.md b/content/posts/2019-01-03-encoding-binary-data-into-dna-sequence.md index 0d44a40..8bd8d74 100644 --- a/content/posts/2019-01-03-encoding-binary-data-into-dna-sequence.md +++ b/content/posts/2019-01-03-encoding-binary-data-into-dna-sequence.md @@ -99,8 +99,8 @@ Deoxyribonucleic acid, a self-replicating material which is **present in nearly all living organisms** as the main constituent of chromosomes. It is the **carrier of genetic information**. -> The nitrogen in our DNA, the calcium in our teeth, the iron in our blood, -> the carbon in our apple pies were made in the interiors of collapsing stars. +> The nitrogen in our DNA, the calcium in our teeth, the iron in our blood, +> the carbon in our apple pies were made in the interiors of collapsing stars. > We are made of starstuff. > **-- Carl Sagan, Cosmos** @@ -134,9 +134,7 @@ As already mentioned, the Basic Encoding is based on a simple mapping. Since DNA is composed of 4 nucleotides (Adenine, Cytosine, Guanine, Thymine; usually referred using the first letter). Using this technique we can encode -
- -
+ using a single nucleotide. In this way, we are able to use the 4 bases that compose the DNA strand to encode each byte of data. @@ -196,7 +194,7 @@ sequence, often starting with a unique library accession number, and with time it has become commonplace to always use ">" for the first line and to not use ";" comments (which would otherwise be ignored). -``` +```txt ;LCBO - Prolactin precursor - Bovine ; a sample sequence in FASTA format MDSKGSSQKGSRLLLLLVVSNLLLCQGVVSTPVCPNGPGNCQVSLRDLFDRAVMVSHYIHDLSS @@ -273,7 +271,7 @@ First we encode text file into FASTA file. Output of `quote.fa` file contains the encoded DNA sequence in ASCII format. -``` +```txt >SEQ1 GACAGCTTGTGTACAAGTGTGCTTGCTCGCGAGCGGGTACGCGCGTGGGCTAACAAGTGA GCCAGCAGGTGAACAAGTGTGCGGACAAGCCAGCAGGTGCGCGGACAAGCTGGCGGGTGA @@ -325,7 +323,7 @@ I have created a toolkit with two main programs: - dnae-encode (encodes file into FASTA file) - dnae-png (encodes FASTA file into PNG) -Toolkit with full source code is available on +Toolkit with full source code is available on [github.com/mitjafelicijan/dna-encoding](https://github.com/mitjafelicijan/dna-encoding). ### dnae-encode @@ -383,7 +381,7 @@ We create following binary files: - 10MB.bin - 100MB.bin -After this we create FASTA files for all the binary files by encoding them +After this we create FASTA files for all the binary files by encoding them into DNA sequence. ```bash -- cgit v1.2.3