Why does this RNA virus look like DNA?
The other day, I was diffing coronaviruses: taking the long strings of GCAT characters that make up the COVID-19 genome, and pairing them up with the GCAT characters of similar viruses. I didn’t realize it at the time, but there’s an oddity here.
Coronavirus is not made of DNA; it’s made of RNA. RNA looks like GCAU, with Uracil instead of Thymine. But these files are full of GCAT, like DNA.
What was going on here?
Was the genome sequence lying to me, or does COVID-19 really contain DNA, rather than RNA?
The answer, it turned out, is that the sequence is a lie!
We must replace all Ts with Us to get a faithful sequence of the coronavirus RNA.
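Here is a minimal sketch of that substitution in Python. The function name `dna_letters_to_rna` and the short fragment are my own illustrations, not part of any genome file or bioinformatics library.

```python
def dna_letters_to_rna(sequence: str) -> str:
    """Rewrite a sequence stored in the DNA alphabet (GCAT) in the RNA alphabet (GCAU)."""
    # Normalise case first, since some sequence files use lowercase letters.
    return sequence.upper().replace("T", "U")


# Illustrative fragment only; it is not taken from a real genome file.
fragment = "GATTACA"
print(dna_letters_to_rna(fragment))  # GAUUACA
```

This only swaps letters. It assumes the file stores the same strand as the viral RNA, just written in the DNA alphabet.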
So, why is it represented this way?
The reason is that this is a sequencing of DNA which was generated from the original RNA. Apparently, nearly all RNA sequencing is done this way, because tooling for DNA sequencing is cheaper and more mature, and DNA is more stable than RNA. This process uses an enzyme called reverse transcriptase to convert the RNA to DNA. More precisely, this creates a complementary DNA, or “cDNA”.
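To see why the resulting file reads like the original RNA with T in place of U, here is a toy model of the round trip. It is a sketch under simplifying assumptions: perfect base pairing, no sequencing errors, and a final file that reports the strand with the same sense as the RNA. The function names are hypothetical, not from any real library.

```python
# DNA base that pairs with each RNA base during reverse transcription.
RNA_TO_DNA_PAIR = {"A": "T", "U": "A", "G": "C", "C": "G"}
# DNA base that pairs with each DNA base when the second strand is made.
DNA_TO_DNA_PAIR = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_transcribe(rna: str) -> str:
    """First-strand cDNA: complementary to the RNA, so it reads in the reverse direction."""
    return "".join(RNA_TO_DNA_PAIR[base] for base in reversed(rna))


def second_strand(cdna: str) -> str:
    """DNA strand complementary to the cDNA, again read in the reverse direction."""
    return "".join(DNA_TO_DNA_PAIR[base] for base in reversed(cdna))


rna = "GAUUACA"  # illustrative fragment, not real viral sequence
cdna = reverse_transcribe(rna)
sense_dna = second_strand(cdna)

print(cdna)       # TGTAATC
print(sense_dna)  # GATTACA

# The second DNA strand is the original RNA with every U written as T.
assert sense_dna == rna.replace("U", "T")
```

Complementing twice gets back to the original letters, except that DNA has no U, so each one comes back as a T.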
I was able to answer this with the help of the Bioinformatics Stack Exchange.