Ciro Santilli $$ Sponsor Ciro $$ 中国独裁统治 China Dictatorship 新疆改造中心、六四事件、法轮功、郝海东、709大抓捕、2015巴拿马文件 邓家贵、低端人口、西藏骚乱
  • 3 Gbps
  • 20k genes
  • 37.2 trillion cells[ref]
gunzip --keep GRCh38_latest_genomic.fna.gz
The point of these is that they are good for transfection apparently.
20k genes, 3 billion base pairs. We can handle this!!!
This is really cool. Ciro Santilli would be tempted to participate, but his wife is not a fan, in part due to the loss of privacy of children. Maybe she is right...
Someone should implement a version of that where you can upload your privately sequenced genome and get analytics for free.
This was the first large part of the genome that was sequenced, in 1981: Cambridge Reference Sequence. Presumably they picked it because it is short and does not undergo crossover.
About 16.6 kbp:
TODO: many places say "exactly" 16,569, it seems that variable number tandem repeat are either rare or don't occur!
By Fred Sanger's group.
As mentioned by Craig Venter in 100 Greatest Discoveries by the Discovery Channel (2004-2005), it established the ballpark number of human genes, and showed that human genomes are very similar across individuals.
Important predecessors:
This was one of the first notable country-led large scale sequencing efforts of the world.
UniProt human: It is interesting to see in the Mutagenesis how many known mutations can increase or decrease SARS-CoV-2 S protein binding affinity.