The Entire Human Genome Has Been Sequenced
June 2, 2021
- The International Human Genome Sequencing Consortium and Celera Genomics published the initial drafts of the human genome in 2001.
- The initial human genome publication revolutionized the field of genomics.
- The drafts and the follow-up updates covered the euchromatic part of the genome.
- However, heterochromatin and many other complex regions were left incomplete or incorrect.
- Euchromatin, as opposed to heterochromatin, is loosely packed chromatin that is genetically active and is usually being transcribed.
- This incomplete or erroneous portion makes up about 8% of the genome.
- The Telomere-to-Telomere (T2T) Consortium has finished the first complete sequence of a human genome.
- The work is currently a preprint and has not yet been peer-reviewed.
- The new, improved human reference genome is a 3.055 billion base pair sequence.
- The new human reference genome includes gapless assemblies for all 22 autosomes plus chromosome X.
- The new reference also corrects a number of errors and introduces almost 200 million base pairs of new sequence containing 2,226 paralogous gene copies.
- Paralogous genes are genes within a genome that descend from the same ancestral gene through gene duplication during evolution.
- Of the paralogous gene copies, 115 are predicted to code for proteins.
- The newly completed regions include all centromeric satellite arrays and the short arms of all five acrocentric chromosomes.
- Satellite arrays play significant roles in heterochromatin formation, genome stability, reproductive isolation, dosage compensation, and evolution.
- An acrocentric chromosome has its centromere close to one end, so its short arm is very small.
- For the first time, the new assembly opens these complex regions of the genome to functional and variational studies.
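
To put the figures above in proportion, here is a minimal Python sketch using only the numbers quoted in this summary. Treating "almost 200 million" as exactly 200,000,000 base pairs is an assumption made for illustration, so the first percentage is a rough upper estimate.

```python
# Figures quoted in the summary above. "Almost 200 million" is rounded up to
# 200,000,000 bp here (an assumption), so the first result is approximate.
TOTAL_BP = 3_055_000_000        # size of the new reference sequence
NEW_BP_APPROX = 200_000_000     # newly introduced sequence (upper estimate)
PARALOG_COPIES = 2_226          # paralogous gene copies in the new sequence
PROTEIN_CODING_COPIES = 115     # copies predicted to code for proteins


def percent(part: float, whole: float) -> float:
    """Return part as a percentage of whole."""
    return 100.0 * part / whole


new_sequence_pct = percent(NEW_BP_APPROX, TOTAL_BP)
coding_paralog_pct = percent(PROTEIN_CODING_COPIES, PARALOG_COPIES)

print(f"New sequence: about {new_sequence_pct:.1f}% of the assembly")
print(f"Predicted protein-coding: about {coding_paralog_pct:.1f}% of paralogous copies")
```

Under these assumptions, the new sequence comes to roughly 6.5% of the assembly, and about 5.2% of the paralogous gene copies are predicted to be protein-coding.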
Sergey Nurk, Sergey Koren, Arang Rhie, Mikko Rautiainen, Andrey V. Bzikadze, Alla Mikheenko, Mitchell R. Vollger, Nicolas Altemose, Lev Uralsky, Ariel Gershman, Sergey Aganezov, Savannah J. Hoyt, Mark Diekhans, Glennis A. Logsdon, Michael Alonge, Stylianos E. Antonarakis, Matthew Borchers, Gerard G. Bouffard, Shelise Y. Brooks, Gina V. Caldas, Haoyu Cheng, Chen-Shan Chin, William Chow, Leonardo G. de Lima, Philip C. Dishuck, Richard Durbin, Tatiana Dvorkina, Ian T. Fiddes, Giulio Formenti, Robert S. Fulton, Arkarachai Fungtammasan, Erik Garrison, Patrick G.S. Grady, Tina A. Graves-Lindsay, Ira M. Hall, Nancy F. Hansen, Gabrielle A. Hartley, Marina Haukness, Kerstin Howe, Michael W. Hunkapiller, Chirag Jain, Miten Jain, Erich D. Jarvis, Peter Kerpedjiev, Melanie Kirsche, Mikhail Kolmogorov, Jonas Korlach, Milinn Kremitzki, Heng Li, Valerie V. Maduro, Tobias Marschall, Ann M. McCartney, Jennifer McDaniel, Danny E. Miller, James C. Mullikin, Eugene W. Myers, Nathan D. Olson, Benedict Paten, Paul Peluso, Pavel A. Pevzner, David Porubsky, Tamara Potapova, Evgeny I. Rogaev, Jeffrey A. Rosenfeld, Steven L. Salzberg, Valerie A. Schneider, Fritz J. Sedlazeck, Kishwar Shafin, Colin J. Shew, Alaina Shumate, Yumi Sims, Arian F. A. Smit, Daniela C. Soto, Ivan Sović, Jessica M. Storer, Aaron Streets, Beth A. Sullivan, Françoise Thibaud-Nissen, James Torrance, Justin Wagner, Brian P. Walenz, Aaron Wenger, Jonathan M. D. Wood, Chunlin Xiao, Stephanie M. Yan, Alice C. Young, Samantha Zarate, Urvashi Surti, Rajiv C. McCoy, Megan Y. Dennis, Ivan A. Alexandrov, Jennifer L. Gerton, Rachel J. O’Neill, Winston Timp, Justin M. Zook, Michael C. Schatz, Evan E. Eichler, Karen H. Miga, Adam M. Phillippy
bioRxiv 2021.05.26.445798; doi: https://doi.org/10.1101/2021.05.26.445798
Keywords: human genome project, genome sequence, human genome, gene sequencing