New telomere to telomere assembly of human chromosome 8 reveals a previous underestimation of G-quadruplex forming sequences and inverted repeats

Brázda, Václav, Bohálová, Natália and Bowater, Richard P. ORCID: https://orcid.org/0000-0002-2745-7807 (2022) New telomere to telomere assembly of human chromosome 8 reveals a previous underestimation of G-quadruplex forming sequences and inverted repeats. Gene, 810. ISSN 0378-1119

[thumbnail of Accepted_Version]
Preview
PDF (Accepted_Version) - Accepted Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.

Download (784kB) | Preview

Abstract

Taking advantage of evolving and improving sequencing methods, human chromosome 8 is now available as a gapless, end-to-end assembly. Thanks to advances in long-read sequencing technologies, its centromere, telomeres, duplicated gene families and repeat-rich regions are now fully sequenced. We were interested to assess if the new assembly altered our understanding of the potential impact of non-B DNA structures within this completed chromosome sequence. It has been shown that non-B secondary structures, such as G-quadruplexes, hairpins and cruciforms, have important regulatory functions and potential as targeted therapeutics. Therefore, we analysed the presence of putative G-quadruplex forming sequences and inverted repeats in the current human reference genome (GRCh38) and in the new end-to-end assembly of chromosome 8. The comparison revealed that the new assembly contains significantly more inverted repeats and G-quadruplex forming sequences compared to the current reference sequence. This observation can be explained by improved accuracy of the new sequencing methods, particularly in regions that contain extensive repeats of bases, as is preferred by many non-B DNA structures. These results show a significant underestimation of the prevalence of non-B DNA secondary structure in previous assembly versions of the human genome and point to their importance being not fully appreciated. We anticipate that similar observations will occur as the improved sequencing technologies fill in gaps across the genomes of humans and other organisms.

Item Type: Article
Faculty \ School: Faculty of Science > School of Biological Sciences
UEA Research Groups: Faculty of Science > Research Centres > Centre for Molecular and Structural Biochemistry
Faculty of Science > Research Groups > Biosciences Teaching and Education Research
Faculty of Science > Research Groups > Molecular Microbiology
Depositing User: LivePure Connector
Date Deposited: 03 Nov 2021 08:27
Last Modified: 21 Apr 2023 01:12
URI: https://ueaeprints.uea.ac.uk/id/eprint/81961
DOI: 10.1016/j.gene.2021.146058

Downloads

Downloads per month over past year

Actions (login required)

View Item View Item