Assembling highly repetitive Xanthomonas TALomes using Oxford Nanopore sequencing

Publikation: Beitrag in FachzeitschriftArtikelForschungPeer-Review

Autoren

  • Annett Erkes
  • René P. Grove
  • Milena Žarković
  • Sebastian Krautwurst
  • Ralf Koebnik
  • Richard D. Morgan
  • Geoffrey G. Wilson
  • Martin Hölzer
  • Manja Marz
  • Jens Boch
  • Jan Grau

Organisationseinheiten

Externe Organisationen

  • Martin-Luther-Universität Halle-Wittenberg
  • Friedrich-Schiller-Universität Jena
  • Universität Montpellier
  • New England Biolabs
  • Robert Koch-Institut (RKI)
Forschungs-netzwerk anzeigen

Details

OriginalspracheEnglisch
Aufsatznummer151
FachzeitschriftBMC GENOMICS
Jahrgang24
PublikationsstatusVeröffentlicht - Dez. 2023

Abstract

Background: Most plant-pathogenic Xanthomonas bacteria harbor transcription activator-like effector (TALE) genes, which function as transcriptional activators of host plant genes and support infection. The entire repertoire of up to 29 TALE genes of a Xanthomonas strain is also referred to as TALome. The DNA-binding domain of TALEs is comprised of highly conserved repeats and TALE genes often occur in gene clusters, which precludes the assembly of TALE-carrying Xanthomonas genomes based on standard sequencing approaches. Results: Here, we report the successful assembly of the 5 Mbp genomes of five Xanthomonas strains from Oxford Nanopore Technologies (ONT) sequencing data. For one of these strains, Xanthomonas oryzae pv. oryzae (Xoo) PXO35, we illustrate why Illumina short reads and longer PacBio reads are insufficient to fully resolve the genome. While ONT reads are perfectly suited to yield highly contiguous genomes, they suffer from a specific error profile within homopolymers. To still yield complete and correct TALomes from ONT assemblies, we present a computational correction pipeline specifically tailored to TALE genes, which yields at least comparable accuracy as Illumina-based polishing. We further systematically assess the ONT-based pipeline for its multiplexing capacity and find that, combined with computational correction, the complete TALome of Xoo PXO35 could have been reconstructed from less than 20,000 ONT reads. Conclusions: Our results indicate that multiplexed ONT sequencing combined with a computational correction of TALE genes constitutes a highly capable tool for characterizing the TALomes of huge collections of Xanthomonas strains in the future.

ASJC Scopus Sachgebiete

  • Biochemie, Genetik und Molekularbiologie (insg.)
  • Biotechnologie
  • Biochemie, Genetik und Molekularbiologie (insg.)
  • Genetik

Zitieren

Assembling highly repetitive Xanthomonas TALomes using Oxford Nanopore sequencing. / Erkes, Annett; Grove, René P.; Žarković, Milena et al.
in: BMC GENOMICS, Jahrgang 24, 151, 12.2023.

Publikation: Beitrag in FachzeitschriftArtikelForschungPeer-Review

Erkes, A, Grove, RP, Žarković, M, Krautwurst, S, Koebnik, R, Morgan, RD, Wilson, GG, Hölzer, M, Marz, M, Boch, J & Grau, J 2023, 'Assembling highly repetitive Xanthomonas TALomes using Oxford Nanopore sequencing', BMC GENOMICS, Jg. 24, 151. https://doi.org/10.1186/s12864-023-09228-1
Erkes, A., Grove, R. P., Žarković, M., Krautwurst, S., Koebnik, R., Morgan, R. D., Wilson, G. G., Hölzer, M., Marz, M., Boch, J., & Grau, J. (2023). Assembling highly repetitive Xanthomonas TALomes using Oxford Nanopore sequencing. BMC GENOMICS, 24, Artikel 151. https://doi.org/10.1186/s12864-023-09228-1
Erkes A, Grove RP, Žarković M, Krautwurst S, Koebnik R, Morgan RD et al. Assembling highly repetitive Xanthomonas TALomes using Oxford Nanopore sequencing. BMC GENOMICS. 2023 Dez;24:151. doi: 10.1186/s12864-023-09228-1
Erkes, Annett ; Grove, René P. ; Žarković, Milena et al. / Assembling highly repetitive Xanthomonas TALomes using Oxford Nanopore sequencing. in: BMC GENOMICS. 2023 ; Jahrgang 24.
Download
@article{3e274346fe9946ec85943e5ca7254266,
title = "Assembling highly repetitive Xanthomonas TALomes using Oxford Nanopore sequencing",
abstract = "Background: Most plant-pathogenic Xanthomonas bacteria harbor transcription activator-like effector (TALE) genes, which function as transcriptional activators of host plant genes and support infection. The entire repertoire of up to 29 TALE genes of a Xanthomonas strain is also referred to as TALome. The DNA-binding domain of TALEs is comprised of highly conserved repeats and TALE genes often occur in gene clusters, which precludes the assembly of TALE-carrying Xanthomonas genomes based on standard sequencing approaches. Results: Here, we report the successful assembly of the 5 Mbp genomes of five Xanthomonas strains from Oxford Nanopore Technologies (ONT) sequencing data. For one of these strains, Xanthomonas oryzae pv. oryzae (Xoo) PXO35, we illustrate why Illumina short reads and longer PacBio reads are insufficient to fully resolve the genome. While ONT reads are perfectly suited to yield highly contiguous genomes, they suffer from a specific error profile within homopolymers. To still yield complete and correct TALomes from ONT assemblies, we present a computational correction pipeline specifically tailored to TALE genes, which yields at least comparable accuracy as Illumina-based polishing. We further systematically assess the ONT-based pipeline for its multiplexing capacity and find that, combined with computational correction, the complete TALome of Xoo PXO35 could have been reconstructed from less than 20,000 ONT reads. Conclusions: Our results indicate that multiplexed ONT sequencing combined with a computational correction of TALE genes constitutes a highly capable tool for characterizing the TALomes of huge collections of Xanthomonas strains in the future.",
keywords = "Bacteria, Genome assembly, Oxford Nanopore Technologies, Sequencing technologies, Transcription activator-like effectors",
author = "Annett Erkes and Grove, {Ren{\'e} P.} and Milena {\v Z}arkovi{\'c} and Sebastian Krautwurst and Ralf Koebnik and Morgan, {Richard D.} and Wilson, {Geoffrey G.} and Martin H{\"o}lzer and Manja Marz and Jens Boch and Jan Grau",
note = "Funding Information: Open Access funding enabled and organized by Projekt DEAL. This work was supported by grants from the Deutsche Forschungsgemeinschaft ( http://www.dfg.de ) (BO 1496/8-2 to JB, GR 4587/1-2 to JG and FZT 118/2 to SK and MM). Work in RK{\textquoteright}s laboratory was supported by a grant from the French Agence Nationale de la Recherche (ANR-2010-BLAN-1723). MM and SK were supported by the Ministry for Economics, Sciences and Digital Society of Thuringia (TMWWDG), under the framework of the Landesprogramm ProDigital (DigLeben-5575/10-9). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Funding Information: We acknowledge the financial support of the Open Access Publication Fund of the Martin Luther University Halle-Wittenberg ( https://www.uni-halle.de ). ",
year = "2023",
month = dec,
doi = "10.1186/s12864-023-09228-1",
language = "English",
volume = "24",
journal = "BMC GENOMICS",
issn = "1471-2164",
publisher = "BioMed Central Ltd.",

}

Download

TY - JOUR

T1 - Assembling highly repetitive Xanthomonas TALomes using Oxford Nanopore sequencing

AU - Erkes, Annett

AU - Grove, René P.

AU - Žarković, Milena

AU - Krautwurst, Sebastian

AU - Koebnik, Ralf

AU - Morgan, Richard D.

AU - Wilson, Geoffrey G.

AU - Hölzer, Martin

AU - Marz, Manja

AU - Boch, Jens

AU - Grau, Jan

N1 - Funding Information: Open Access funding enabled and organized by Projekt DEAL. This work was supported by grants from the Deutsche Forschungsgemeinschaft ( http://www.dfg.de ) (BO 1496/8-2 to JB, GR 4587/1-2 to JG and FZT 118/2 to SK and MM). Work in RK’s laboratory was supported by a grant from the French Agence Nationale de la Recherche (ANR-2010-BLAN-1723). MM and SK were supported by the Ministry for Economics, Sciences and Digital Society of Thuringia (TMWWDG), under the framework of the Landesprogramm ProDigital (DigLeben-5575/10-9). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Funding Information: We acknowledge the financial support of the Open Access Publication Fund of the Martin Luther University Halle-Wittenberg ( https://www.uni-halle.de ).

PY - 2023/12

Y1 - 2023/12

N2 - Background: Most plant-pathogenic Xanthomonas bacteria harbor transcription activator-like effector (TALE) genes, which function as transcriptional activators of host plant genes and support infection. The entire repertoire of up to 29 TALE genes of a Xanthomonas strain is also referred to as TALome. The DNA-binding domain of TALEs is comprised of highly conserved repeats and TALE genes often occur in gene clusters, which precludes the assembly of TALE-carrying Xanthomonas genomes based on standard sequencing approaches. Results: Here, we report the successful assembly of the 5 Mbp genomes of five Xanthomonas strains from Oxford Nanopore Technologies (ONT) sequencing data. For one of these strains, Xanthomonas oryzae pv. oryzae (Xoo) PXO35, we illustrate why Illumina short reads and longer PacBio reads are insufficient to fully resolve the genome. While ONT reads are perfectly suited to yield highly contiguous genomes, they suffer from a specific error profile within homopolymers. To still yield complete and correct TALomes from ONT assemblies, we present a computational correction pipeline specifically tailored to TALE genes, which yields at least comparable accuracy as Illumina-based polishing. We further systematically assess the ONT-based pipeline for its multiplexing capacity and find that, combined with computational correction, the complete TALome of Xoo PXO35 could have been reconstructed from less than 20,000 ONT reads. Conclusions: Our results indicate that multiplexed ONT sequencing combined with a computational correction of TALE genes constitutes a highly capable tool for characterizing the TALomes of huge collections of Xanthomonas strains in the future.

AB - Background: Most plant-pathogenic Xanthomonas bacteria harbor transcription activator-like effector (TALE) genes, which function as transcriptional activators of host plant genes and support infection. The entire repertoire of up to 29 TALE genes of a Xanthomonas strain is also referred to as TALome. The DNA-binding domain of TALEs is comprised of highly conserved repeats and TALE genes often occur in gene clusters, which precludes the assembly of TALE-carrying Xanthomonas genomes based on standard sequencing approaches. Results: Here, we report the successful assembly of the 5 Mbp genomes of five Xanthomonas strains from Oxford Nanopore Technologies (ONT) sequencing data. For one of these strains, Xanthomonas oryzae pv. oryzae (Xoo) PXO35, we illustrate why Illumina short reads and longer PacBio reads are insufficient to fully resolve the genome. While ONT reads are perfectly suited to yield highly contiguous genomes, they suffer from a specific error profile within homopolymers. To still yield complete and correct TALomes from ONT assemblies, we present a computational correction pipeline specifically tailored to TALE genes, which yields at least comparable accuracy as Illumina-based polishing. We further systematically assess the ONT-based pipeline for its multiplexing capacity and find that, combined with computational correction, the complete TALome of Xoo PXO35 could have been reconstructed from less than 20,000 ONT reads. Conclusions: Our results indicate that multiplexed ONT sequencing combined with a computational correction of TALE genes constitutes a highly capable tool for characterizing the TALomes of huge collections of Xanthomonas strains in the future.

KW - Bacteria

KW - Genome assembly

KW - Oxford Nanopore Technologies

KW - Sequencing technologies

KW - Transcription activator-like effectors

UR - http://www.scopus.com/inward/record.url?scp=85150979833&partnerID=8YFLogxK

U2 - 10.1186/s12864-023-09228-1

DO - 10.1186/s12864-023-09228-1

M3 - Article

C2 - 36973643

AN - SCOPUS:85150979833

VL - 24

JO - BMC GENOMICS

JF - BMC GENOMICS

SN - 1471-2164

M1 - 151

ER -