Chromosome-scale genome sequencing, assembly and annotation of six genomes from subfamily Leishmaniinae

Sci Data. 2021 Sep 6;8(1):234. doi: 10.1038/s41597-021-01017-3.

Abstract

We provide the raw and processed data produced during the genome sequencing of isolates from six species of parasites from the sub-family Leishmaniinae: Leishmania martiniquensis (Thailand), Leishmania orientalis (Thailand), Leishmania enriettii (Brazil), Leishmania sp. Ghana, Leishmania sp. Namibia and Porcisia hertigi (Panama). De novo assembly was performed using Nanopore long reads to construct chromosome backbone scaffolds. We then corrected erroneous base calling by mapping short Illumina paired-end reads onto the initial assembly. Data has been deposited at NCBI as follows: raw sequencing output in the Sequence Read Archive, finished genomes in GenBank, and ancillary data in BioSample and BioProject. Derived data such as quality scoring, SAM files, genome annotations and repeat sequence lists have been deposited in Lancaster University's electronic data archive with DOIs provided for each item. Our coding workflow has been deposited in GitHub and Zenodo repositories. This data constitutes a resource for the comparative genomics of parasites and for further applications in general and clinical parasitology.

Publication types

  • Dataset
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Genome, Protozoan*
  • Genomics
  • Leishmania / classification*
  • Molecular Sequence Annotation
  • Phylogeny*
  • Repetitive Sequences, Nucleic Acid