Project: PRJNA883718
In this study, we determined the nuclear genome sequence of the model diplonemid Diplonema papillatum to provide a reference genome for the clade.
The ~280-Mb genome assembly contains nearly 38,000 predicted protein-coding genes, likely co-transcribed in groups of up to 100. Gene clusters are separated by long repetitive regions that include numerous transposable elements, which also reside within introns. Analysis of gene family evolution reveals that the last common diplonemid ancestor underwent considerable metabolic expansions, mostly via horizontal gene transfer.
In addition to serving as a reference genome for the specious diplonemid clade, the D.papillatum nuclear genome sequence will allow systematic functional and cell-biological studies in D. papillatum, because we have recently put in place genetic-manipulation methodologies for this species. Further, genome information will be critical for comparative studies of gene complement evolution in the sister group of kinetoplastids that includes human-pathogenic taxa. It will become possible to demonstrate by phylogenetic analyses including D. papillatum sequences that genes previously thought to be confined to kinetoplastids were in fact emerged already present in the common ancestor of the two groups.