ToxCodAn-Genome: an automated pipeline for toxin-gene annotation in genome assembly of venomous lineages
Author
Butantan affiliation
External affiliation
Publication type
Article
Language
English
Access rights
Open access
Terms of use
CC BY
Appears in Collections:
Metrics
Abstract
Background: The rapid development of sequencing technologies resulted in a wide expansion of genomics studies using venomous
lineages. This facilitated research focusing on understanding the evolution of adaptive traits and the search for novel compounds
that can be applied in agriculture and medicine. However, the toxin annotation of genomes is a laborious and time-consuming task,
and no consensus pipeline is currently available. No computational tool currently exists to address the challenges specific to toxin
annotation and to ensure the reproducibility of the process.
Results: Here, we present ToxCodAn-Genome, the first software designed to perform automated toxin annotation in genomes of
venomous lineages. This pipeline was designed to retrieve the full-length coding sequences of toxins and to allow the detection of
novel truncated paralogs and pseudogenes. We tested ToxCodAn-Genome using 12 genomes of venomous lineages and achieved high
performance on recovering their current toxin annotations. This tool can be easily customized to allow improvements in the final
toxin annotation set and can be expanded to virtually any venomous lineage. ToxCodAn-Genome is fast, allowing it to run on any
personal computer, but it can also be executed in multicore mode, taking advantage of large high-performance servers. In addition,
we provide a guide to direct future research in the venomics field to ensure a confident toxin annotation in the genome being studied.
As a case study, we sequenced and annotated the toxin repertoire of Bothrops alternatus, which may facilitate future evolutionary and
biomedical studies using vipers as models.
Conclusions: ToxCodAn-Genome is suitable to perform toxin annotation in the genome of venomous species and may help to improve
the reproducibility of further studies. ToxCodAn-Genome and the guide are freely available at https://github.com/pedronachtigall/T
oxCodAn-Genome.
Link to cite this reference
https://repositorio.butantan.gov.br/handle/butantan/5255
URL
https://doi.org/10.1093/gigascience/giad116
Journal title
Keywords
Issue Date
2024
Files in This Item:
This item is licensed under a Creative Commons License