Bioinformatics Methods for Prediction of Gene Families Encoding Extracellular Peptides
- Authors
- Publication Date
- Nov 30, 2024
- Identifiers
- DOI: 10.1007/978-1-0716-3511-7_1
- OAI: oai:HAL:hal-04394046v1
- Source
- HAL-Descartes
- Keywords
- Language
- English
- License
- Unknown
- External links
Abstract
Genes encoding small secreted peptides are widely distributed among plant genomes but their detection and annotation remains challenging. The bioinformatics protocol described here aims to identify as exhaustively as possible secreted peptide precursors belonging to a family of interest. First, homology searches are performed at the protein and genome levels. Next, multiple sequence alignments and predictions of a secretion signal are used to define a set of homologous proteins sharing features of secreted peptide precursors. These protein sequences are then used as input of motif detection and profile-based tools to build representative matrices and profiles that are used iteratively as guides to scan again the proteome and genome until family completion.