Affordable Access

Publisher Website

Plant DNA barcoding necessitates marker-specific efforts to establish more comprehensive reference databases.

Authors
  • Kolter, Andreas1, 1
  • Gemeinholzer, Birgit1, 1
  • 1 Justus Liebig University Giessen, Systematic Botany, Heinrich-Buff-Ring 38, Giessen, Hessen, DE 35390, Germany. , (Germany)
Type
Published Article
Journal
Genome
Publication Date
Mar 01, 2021
Volume
64
Issue
3
Pages
265–298
Identifiers
DOI: 10.1139/gen-2019-0198
PMID: 32649839
Source
Medline
Keywords
Language
English
License
Unknown

Abstract

The problem of low species-level identification rates in plants by DNA barcoding is exacerbated by the fact that reference databases are far from being comprehensive. We investigate the impact of increased sampling depth on identification success by analyzing the efficacy of established plant barcode marker sequences (rbcL, matK, trnL-trnF, psbA-trnH, ITS). Adding sequences of the same species to the reference database led to an increase in correct species assignment of +10.9% for rbcL and +19.0% for ITS. Simultaneously, erroneous identification dropped from ∼40% to ∼12.5%. Despite its evolutionary constraints, ITS showed the highest identification rate and identification gain by increased sampling effort, which makes it a very suitable marker in the planning phase of a barcode study. The limited sequence availability of trnL-trnF is problematic for an otherwise very promising plastid plant barcoding marker. Future developments in machine learning algorithms have the potential to give new impetus to plant barcoding, but are dependent on extensive reference databases. We expect that our results will be incorporated into future plans for the development of DNA barcoding reference databases and will lead to these being developed with greater depth and taxonomic coverage.

Report this publication

Statistics

Seen <100 times