Extracting abundance information from DNA‐based data

Luo, Mingjie, Yinqiu, Ji, Warton, David and Yu, Douglas W. (2023) Extracting abundance information from DNA‐based data. Molecular Ecology Resources, 23 (1). pp. 174-189. ISSN 1755-098X

[thumbnail of Molecular Ecology Resources - 2022 - Luo - Extracting abundance information from DNA‐based data]
Preview
PDF (Molecular Ecology Resources - 2022 - Luo - Extracting abundance information from DNA‐based data) - Published Version
Available under License Creative Commons Attribution Non-commercial.

Download (4MB) | Preview
[thumbnail of Luo et al - Extracting abundance information from DNA-based data_20220827]
Preview
PDF (Luo et al - Extracting abundance information from DNA-based data_20220827)
Download (3MB) | Preview

Abstract

The accurate extraction of species-abundance information from DNA-based data (metabarcoding, metagenomics) could contribute usefully to diet analysis and foodweb reconstruction, the inference of species interactions, the modelling of population dynamics and species distributions, the biomonitoring of environmental state and change, and the inference of false positives and negatives. However, multiple sources of bias and noise in sampling and processing combine to inject error into DNA-based data sets. To understand how to extract abundance information, it is useful to distinguish two concepts. (i) Within-sample across-species quantification describes relative species abundances in one sample. (ii) Across-sample within-species quantification describes how the abundance of each individual species varies from sample to sample, such as over a time series, an environmental gradient or different experimental treatments. First, we review the literature on methods to recover across-species abundance information (by removing what we call “species pipeline biases”) and within-species abundance information (by removing what we call “pipeline noise”). We argue that many ecological questions can be answered with just within-species quantification, and we therefore demonstrate how to use a “DNA spike-in” to correct for pipeline noise and recover within-species abundance information. We also introduce a modelbased estimator that can be used on data sets without a physical spike-in to approximate and correct for pipeline noise.

Item Type: Article
Faculty \ School: Faculty of Science > School of Biological Sciences
UEA Research Groups: Faculty of Science > Research Centres > Centre for Ecology, Evolution and Conservation
Faculty of Science > Research Groups > Organisms and the Environment
Depositing User: LivePure Connector
Date Deposited: 29 Sep 2022 08:30
Last Modified: 17 May 2023 03:33
URI: https://ueaeprints.uea.ac.uk/id/eprint/88705
DOI: 10.1111/1755-0998.13703

Downloads

Downloads per month over past year

Actions (login required)

View Item View Item