MetaDIA: A DDA-free Database Reduction Strategy for DIA Human Gut Metaproteomics

Duan, Haonan, Ning, Zhibin, Sun, Zhongzhi, Guo, Tiannan, Sun, Yingying and Figeys, Daniel (2026) MetaDIA: A DDA-free Database Reduction Strategy for DIA Human Gut Metaproteomics. Genomics, Proteomics & Bioinformatics. ISSN 1672-0229

PDF (qzag029) - Accepted Version
Available under License Creative Commons Attribution.


Abstract

Microbiomes, especially within the gut, are complex and may comprise hundreds of species. The identification of peptides in metaproteomics presents a substantial challenge, as it involves matching peptides to mass spectra within an enormous search space for complex and unknown samples. This poses difficulties for both the accuracy and the speed of identification. Specifically, analysis of data-independent acquisition (DIA) datasets has relied on libraries constructed from prior data-dependent acquisition (DDA) results. However, this method is resource-intensive, consumes samples, and limits identification to peptides previously identified. These limitations restrict the application of DIA in metaproteomics research. We introduced a novel strategy to reduce the search space by utilizing species abundance and functional abundance information from the microbiome to score each peptide and prioritize those most likely to be detected. Using this strategy, we have developed and optimized a workflow called MetaDIA for the analysis of microbiome data generated by DIA, which operates independently of DDA assistance. Our approach successfully created a smaller, yet sufficient database for DIA data search in metaproteomics. The results demonstrated strong consistency with the traditional DDA-based library approach at both protein and functional levels. MetaDIA is readily accessible as an open-source project hosted on GitHub (https://github.com/northomics/MetaDIA).
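The abstract describes scoring each peptide from species abundance and functional abundance, then keeping the peptides most likely to be detected. The sketch below illustrates that idea only. The abstract does not give the scoring formula or the cutoff, so the multiplicative score, the `keep_fraction` parameter, and every name here (`Peptide`, `score_peptide`, `reduce_database`) are assumptions for illustration, not the MetaDIA implementation.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Peptide:
    sequence: str
    species: str   # taxon assigned to the parent protein (hypothetical field)
    function: str  # functional annotation of the parent protein (hypothetical field)


def score_peptide(pep, species_abundance, function_abundance):
    """Assumed score: product of the two relative abundances.

    Peptides whose species or function is absent from the abundance
    tables score 0.0.
    """
    return (species_abundance.get(pep.species, 0.0)
            * function_abundance.get(pep.function, 0.0))


def reduce_database(peptides, species_abundance, function_abundance,
                    keep_fraction=0.5):
    """Rank peptides by score and keep the top `keep_fraction` of them."""
    if not 0.0 < keep_fraction <= 1.0:
        raise ValueError("keep_fraction must be in (0, 1]")
    ranked = sorted(
        peptides,
        key=lambda p: score_peptide(p, species_abundance, function_abundance),
        reverse=True,
    )
    if not ranked:
        return []
    # Always keep at least one peptide when the input is non-empty.
    n_keep = max(1, math.ceil(len(ranked) * keep_fraction))
    return ranked[:n_keep]
```

A product is used here so that a peptide must rank well on both axes to be retained; a weighted sum or a learned model would be equally plausible choices under the abstract's description.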

Item Type: Article
Additional Information: Data availability: The datasets generated in this study were sourced from ProteomeXchange Consortium (http://www.proteomexchange.org) with dataset identifier PXD063632.
Uncontrolled Keywords: metaproteomics, human gut microbiome, data-independent acquisition, data-dependent acquisition-free, diaPASEF
Faculty \ School: Faculty of Medicine and Health Sciences > Norwich Medical School
UEA Research Groups: Faculty of Medicine and Health Sciences > Research Centres > Metabolic Health
Depositing User: LivePure Connector
Date Deposited: 13 May 2026 10:27
Last Modified: 14 May 2026 15:16
URI: https://ueaeprints.uea.ac.uk/id/eprint/102983
DOI: 10.1093/gpbjnl/qzag029
