A New Tag Index Scheme Enables Fast Peptide Retrieval for Protein Identification

Zhou, Piyu and Hou, Xinhang and Wang, Haipeng (2022) A New Tag Index Scheme Enables Fast Peptide Retrieval for Protein Identification. Journal of Computer and Communications, 10 (04). pp. 14-23. ISSN 2327-5219

[thumbnail of jcc_2022040813504028.pdf] Text
jcc_2022040813504028.pdf - Published Version

Download (3MB)

Abstract

Sequence tag index in the field of computational proteomics can be used to facilitate faster open-search-based identification of modified peptides and in-depth analysis of mass spectrometry data. In protein-identification search engines, sequence tag index are playing a prominent role in recent ten years due to fast searching speed. However, in pursuit of less index space consumption, some protein search engines design excessively concise index schemes which lead to higher computational burden. We proposed a new tag index scheme named TIIP with a better balance between space and time complexity. TIIP has a unique two-level hierarchical index structure which allows rapid retrieval of all peptide sequences and their corresponding masses. Theoretically, the index space consumption of TIIP is not much higher compared to the typical tag index schemes, but the time complexity of sequence retrieval can be reduced to O(1), and practically, TIIP has about one million fold improvement in searching speed compared with brute force approach.

Item Type: Article
Subjects: Digital Academic Press > Computer Science
Depositing User: Unnamed user with email support@digiacademicpress.org
Date Deposited: 11 May 2023 07:04
Last Modified: 25 Aug 2025 03:44
URI: http://core.ms4sub.com/id/eprint/1046

Actions (login required)

View Item
View Item