TY - GEN
T1 - Using Reduced Amino-Acid Alphabets and Simulated Annealing to Identify Antimicrobial Peptides
AU - Healy, John
AU - Caprani, Michela
AU - Slattery, Orla
AU - O’Keeffe, Joan
N1 - Publisher Copyright:
© 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2022
Y1 - 2022
N2 - The efficient detection of similarity between biological sequences is a fundamental task in bioinformatics. This paper describes a k-mer approach for identifying and classifying antimicrobial peptide sequences using 64-bit encoded multiple spaced seeds and a suite of reduced amino acid alphabets. We implemented and tested the approach using a total of 74 reduced alphabets that were either published, altered using simulated annealing, or randomly generated. Our results show that the approach is very accurate and that all of the reduced alphabets of sizes between 9 and 16 were equally effective and far more accurate than smaller sized alphabets. Our custom designed alphabets exhibited higher sensitivity for some families of AMP than any of the published reduced alphabets that we tested.
AB - The efficient detection of similarity between biological sequences is a fundamental task in bioinformatics. This paper describes a k-mer approach for identifying and classifying antimicrobial peptide sequences using 64-bit encoded multiple spaced seeds and a suite of reduced amino acid alphabets. We implemented and tested the approach using a total of 74 reduced alphabets that were either published, altered using simulated annealing, or randomly generated. Our results show that the approach is very accurate and that all of the reduced alphabets of sizes between 9 and 16 were equally effective and far more accurate than smaller sized alphabets. Our custom designed alphabets exhibited higher sensitivity for some families of AMP than any of the published reduced alphabets that we tested.
UR - http://www.scopus.com/inward/record.url?scp=85115248801&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-86258-9_2
DO - 10.1007/978-3-030-86258-9_2
M3 - Conference contribution
AN - SCOPUS:85115248801
SN - 9783030862572
T3 - Lecture Notes in Networks and Systems
SP - 11
EP - 21
BT - Practical Applications of Computational Biology and Bioinformatics, 15th International Conference, PACBB 2021
A2 - Rocha, Miguel
A2 - Fdez-Riverola, Florentino
A2 - Mohamad, Mohd Saberi
A2 - Casado-Vara, Roberto
PB - Springer Science and Business Media Deutschland GmbH
T2 - 15th International Conference on Practical Applications of Computational Biology and Bioinformatics, PACBB 2021
Y2 - 6 October 2021 through 8 October 2021
ER -