Rizqiana, Annisa and Afiahayati, Afiahayati (2024) Protein Secondary Structure Prediction using N-Grams and 1-Dimensional Convolutional Neural Network. In: Joint 13th International Conference on Soft Computing and Intelligent Systems and 25th International Symposium on Advanced Intelligent Systems, SCIS and ISIS 2024, 9 November 2024through 12 November 2024, Himeji.
![[thumbnail of 135.Protein_Secondary_Structure_Prediction_using_N-Grams_and_1-Dimensional_Convolutional_Neural_Network.pdf]](https://ir.lib.ugm.ac.id/style/images/fileicons/text.png)
135.Protein_Secondary_Structure_Prediction_using_N-Grams_and_1-Dimensional_Convolutional_Neural_Network.pdf - Published Version
Restricted to Registered users only
Download (409kB) | Request a copy
Abstract
In this study, we initiated an approach to predict Protein Secondary Structure Prediction (PSSP) by using n-grams modeling, namely n adjacent amino acid sequences, to represent the amino acid sequence and 1-Dimensional Convolutional Neural Network (CNN). We compare n-grams with a one-hot encoding approach and find that n-grams provide better performance. The analysis results show that bigrams are more effective than trigrams in describing protein amino acid sequence patterns. We also combine it with the PSSM profile feature to improve model performance. The results of this study reveal that the n-grams approach, especially bigrams, in combination with PSSM, can produce more accurate protein secondary structure predictions than previous models, with superior accuracy evaluations of Q8 68.1 % and Q3 82.75% on the CB513 dataset.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Uncontrolled Keywords: | Bioinformatics; Convolutional Neural Network; N-grams; Protein Secondary Structure Prediction; Sequence Labeling |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Faculty of Mathematics and Natural Sciences > Computer Science & Electronics Department |
Depositing User: | Ismu WIDARTO |
Date Deposited: | 24 Jun 2025 04:15 |
Last Modified: | 24 Jun 2025 04:15 |
URI: | https://ir.lib.ugm.ac.id/id/eprint/19010 |