Tong, Kwok-KitKwok-KitTongCheung, Kwan-YauKwan-YauCheungLee, Kin-HongKin-HongLeeProf. LEUNG Kwong Sak2023-03-022023-03-0220152015 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB97814799692659781479969258http://hdl.handle.net/20.500.11861/7462Classification on pseudoknots existence is a challenging and meaningful problem in Bioinformatics. As predicting RNA secondary structures with pseudoknots is NP-complete problem while predicting pseudoknot-free structures can be done in O(n 3 ) time, if a preliminary pseudoknots existence classification of RNA sequence can be done before the prediction, the classification result can enhance the efficiency of RNA secondary structure prediction. In this paper, a classification of the existence of pseudoknots in an RNA sequence is presented. A set of features have been chosen by partial sequence content and thousands of RNA sequences with validated structures are used to train the classifier. Using a validated testing dataset, this classification method is shown to achieve a very good performance that the best result get 87% accuracy in 10-fold cross validation and around 75% accuracy in testing data. Moreover it may reveal how partial sequence content can affect the formation of pseudoknots.enClassification of RNA sequences with pseudoknots using features based on partial sequencesConference Paper10.1109/CIBCB.2015.7300277