Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.11861/7508
DC FieldValueLanguage
dc.contributor.authorChan, Tak-Mingen_US
dc.contributor.authorLo, Leung-Yauen_US
dc.contributor.authorWong, Man-Leungen_US
dc.contributor.authorLiang, Yongen_US
dc.contributor.authorProf. LEUNG Kwong Saken_US
dc.date.accessioned2023-03-16T04:52:44Z-
dc.date.available2023-03-16T04:52:44Z-
dc.date.issued2013-
dc.identifier.citation2013 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), 2013, pp. 198-205en_US
dc.identifier.urihttp://hdl.handle.net/20.500.11861/7508-
dc.description.abstractDNA motif discovery is an important problem for deciphering protein-DNA bindings in gene regulation. To discover generic spaced motifs which have multiple conserved patterns separated by wild-cards called spacers, the genetic algorithm (GA) based GASMEN has been proposed and shown to outperform related methods. However, the over-generic modeling of any number of spacers increases the optimization difficulty in practice. In protein-DNA binding case studies, complicated spaced motifs are rare while dimers with single spacers are more common spaced motifs. Moreover, errors (mismatches) in a conserved pattern are not arbitrarily distributed as certain highly conserved nucleotides are essential to maintain bindings. Motivated by better optimization in real applications, we have developed a new method, which is GA for Dimer-led and Error-restricted Spaced Motifs (GADESM). Common spaced motifs are paid special attention to using dimer-led initialization in the population initialization. The results on real datasets show that the dimer-led initialization in GADESM achieves better fitness than GASMEN with statistical significance. With additional error-restricted motif occurrence retrieval, GADESM has shown better performance than GASMEN on both comprehensive simulation data and a real ChIP-seq case study.en_US
dc.language.isoenen_US
dc.publisherIEEEen_US
dc.relation.ispartof2013 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB)en_US
dc.titleGenetic algorithm for dimer-led and error-restricted spaced motif discoveryen_US
dc.typePeer Reviewed Journal Articleen_US
dc.identifier.doi10.1109/CIBCB.2013.6595409.-
item.fulltextNo Fulltext-
crisitem.author.deptDepartment of Applied Data Science-
Appears in Collections:Applied Data Science - Publication
Show simple item record

Page view(s)

37
Last Week
0
Last month
checked on Dec 20, 2024

Google ScholarTM

Impact Indices

Altmetric

PlumX

Metrics


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.