Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.11861/7540
DC FieldValueLanguage
dc.contributor.authorNi, Bingen_US
dc.contributor.authorLo, Leung Yauen_US
dc.contributor.authorProf. LEUNG Kwong Saken_US
dc.date.accessioned2023-03-23T03:26:25Z-
dc.date.available2023-03-23T03:26:25Z-
dc.date.issued2010-
dc.identifier.citationProceedings - 2010 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2010, pp. 369 - 372, Article number 5706593en_US
dc.identifier.isbn978-142448307-5-
dc.identifier.urihttp://hdl.handle.net/20.500.11861/7540-
dc.description.abstractIn this work, we define generalized (sequence) patterns, which is based on several real Biological problems, including transcription factors (TFs) binding to transcription factor binding sites (TFBSs), cis-regulatory modules, protein domain analysis, and alternative splicing etc. Simply speaking, a generalized pattern is composed of several substrings with gaps in-between two substrings. We propose a generalized pattern matching algorithm that uses a complementary dual-seeding strategy, which is sensitive to errors (both mismatches and indels). We also develop a generalized pattern matching tool 1, which is to our knowledge the first ever developed specially for generalized pattern matching. Rather than replacing the existing general purpose matching tools, such as BLAST, BLAT, and PatternHunter etc, our tool provides an alternative and helps users to solve real problems, especially those that can be modeled as generalized patterns. We use data randomly sampled from reference sequences of human genome (NCBI build v18) in experiments, and hit 98.74% generalized patterns on average. The tool runs on both LINUX and Windows platforms, and the memory peak goes to a little bit larger than 1GB only. ©2010 IEEE.en_US
dc.language.isoenen_US
dc.relation.ispartofProceedings - 2010 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2010en_US
dc.titleA generalized sequence pattern matching algorithm using complementary dual-seedingen_US
dc.typeConference Paperen_US
dc.identifier.doi10.1109/BIBM.2010.5706593-
item.fulltextNo Fulltext-
crisitem.author.deptDepartment of Applied Data Science-
Appears in Collections:Applied Data Science - Publication
Show simple item record

SCOPUSTM   
Citations

1
checked on Dec 15, 2024

Page view(s)

29
Last Week
0
Last month
checked on Dec 20, 2024

Google ScholarTM

Impact Indices

Altmetric

PlumX

Metrics


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.