Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.11861/7449
Title: Discovering Protein-DNA Binding Cores by Aligned Pattern Clustering
Authors: Lee, En-Shiun Annie 
Sze-To, Ho-Yin Antonio 
Wong, Man-Hon 
Prof. LEUNG Kwong Sak 
Lau, Terrence Chi-Kong 
Wong, Andrew K. C. 
Issue Date: 2015
Publisher: IEEE
Source: IEEE/ACM Transactions on Computational Biology and Bioinformatics, 2015, vol. 14(2), pp. 254-263
Journal: IEEE/ACM Transactions on Computational Biology and Bioinformatics 14(2), pp. 254-263 
Abstract: Understanding binding cores is of fundamental importance in deciphering Protein-DNA (TF-TFBS) binding and gene regulation. Limited by expensive experiments, it is promising to discover them with variations directly from sequence data. Although existing computational methods have produced satisfactory results, they are one-to-one mappings with no site-specific information on residue/nucleotide variations, where these variations in binding cores may impact binding specificity. This study presents a new representation for modeling binding cores by incorporating variations and an algorithm to discover them from only sequence data. Our algorithm takes protein and DNA sequences from TRANSFAC (a Protein-DNA Binding Database) as input; discovers from both sets of sequences conserved regions in Aligned Pattern Clusters (APCs); associates them as Protein-DNA Co-Occurring APCs; ranks the Protein-DNA Co-Occurring APCs according to their co-occurrence, and among the top ones, finds three-dimensional structures to support each binding core candidate. If successful, candidates are verified as binding cores. Otherwise, homology modeling is applied to their close matches in PDB to attain new chemically feasible binding cores. Our algorithm obtains binding cores with higher precision and much faster runtime ( ≥ 1,600x) than that of its contemporaries, discovering candidates that do not co-occur as one-to-one associated patterns in the raw data. Availability: http://www.pami.uwaterloo.ca/~ealee/files/tcbbPnDna2015/Release.zip .
Type: Peer Reviewed Journal Article
URI: http://hdl.handle.net/20.500.11861/7449
DOI: 10.1109/TCBB.2015.2474376
Appears in Collections:Applied Data Science - Publication

Show full item record

SCOPUSTM   
Citations

2
checked on Dec 15, 2024

Page view(s)

46
Last Week
1
Last month
checked on Dec 20, 2024

Google ScholarTM

Impact Indices

Altmetric

PlumX

Metrics


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.