Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.11861/7510
Title: GP-Pi: Using Genetic Programming with Penalization and Initialization on Genome-Wide Association Study
Authors: Sze-To, Ho-Yin 
Lee, Kwan-Yeung 
Tso, Kai-Yuen 
Wong, Man-Hon 
Lee, Kin-Hong 
Tang, Nelson L. S. 
Prof. LEUNG Kwong Sak 
Issue Date: 2013
Publisher: Springer, Berlin, Heidelberg.
Source: In: Rutkowski, L., Korytkowski, M., Scherer, R., Tadeusiewicz, R., Zadeh, L.A., Zurada, J.M. (eds) Artificial Intelligence and Soft Computing. ICAISC 2013. Lecture Notes in Computer Science(), vol 7895.
Journal: Artificial Intelligence and Soft Computing 
Abstract: The advancement of chip-based technology has enabled the measurement of millions of DNA sequence variations across the human genome. Experiments revealed that high-order, but not individual, interactions of single nucleotide polymorphisms (SNPs) are responsible for complex diseases such as cancer. The challenge of genome-wide association studies (GWASs) is to sift through high-dimensional datasets to find out particular combinations of SNPs that are predictive of these diseases. Genetic Programming (GP) has been widely applied in GWASs. It serves two purposes: attribute selection and/or discriminative modeling. One advantage of discriminative modeling over attribute selection lies in interpretability. However, existing discriminative modeling algorithms do not scale up well with the increase in the SNP dimension. Here, we have developed GP-Pi. We have introduced a penalizing term in the fitness function to penalize trees with common SNPs and an initializer which utilizes expert knowledge to seed the population with good attributes. Experimental results on simulated data suggested that GP-Pi outperforms GPAS with statistically significance. GP-Pi was further evaluated on a real GWAS dataset of Rheumatoid Arthritis, obtained from the North American Rheumatoid Arthritis Consortium. Our results, with potential new discoveries, are found to be consistent with literature
Type: Conference Paper
URI: http://hdl.handle.net/20.500.11861/7510
ISBN: 978-3-642-38609-1
978-3-642-38610-7
DOI: 10.1007/978-3-642-38610-7_31
Appears in Collections:Applied Data Science - Publication

Show full item record

SCOPUSTM   
Citations

1
checked on Nov 17, 2024

Page view(s)

47
Last Week
1
Last month
checked on Nov 27, 2024

Google ScholarTM

Impact Indices

Altmetric

PlumX

Metrics


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.