Please use this identifier to cite or link to this item: http://hdl.handle.net/20.500.11861/7532
Title: A fast CUDA implementation of agrep algorithm for approximate nucleotide sequence matching
Authors: Li, Hongjian 
Ni, Bing 
Wong, Man-Hon 
Prof. LEUNG Kwong Sak 
Issue Date: 2011
Source: Proceedings of the 2011 IEEE 9th Symposium on Application Specific Processors, SASP 2011, pp. 74 - 77, 2011 , Article number 5941082
Journal: Proceedings of the 2011 IEEE 9th Symposium on Application Specific Processors, SASP 2011 
Abstract: The availability of huge amounts of nucleotide sequences catalyzes the development of fast algorithms for approximate DNA and RNA string matching. However, most existing online algorithms can only handle small scale problems. When querying large genomes, their performance becomes unacceptable. Offline algorithms such as Bowtie and BWA require building indexes, and their memory requirement is high. We have developed a fast CUDA implementation of agrep algorithm for approximate nucleotide sequence matching by exploiting the huge computational power of modern GPU hardware. Our CUDA program is capable of searching large genomes for patterns of length up to 64 with edit distance up to 9. For example, it is able to search the entire human genome (3.10 Gbp in 24 chromosomes) for patterns of lengths of 30 and 60 with edit distances of 3 and 6 within 371 and 1,188 milliseconds respectively on one NVIDIA GeForce GTX285 graphics card, achieving 70-fold and 36-fold speedups over multithreaded QuadCore CPU counterpart. Our program employs online approach and does not require building indexes of any kind, it thus can be applied in real time. Using two-bits-for-one-character binary representation, its memory requirement is merely one fourth of the original genome size. Therefore it is possible to load multiple genomes simultaneously. The x86 and x64 executables for Linux and Windows, C++ source code, documentations, user manual, and an AJAX MVC website for online real time searching are available at http://agrep.cse.cuhk.edu.hk. Users can also send emails to CUDAagrepGmail.com to queue up for a job. © 2011 IEEE.
Type: Conference Proceedings
URI: http://hdl.handle.net/20.500.11861/7532
ISBN: 978-145771211-1
DOI: 10.1109/SASP.2011.5941082
Appears in Collections:Applied Data Science - Publication

Show full item record

SCOPUSTM   
Citations

16
checked on Jan 12, 2025

Page view(s)

41
Last Week
1
Last month
checked on Jan 19, 2025

Google ScholarTM

Impact Indices

Altmetric

PlumX

Metrics


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.