Please use this identifier to cite or link to this item:
http://hdl.handle.net/20.500.11861/7532
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Li, Hongjian | en_US |
dc.contributor.author | Ni, Bing | en_US |
dc.contributor.author | Wong, Man-Hon | en_US |
dc.contributor.author | Prof. LEUNG Kwong Sak | en_US |
dc.date.accessioned | 2023-03-22T07:34:41Z | - |
dc.date.available | 2023-03-22T07:34:41Z | - |
dc.date.issued | 2011 | - |
dc.identifier.citation | Proceedings of the 2011 IEEE 9th Symposium on Application Specific Processors, SASP 2011, pp. 74 - 77, 2011 , Article number 5941082 | en_US |
dc.identifier.isbn | 978-145771211-1 | - |
dc.identifier.uri | http://hdl.handle.net/20.500.11861/7532 | - |
dc.description.abstract | The availability of huge amounts of nucleotide sequences catalyzes the development of fast algorithms for approximate DNA and RNA string matching. However, most existing online algorithms can only handle small scale problems. When querying large genomes, their performance becomes unacceptable. Offline algorithms such as Bowtie and BWA require building indexes, and their memory requirement is high. We have developed a fast CUDA implementation of agrep algorithm for approximate nucleotide sequence matching by exploiting the huge computational power of modern GPU hardware. Our CUDA program is capable of searching large genomes for patterns of length up to 64 with edit distance up to 9. For example, it is able to search the entire human genome (3.10 Gbp in 24 chromosomes) for patterns of lengths of 30 and 60 with edit distances of 3 and 6 within 371 and 1,188 milliseconds respectively on one NVIDIA GeForce GTX285 graphics card, achieving 70-fold and 36-fold speedups over multithreaded QuadCore CPU counterpart. Our program employs online approach and does not require building indexes of any kind, it thus can be applied in real time. Using two-bits-for-one-character binary representation, its memory requirement is merely one fourth of the original genome size. Therefore it is possible to load multiple genomes simultaneously. The x86 and x64 executables for Linux and Windows, C++ source code, documentations, user manual, and an AJAX MVC website for online real time searching are available at http://agrep.cse.cuhk.edu.hk. Users can also send emails to CUDAagrepGmail.com to queue up for a job. © 2011 IEEE. | en_US |
dc.language.iso | en | en_US |
dc.relation.ispartof | Proceedings of the 2011 IEEE 9th Symposium on Application Specific Processors, SASP 2011 | en_US |
dc.title | A fast CUDA implementation of agrep algorithm for approximate nucleotide sequence matching | en_US |
dc.type | Conference Proceedings | en_US |
dc.identifier.doi | 10.1109/SASP.2011.5941082 | - |
item.fulltext | No Fulltext | - |
crisitem.author.dept | Department of Applied Data Science | - |
Appears in Collections: | Applied Data Science - Publication |
SCOPUSTM
Citations
16
checked on Dec 15, 2024
Page view(s)
38
Last Week
1
1
Last month
checked on Dec 20, 2024
Google ScholarTM
Impact Indices
Altmetric
PlumX
Metrics
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.