Please use this identifier to cite or link to this item:
http://hdl.handle.net/20.500.11861/8274
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Li, Qizhi | en_US |
dc.contributor.author | Zheng, Xubin | en_US |
dc.contributor.author | Xie, Jize | en_US |
dc.contributor.author | Wang, Ran | en_US |
dc.contributor.author | Li, Mengyao | en_US |
dc.contributor.author | Wong, Man-Hon | en_US |
dc.contributor.author | Leung, Kwong Sak | en_US |
dc.contributor.author | Li, Shuai | en_US |
dc.contributor.author | Geng, Qingshan | en_US |
dc.contributor.author | Cheng, Lixin | en_US |
dc.date.accessioned | 2023-10-17T01:24:22Z | - |
dc.date.available | 2023-10-17T01:24:22Z | - |
dc.date.issued | 2023 | - |
dc.identifier.citation | Bioinformatics, 2023, Vol. 39(3), article no. btad109. | en_US |
dc.identifier.issn | 1367-4811 | - |
dc.identifier.issn | 1367-4803 | - |
dc.identifier.uri | http://hdl.handle.net/20.500.11861/8274 | - |
dc.description.abstract | Motivation: The confusion of acute inflammation infected by virus and bacteria or noninfectious inflammation will lead to missing the best therapy occasion resulting in poor prognoses. The diagnostic model based on host gene expression has been widely used to diagnose acute infections, but the clinical usage was hindered by the capability across different samples and cohorts due to the small sample size for signature training and discovery. Results: Here, we construct a large-scale dataset integrating multiple host transcriptomic data and analyze it using a sophisticated strategy which removes batch effect and extracts the common information from different cohorts based on the relative expression alteration of gene pairs. We assemble 2680 samples across 16 cohorts and separately build gene pair signature (GPS) for bacterial, viral, and noninfected patients. The three GPSs are further assembled into an antibiotic decision model (bacterial–viral–noninfected GPS, bvnGPS) using multiclass neural networks, which is able to determine whether a patient is bacterial infected, viral infected, or noninfected. bvnGPS can distinguish bacterial infection with area under the receiver operating characteristic curve (AUC) of 0.953 (95% confidence interval, 0.948–0.958) and viral infection with AUC of 0.956 (0.951–0.961) in the test set (N = 760). In the validation set (N = 147), bvnGPS also shows strong performance by attaining an AUC of 0.988 (0.978–0.998) on bacterial-versus-other and an AUC of 0.994 (0.984–1.000) on viral-versus-other. bvnGPS has the potential to be used in clinical practice and the proposed procedure provides insight into data integration, feature selection and multiclass classification for host transcriptomics data. Availability and implementation: The codes implementing bvnGPS are available at https://github.com/Ritchiegit/bvnGPS. The construction of iPAGE algorithm and the training of neural network was conducted on Python 3.7 with Scikit-learn 0.24.1 and PyTorch 1.7. The visualization of the results was implemented on R 4.2, Python 3.7, and Matplotlib 3.3.4. | en_US |
dc.language.iso | en | en_US |
dc.relation.ispartof | Bioinformatics | en_US |
dc.title | BvnGPS: A generalizable diagnostic model for acute bacterial and viral infection using integrative host transcriptomics and pretrained neural networks | en_US |
dc.type | Peer Reviewed Journal Article | en_US |
dc.identifier.doi | 10.1093/bioinformatics/btad109 | - |
item.fulltext | No Fulltext | - |
crisitem.author.dept | Department of Applied Data Science | - |
Appears in Collections: | Applied Data Science - Publication |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.