Utilizing a convolutional neural network to predict HIV-1 Tat biological functions and the impact of Tat genetic variation on neurocognitive impairment
Human immunodeficiency virus type 1 (HIV-1) infections have been associated with neurocognitive impairment. However, this cognitive decline occurs at a rate that is specific to individual patients. Evidence suggests that the protein sequence of HIV-1 Tat may have an impact on this rate. The mutations within Tat can affect the biological function of this viral protein. This study proposes to utilize deep learning techniques to perform inference on amino acid changes and their effects on Tat functions. To predict biological functions consistent with annotations of the Gene Ontology (GO) Consortium, a convolutional neural network (CNN) was developed and trained on more than 500,000 GO-annotated sequences from the Uniprot Knowledgebase. Analysis was performed on various lengths of Tat protein sequences, considering the prevalence of naturally truncated Tat protein sequences in HIV-1-infected patients from the Drexel Medicine CNS AIDS Research and Eradication Study (CARES) cohort. Results of processing truncated Tat subtype B sequences with lengths between 20 and 60 amino acids revealed a decreased sequence similarity with proteins found in the nucleus and an increased sequence similarity with proteins associated with pathogenesis, extracellular regions, and cellular toxicity. With the focus of examining which regions of Tat are responsible for its known biological functions, a windowing strategy was implemented. Using 30-mer windows, there was an increase in sequence v similarity to proteins associated with ribosomal function and translation for the Tat polypeptide region between residues 40 and 80. The CNN achieved an area under the receiver operating curve (AUC) of 0.985 for GO predictions and 0.541 for neurocognitive impairment. The evaluation metric of AUC has been used in several peer-reviewed papers, but it is not an ideal evaluation method. Although some neurocognitive impairment (NCI) prediction AUC scores are fairly high, there is little to no correlation between the predicted and calculated, actual score using Pearson correlation.M.S., Biomedical Science -- Drexel University, 201
Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.