Search CORE

16 research outputs found

Machine Learning Approaches for the Integrative Analysis of Multi-omics Data

Author: Shang Gao (1525495)
Publication venue
Publication date: 01/12/2022
Field of study

Multi-omics data is a collective term which refers to multiple levels of “-omic” sequencing datasets. It can provide a comprehensive understanding of the regulatory mechanisms that link the genotype to phenotype. A large number of “omics” datasets have been generated from studies in various biological scenarios but there is a need for advanced analytical tools that derive meaningful biological insights from these datasets. In this thesis, three projects have been developed for integrating multiple “omics” datasets to answer distinct biological questions. In the first project, dimension reduction and clustering methods are applied on single cell RNA-sequencing data from lung endothelial cells to identify the cellular heterogeneity within the lung endothelium during lung injury and regeneration. We identified three major subpopulations in lung endothelial cells at baseline and at the time points post injury. One subpopulation is enriched for the expression of immune-related genes while another subpopulation is enriched for the expression of developmental genes. In the second project, a Bayesian Inference model, BITFAM, was developed to infer the transcription factors activities in single cells by integrating single cell RNA-seq data and bulk ChIP-seq data. We were able to validate that BITFAM could indeed infer the transcription factors activities using known biological functions as well as a publicly available dataset in which transcription factor deletion was achieved by CRISPR/Cas9 targeting of transcription factors. In the third project, we assessed the relative effect of DNA sequences and epigenetics modifications on gene expression using a deep learning framework, iSEGnet. I investigated the optimal regions that achieve the best prediction of gene expression. I also explored the most important regions and epigenetics modifications that impact gene expression as well as the regulatory mechanisms in these regions. These projects highlight the value of using machine learning and deep learning approaches to analyze multi-omic datasets and thereby identify regulatory mechanisms underlying gene expression

University of Illinois at Chicago: UIC INDIGO (INtellectual property in DIGital form available online in an Open environment)

Rhodium-Catalyzed Hydroacylation of para-Quinone Methides with Salicylaldehydes: An Approach to α,α-Diaryl-2-Hydroxy Acetophenones

Author: Aijun Lin (1393531)
Chi Yang (130445)
Hequan Yao (1393537)
Shang Gao (1525495)
Publication venue
Publication date
Field of study

A rhodium-catalyzed hydroacylation of para-quinone methides (p-QMs) with salicylaldehydes has been disclosed. This method allows for the construction of α,α-diaryl-2-hydroxy acetophenones through tandem C–H activation/C–C bond formation/aromatization process. Moreover, this unprecedented hydroacylation of trisubstituted alkenes exhibits good yields with broad functional group tolerance as well as gram-scale capacity

FigShare

Sequence Effect on the Topology of 3 + 1 Interlocked Bimolecular DNA G‑Quadruplexes

Author: Shang Gao (1525495)
Xinhua Guo (10289)
Yanwei Cao (2631157)
Yuting Yan (2631160)
Publication venue
Publication date
Field of study

Electrospray ionization mass spectrometry (ESI-MS) combined with fluorescence, circular dichroism, UV spectrophotometer, and native polyacrylamide gel electrophoresis techniques are used to study structural features of interlocked dimers formed by DNA sequence 93del (GGGGTGGGAGGAGGGT) and its derivatives. Herein, we demonstrate that the interlocked dimers can be distinguished from stacked dimers formed by sequences T30923 (GGGTGGGTGGGTGGGT) and T30177 (GTGGTGGGTGGGTGGGT). In addition, loop length, the base at 5′-end, and the isolation of T and TT to the first 4G tract do significantly influence the formation and topologies of interlocked dimers. Furthermore, our results suggest that the 4G tract and the 2G tract in various locations in the 93del derivative sequence can form interlocked structure. This work not only provides new insight into the assembly of 3 + 1 interlocked DNA conformations but also demonstrates that ESI-MS combined with other analytical methods is rapid and useful for DNA structural studies

FigShare

MalishEtAl2023_Ecosphere

Author: Daniel Allen (14008325)
Darin Kopp (14521949)
Megan Malish (12419749)
Shang Gao (1525495)
Thomas Neeson (2908412)
Yang Hong (124259)
Publication venue
Publication date: 30/01/2023
Field of study

Data: Files ending in _current of _future contain hydrological model output, used to caclulate continuity and connectivity metrics. File in map_stats folder contain continuity metric values. blue_fixed.rda and connectivity_wts.csv are used to calculate DCI. Code: ConnectivityMetrics.R contains code to calculate percent wet length, number of dry fragments, and average length of dry fragments. DCI.R contains code to calculate DCI. change_dryperiodlength.R, change_dryperiods.R, and change_zeroflowdays.R contain code to calculate change in continuity metrics.</p

FigShare

Accessing 1,3-Dienes via Palladium-Catalyzed Allylic Alkylation of Pronucleophiles with Skipped Enynes

Author: Aijun Lin (1393531)
Chi Yang (130445)
Hao Liu (12832)
Hequan Yao (1393537)
Shang Gao (1525495)
Zhiyuan Fu (155361)
Publication venue
Publication date
Field of study

An unprecedented palladium-catalyzed allylic alkylation of pronucleophiles with unactivated skipped enynes has been developed. This method provides a straightforward access to a wide array of 1,3-dienes without the need to preinstall leaving groups or employ extra oxidants. The reaction exhibited high atom economy, good functional group tolerance, excellent regioselectivities, and scalability. With D2O as cosolvent, deuterium could be incorporated in high efficiency

FigShare

Controllable Rh(III)-Catalyzed Annulation between Salicylaldehydes and Diazo Compounds: Divergent Synthesis of Chromones and Benzofurans

Author: Aijun Lin (1393531)
Chi Yang (130445)
Hequan Yao (1393537)
Peng Sun (112156)
Shang Gao (1525495)
Songjin Guo (1716931)
Publication venue
Publication date
Field of study

A Rh(III)-catalyzed annulation between salicylaldehydes and diazo compounds with controllable chemoselectivity is described. AgNTf2 favored benzofurans via a tandem C–H activation/decarbonylation/annulation process, while AcOH led to chromones through a C–H activation/annulation pathway. The reaction exhibited good functional group tolerance and scalability. Moreover, only a single regioisomer of benzofuran was obtained due to the in situ decarbonylation orientation effect

FigShare

Accessing 1,3-Dienes via Palladium-Catalyzed Allylic Alkylation of Pronucleophiles with Skipped Enynes

Author: Aijun Lin (1393531)
Chi Yang (130445)
Hao Liu (12832)
Hequan Yao (1393537)
Shang Gao (1525495)
Zhiyuan Fu (155361)
Publication venue
Publication date
Field of study

FigShare

Accessing 1,3-Dienes via Palladium-Catalyzed Allylic Alkylation of Pronucleophiles with Skipped Enynes

Author: Aijun Lin (1393531)
Chi Yang (130445)
Hao Liu (12832)
Hequan Yao (1393537)
Shang Gao (1525495)
Zhiyuan Fu (155361)
Publication venue
Publication date
Field of study

FigShare