25 research outputs found

    A Critical Review of Gene Marker Selection Methods and Cell Count Inference Tools

    Get PDF
    Seasonal influenza virus is a threat for human being. Understanding dynamical change of immune response induced by influenza infection could benefit diagnosis and drug development, using transcriptome analysis. But transcriptomic data is often complicated by the changing cell makeup of the tissue during disease. It’s difficult to distinguish between gene regulations and cell proliferation or migration. Therefore inference of the change in cell counts is necessary, and computational models for cell count inference are introduced in this thesis. Besides, in most models related to prediction of cell quantities, gene marker selection is used as the first step. Thus computational methodology concerning gene marker selection for cell count inference is also reviewed. Different gene marker selection methods are applied to a common dataset to evaluate their behaviors. The uniqueness and expression intensity are the key properties for evaluation of obtained markers. As for predicting cell enrichment, principles of three kinds of schemes are explained. Computational algorithms named CTen and CIBERSORT are introduced as examples of them. Estimation behaviors of these tools are tested by a microarray dataset. Analysis of the estimations shows that they may provide good estimation but are not suitable for careful study of complex problems, e.g. dynamical samples

    Big Data Approaches to Improving the Identification of Drug or Disease Mechanisms for Drug Innovation

    Get PDF
    Advances in science and technology have substantially changed drug research and development (R&D) processes. However, the efficiency of drug R&D, described in the number of new drugs approved per billion US dollars spent, dramatically declined between 1950 to 2010. Some of the main causes to the attrition include the cautious regulator, the potential risk in chemical screening methods for early drug discovery, and the lack of understanding of disease mechanisms. In order to improve the efficiency and productivity in drug R&D, more powerful tools are needed to assist in prediction forecasting and decision making in drug development. This dissertation describes my work in developing computational approaches to provide better understanding of drug or disease mechanisms at the systems level. The first project involves collaboration with RIKEN institute in Japan for innovation of influenza vaccine adjuvant. We performed comparative analysis of RNA-Seq data from mice treated with different adjuvants to identify mechanisms supporting adjuvant activity. In the second project, we predicted immune cell dynamics by linear regression-based algorithms or statistical tools, and suggested a new approach that can improve the discovery of key disease-associated genes. In the third project, we found that the network topological features, especially network betweenness, predominantly define the accuracy of a major drug target inference algorithm. We proposed a novel algorithm, TREAP, which integrated betweenness and differential gene expression and can accurately predict drug targets in a time-efficient manner. Through the projects above, we have demonstrated how computational algorithms can assist in mining big biological data to improve understanding of drug or disease mechanisms for drug innovation and development

    Comprehensively Surveying Structure and Function of RING Domains from Drosophila melanogaster

    Get PDF
    Using a complete set of RING domains from Drosophila melanogaster, all the solved RING domains and cocrystal structures of RING-containing ubiquitin-ligases (RING-E3) and ubiquitin-conjugating enzyme (E2) pairs, we analyzed RING domains structures from their primary to quarternary structures. The results showed that: i) putative orthologs of RING domains between Drosophila melanogaster and the human largely occur (118/139, 84.9%); ii) of the 118 orthologous pairs from Drosophila melanogaster and the human, 117 pairs (117/118, 99.2%) were found to retain entirely uniform domain architectures, only Iap2/Diap2 experienced evolutionary expansion of domain architecture; iii) 4 evolutionary structurally conserved regions (SCRs) are responsible for homologous folding of RING domains at the superfamily level; iv) besides the conserved Cys/His chelating zinc ions, 6 equivalent residues (4 hydrophobic and 2 polar residues) in the SCRs possess good-consensus and conservation- these 4 SCRs function in the structural positioning of 6 equivalent residues as determinants for RING-E3 catalysis; v) members of these RING proteins located nucleus, multiple subcellular compartments, membrane protein and mitochondrion are respectively 42 (42/139, 30.2%), 71 (71/139, 51.1%), 22 (22/139, 15.8%) and 4 (4/139, 2.9%); vi) CG15104 (Topors) and CG1134 (Mul1) in C3HC4, and CG3929 (Deltex) in C3H2C3 seem to display broader E2s binding profiles than other RING-E3s; vii) analyzing intermolecular interfaces of E2/RING-E3 complexes indicate that residues directly interacting with E2s are all from the SCRs in RING domains. Of the 6 residues, 2 hydrophobic ones contribute to constructing the conserved hydrophobic core, while the 2 hydrophobic and 2 polar residues directly participate in E2/RING-E3 interactions. Based on sequence and structural data, SCRs, conserved equivalent residues and features of intermolecular interfaces were extracted, highlighting the presence of a nucleus for RING domain fold and formation of catalytic core in which related residues and regions exhibit preferential evolutionary conservation

    COVID19 Disease Map, a computational knowledge repository of virus-host interaction mechanisms.

    Get PDF
    Funder: Bundesministerium für Bildung und ForschungFunder: Bundesministerium für Bildung und Forschung (BMBF)We need to effectively combine the knowledge from surging literature with complex datasets to propose mechanistic models of SARS-CoV-2 infection, improving data interpretation and predicting key targets of intervention. Here, we describe a large-scale community effort to build an open access, interoperable and computable repository of COVID-19 molecular mechanisms. The COVID-19 Disease Map (C19DMap) is a graphical, interactive representation of disease-relevant molecular mechanisms linking many knowledge sources. Notably, it is a computational resource for graph-based analyses and disease modelling. To this end, we established a framework of tools, platforms and guidelines necessary for a multifaceted community of biocurators, domain experts, bioinformaticians and computational biologists. The diagrams of the C19DMap, curated from the literature, are integrated with relevant interaction and text mining databases. We demonstrate the application of network analysis and modelling approaches by concrete examples to highlight new testable hypotheses. This framework helps to find signatures of SARS-CoV-2 predisposition, treatment response or prioritisation of drug candidates. Such an approach may help deal with new waves of COVID-19 or similar pandemics in the long-term perspective

    Taking the pulse of Earth's tropical forests using networks of highly distributed plots

    Get PDF
    Tropical forests are the most diverse and productive ecosystems on Earth. While better understanding of these forests is critical for our collective future, until quite recently efforts to measure and monitor them have been largely disconnected. Networking is essential to discover the answers to questions that transcend borders and the horizons of funding agencies. Here we show how a global community is responding to the challenges of tropical ecosystem research with diverse teams measuring forests tree-by-tree in thousands of long-term plots. We review the major scientific discoveries of this work and show how this process is changing tropical forest science. Our core approach involves linking long-term grassroots initiatives with standardized protocols and data management to generate robust scaled-up results. By connecting tropical researchers and elevating their status, our Social Research Network model recognises the key role of the data originator in scientific discovery. Conceived in 1999 with RAINFOR (South America), our permanent plot networks have been adapted to Africa (AfriTRON) and Southeast Asia (T-FORCES) and widely emulated worldwide. Now these multiple initiatives are integrated via ForestPlots.net cyber-infrastructure, linking colleagues from 54 countries across 24 plot networks. Collectively these are transforming understanding of tropical forests and their biospheric role. Together we have discovered how, where and why forest carbon and biodiversity are responding to climate change, and how they feedback on it. This long-term pan-tropical collaboration has revealed a large long-term carbon sink and its trends, as well as making clear which drivers are most important, which forest processes are affected, where they are changing, what the lags are, and the likely future responses of tropical forests as the climate continues to change. By leveraging a remarkably old technology, plot networks are sparking a very modern revolution in tropical forest science. In the future, humanity can benefit greatly by nurturing the grassroots communities now collectively capable of generating unique, long-term understanding of Earth's most precious forests. Resumen Los bosques tropicales son los ecosistemas más diversos y productivos del mundo y entender su funcionamiento es crítico para nuestro futuro colectivo. Sin embargo, hasta hace muy poco, los esfuerzos para medirlos y monitorearlos han estado muy desconectados. El trabajo en redes es esencial para descubrir las respuestas a preguntas que trascienden las fronteras y los plazos de las agencias de financiamiento. Aquí mostramos cómo una comunidad global está respondiendo a los desafíos de la investigación en ecosistemas tropicales a través de diversos equipos realizando mediciones árbol por árbol en miles de parcelas permanentes de largo plazo. Revisamos los descubrimientos más importantes de este trabajo y discutimos cómo este proceso está cambiando la ciencia relacionada a los bosques tropicales. El enfoque central de nuestro esfuerzo implica la conexión de iniciativas locales de largo plazo con protocolos estandarizados y manejo de datos para producir resultados que se puedan trasladar a múltiples escalas. Conectando investigadores tropicales, elevando su posición y estatus, nuestro modelo de Red Social de Investigación reconoce el rol fundamental que tienen, para el descubrimiento científico, quienes generan o producen los datos. Concebida en 1999 con RAINFOR (Suramérica), nuestras redes de parcelas permanentes han sido adaptadas en África (AfriTRON) y el sureste asiático (T-FORCES) y ampliamente replicadas en el mundo. Actualmente todas estas iniciativas están integradas a través de la ciber-infraestructura de ForestPlots.net, conectando colegas de 54 países en 24 redes diferentes de parcelas. Colectivamente, estas redes están transformando nuestro conocimiento sobre los bosques tropicales y el rol de éstos en la biósfera. Juntos hemos descubierto cómo, dónde y porqué el carbono y la biodiversidad de los bosques tropicales está respondiendo al cambio climático y cómo se retroalimentan. Esta colaboración pan-tropical de largo plazo ha expuesto un gran sumidero de carbono y sus tendencias, mostrando claramente cuáles son los factores más importantes, qué procesos se ven afectados, dónde ocurren los cambios, los tiempos de reacción y las probables respuestas futuras mientras el clima continúa cambiando. Apalancando lo que realmente es una tecnología antigua, las redes de parcelas están generando una verdadera y moderna revolución en la ciencia tropical. En el futuro, la humanidad puede beneficiarse enormemente si se nutren y cultivan comunidades de investigadores de base, actualmente con la capacidad de generar información única y de largo plazo para entender los que probablemente son los bosques más preciados de la tierra. Resumo Florestas tropicais são os ecossistemas mais diversos e produtivos da Terra. Embora uma boa compreensão destas florestas seja crucial para o nosso futuro coletivo, até muito recentemente os esforços de medições e monitoramento foram amplamente desconexos. É essencial formarmos redes para obtermos respostas que transcendem fronteiras e horizontes de agências financiadoras. Neste estudo nós mostramos como uma comunidade global está respondendo aos desafios da pesquisa de ecossistemas tropicais, com equipes diversas medindo florestas, árvore por árvore, em milhares de parcelas monitoradas à longo prazo. Nós revisamos as maiores descobertas científicas deste trabalho, e mostramos também como este processo está mudando a ciência de florestas tropicais. Nossa abordagem principal envolve unir iniciativas de base a protocolos padronizados e gerenciamento de dados a fim de gerar resultados robustos em escalas ampliadas. Ao conectar pesquisadores tropicais e elevar seus status, nosso modelo de Rede de Pesquisa Social reconhece o papel-chave do produtor dos dados na descoberta científica. Concebida em 1999 com o RAINFOR (América do Sul), nossa rede de parcelas permanentes foi adaptada para África (AfriTRON) e Sudeste asiático (T-FORCES), e tem sido extensamente reproduzida em todo o mundo. Agora estas múltiplas iniciativas estão integradas através de uma infraestrutura cibernética do ForestPlots.net, conectando colegas de 54 países de 24 redes de parcelas. Estas iniciativas estão transformando coletivamente o entendimento das florestas tropicais e seus papéis na biosfera. Juntos nós descobrimos como, onde e por que o carbono e a biodiversidade da floresta estão respondendo às mudanças climáticas, e seus efeitos de retroalimentação. Esta duradoura colaboração pantropical revelou um grande sumidouro de carbono persistente e suas tendências, assim como tem evidenciado quais direcionadores são mais importantes, quais processos florestais são mais afetados, onde eles estão mudando, seus atrasos no tempo de resposta, e as prováveis respostas das florestas tropicais conforme o clima continua a mudar. Dessa forma, aproveitando uma notável tecnologia antiga, redes de parcelas acendem faíscas de uma moderna revolução na ciência das florestas tropicais. No futuro a humanidade pode se beneficiar incentivando estas comunidades basais que agora são coletivamente capazes de gerar conhecimentos únicos e duradouros sobre as florestas mais preciosas da Terra. Résume Les forêts tropicales sont les écosystèmes les plus diversifiés et les plus productifs de la planète. Si une meilleure compréhension de ces forêts est essentielle pour notre avenir collectif, jusqu'à tout récemment, les efforts déployés pour les mesurer et les surveiller ont été largement déconnectés. La mise en réseau est essentielle pour découvrir les réponses à des questions qui dépassent les frontières et les horizons des organismes de financement. Nous montrons ici comment une communauté mondiale relève les défis de la recherche sur les écosystèmes tropicaux avec diverses équipes qui mesurent les forêts arbre après arbre dans de milliers de parcelles permanentes. Nous passons en revue les principales découvertes scientifiques de ces travaux et montrons comment ce processus modifie la science des forêts tropicales. Notre approche principale consiste à relier les initiatives de base à long terme à des protocoles standardisés et une gestion de données afin de générer des résultats solides à grande échelle. En reliant les chercheurs tropicaux et en élevant leur statut, notre modèle de réseau de recherche sociale reconnaît le rôle clé de l'auteur des données dans la découverte scientifique. Conçus en 1999 avec RAINFOR (Amérique du Sud), nos réseaux de parcelles permanentes ont été adaptés à l'Afrique (AfriTRON) et à l'Asie du Sud-Est (T-FORCES) et largement imités dans le monde entier. Ces multiples initiatives sont désormais intégrées via l'infrastructure ForestPlots.net, qui relie des collègues de 54 pays à travers 24 réseaux de parcelles. Ensemble, elles transforment la compréhension des forêts tropicales et de leur rôle biosphérique. Ensemble, nous avons découvert comment, où et pourquoi le carbone forestier et la biodiversité réagissent au changement climatique, et comment ils y réagissent. Cette collaboration pan-tropicale à long terme a révélé un important puits de carbone à long terme et ses tendances, tout en mettant en évidence les facteurs les plus importants, les processus forestiers qui sont affectés, les endroits où ils changent, les décalages et les réactions futures probables des forêts tropicales à mesure que le climat continue de changer. En tirant parti d'une technologie remarquablement ancienne, les réseaux de parcelles déclenchent une révolution très moderne dans la science des forêts tropicales. À l'avenir, l'humanité pourra grandement bénéficier du soutien des communautés de base qui sont maintenant collectivement capables de générer une compréhension unique et à long terme des forêts les plus précieuses de la Terre. Abstrak Hutan tropika adalah di antara ekosistem yang paling produktif dan mempunyai kepelbagaian biodiversiti yang tinggi di seluruh dunia. Walaupun pemahaman mengenai hutan tropika amat penting untuk masa depan kita, usaha-usaha untuk mengkaji dan mengawas hutah-hutan tersebut baru sekarang menjadi lebih diperhubungkan. Perangkaian adalah sangat penting untuk mencari jawapan kepada soalan-soalan yang menjangkaui sempadan dan batasan agensi pendanaan. Di sini kami menunjukkan bagaimana sebuah komuniti global bertindak balas terhadap cabaran penyelidikan ekosistem tropika melalui penglibatan pelbagai kumpulan yang mengukur hutan secara pokok demi pokok dalam beribu-ribu plot jangka panjang. Kami meninjau semula penemuan saintifik utama daripada kerja ini dan menunjukkan bagaimana proses ini sedang mengubah bidang sains hutan tropika. Teras pendekatan kami memberi tumpuan terhadap penghubungan inisiatif akar umbi jangka panjang dengan protokol standar serta pengurusan data untuk mendapatkan hasil skala besar yang kukuh. Dengan menghubungkan penyelidik-penyelidik tropika dan meningkatkan status mereka, model Rangkaian Penyelidikan Sosial kami mengiktiraf kepentingan peranan pengasas data dalam penemuan saintifik. Bermula dengan pengasasan RAINFOR (Amerika Selatan) pada tahun 1999, rangkaian-rangkaian plot kekal kami kemudian disesuaikan untuk Afrika (AfriTRON) dan Asia Tenggara (T-FORCES) dan selanjutnya telah banyak dicontohi di seluruh dunia. Kini, inisiatif-inisiatif tersebut disepadukan melalui infrastruktur siber ForestPlots.net yang menghubungkan rakan sekerja dari 54 negara di 24 buah rangkaian plot. Secara kolektif, rangkaian ini sedang mengubah pemahaman tentang hutan tropika dan peranannya dalam biosfera. Kami telah bekerjasama untuk menemukan bagaimana, di mana dan mengapa karbon serta biodiversiti hutan bertindak balas terhadap perubahan iklim dan juga bagaimana mereka saling bermaklum balas. Kolaborasi pan-tropika jangka panjang ini telah mendedahkan sebuah sinki karbon jangka panjang serta arah alirannya dan juga menjelaskan pemandu-pemandu perubahan yang terpenting, di mana dan bagaimana proses hutan terjejas, masa susul yang ada dan kemungkinan tindakbalas hutan tropika pada perubahan iklim secara berterusan di masa depan. Dengan memanfaatkan pendekatan lama, rangkaian plot sedang menyalakan revolusi yang amat moden dalam sains hutan tropika. Pada masa akan datang, manusia sejagat akan banyak mendapat manfaat jika memupuk komuniti-komuniti akar umbi yang kini berkemampuan secara kolektif menghasilkan pemahaman unik dan jangka panjang mengenai hutan-hutan yang paling berharga di dunia

    Predicting Host Immune Cell Dynamics and Key Disease-Associated Genes Using Tissue Transcriptional Profiles

    No full text
    Motivation: Immune cell dynamics is a critical factor of disease-associated pathology (immunopathology) that also impacts the levels of mRNAs in diseased tissue. Deconvolution algorithms attempt to infer cell quantities in a tissue/organ sample based on gene expression profiles and are often evaluated using artificial, non-complex samples. Their accuracy on estimating cell counts given temporal tissue gene expression data remains not well characterized and has never been characterized when using diseased lung. Further, how to remove the effects of cell migration on transcript counts to improve discovery of disease factors is an open question. Results: Four cell count inference (i.e., deconvolution) tools are evaluated using microarray data from influenza-infected lung sampled at several time points post-infection. The analysis finds that inferred cell quantities are accurate only for select cell types and there is a tendency for algorithms to have a good relative fit (R 2 ) but a poor absolute fit (normalized mean squared error; NMSE), which suggests systemic biases exist. Nonetheless, using cell fraction estimates to adjust gene expression data, we show that genes associated with influenza virus replication and increased infection pathology are more likely to be identified as significant than when applying traditional statistical tests

    Biocontrol of Penicillium digitatum on Postharvest Citrus Fruits by Pseudomonas fluorescens

    No full text
    The effectiveness of the bacteria antagonist Pseudomonas fluorescens to control green mold caused by Penicillium digitatum on oranges (Citrus sinensis Osbeck, cv. Jincheng) and the possible modes of action were evaluated. Whether in vitro or in vivo, treatments with cell-free autoclaved cultures or culture filtrate had limited capacity to suppress P. digitatum, while P. digitatum was significantly inhibited by bacterial fluid (P. fluorescens in the nutrient broth liquid medium) and bacterial suspension (P. fluorescens in sterile distilled water) with living cells. There was a positive relationship between the concentration of P. fluorescens in bacterial suspension and its biological efficacy. In addition, P. fluorescens was effective when applied preventatively but not when applied curatively. In the inoculated wounds, the population of P. fluorescens was an approximately 28- and 34-fold increase after being incubated at 20°C for 8 d and at 4°C for 16 d, respectively, and P. digitatum could effectively stimulate the growth and reproduction of P. fluorescens. Moreover, P. fluorescens was able to inhibit spore germination and germ tube elongation of P. digitatum as well as induce resistance on citrus peel by increasing the chitinase (CHI) activity and advancing the activities peaks of β-1,3-glucanase (GLU), peroxidase (POD), and phenylalanine ammonia lyase (PAL). All of these results support the potential application of P. fluorescens against green mold on postharvest citrus

    The cross-sectional study of hepatic lipase SNPs and plasma lipid levels

    No full text
    By the combination of meta-analysis, the data of the 1,000 Genomes Project Phase 3, and the promoter sequence of hepatic lipase (LIPC), we performed the cross-sectional study to explore the associations of four variants (rs1077835; rs1077834; rs1800588 [C-514T], and rs2070895 [G-250A]) in LIPC promoter with plasma lipid levels. Our results indicate that the first and the next three of the four SNPs are, respectively, reported to be associated with the decreased and increased HDL-c level. Meta-analysis of 87 studies with 101,988 participants indicates that HDL-c level in rs1800588 (C-514T) (pooled mean difference = 0.03, 95%CI (0.03, 0.04), p < .001) and rs2070895 (G-250A) (pooled mean difference = 0.07, 95%CI (0.05, 0.09), p < .001) is higher in allele T or A carriers. Similarly, LDL-c, TC, TG, and BMI levels are generally increased in T or A alleles carriers. We failed to conduct the meta-analysis of rs1077835 and rs1077834 due to the limited previous reports. Data from the 1,000 Genomes indicate that the allele frequencies of the four SNPs in total or subpopulations are almost equal to each other. The paired value r2 and D' of the four SNPs are larger than 0.8, which indicate the linkage disequilibrium of the four variants. The analysis of LIPC promoter indicate that C-514T and G-250A are, respectively, located in transcriptional factor binding sites of USF1and Pbx1b, which may partly explain the effect of the two SNPs on the decreased LIPC activity in the alleles carriers and the corresponding increased plasma lipids hydrolyzed by LIPC. These results may help us to better understand the different effects of the four SNPs on the plasma lipid levels among subpopulations and offer clues for future clinical treatment of dyslipidemia-related diseases

    Single-cell analysis reveals individual spore responses to simulated space vacuum

    No full text
    Outer space is a challenging environment for all forms of life, and dormant spores of bacteria have been frequently used to study the survival of terrestrial life in a space journey. Previous work showed that outer space vacuum alone can kill bacterial spores. However, the responses and mechanisms of resistance of individual spores to space vacuum are unclear. Here, we examined spores" molecular changes under simulated space vacuum (~10−5 Pa) using micro-Raman spectroscopy and found that this vacuum did not cause significant denaturation of spore protein. Then, live-cell microscopy was developed to investigate the temporal events during germination, outgrowth, and growth of individual Bacillus spores. The results showed that after exposure to simulated space vacuum for 10 days, viability of spores of two Bacillus species was reduced up to 35%, but all spores retained their large Ca2 +-dipicolinic acid depot. Some of the killed spores did not germinate, and the remaining germinated but did not proceed to vegetative growth. The vacuum treatment slowed spore germination, and changed average times of all major germination events. In addition, viable vacuum-treated spores exhibited much greater sensitivity than untreated spores to dry heat and hyperosmotic stress. Among spores" resistance mechanisms to high vacuum, DNA-protective a/β−type small acid-soluble proteins, and non- homologous end joining and base excision repair of DNA played the most important roles, especially against multiple cycles of vacuum treatment. Overall, these results give new insight into individual spore"s responses to space vacuum and provide new techniques for microorganism analysis at the single-cell level

    Enhanced Electrochemical Performance of LiNi<sub>0.5</sub>Mn<sub>1.5</sub>O<sub>4</sub> Cathode Material by YPO<sub>4</sub> Surface Modification

    No full text
    Cathode material LiNi<sub>0.5</sub>Mn<sub>1.5</sub>O<sub>4</sub> (LNMO) for lithium-ion batteries is successfully synthesized by a sol–gel method and is further modified by a thin layer of YPO<sub>4</sub> (1, 3, and 5 wt %) through a simple wet chemical strategy. Physical characterizations indicate that the YPO<sub>4</sub> nanolayer has a little impact on the cathode structure. Electrochemical optimization reveals that the 3 wt % YPO<sub>4</sub>-coated LNMO could still deliver a high specific capacity of 107 mAh g<sup>–1</sup> after 240 cycles, with a capacity retention of 77.5%, much higher than that of the pristine electrode. Electrochemical impedance spectroscopy (EIS) analysis proves that the rapid increase of surface impedance could be suppressed by the YPO<sub>4</sub> coating layer and thus facilitates the surface kinetics behavior in repeated cycling. Through further material aging experiments, the improvement of electrochemical performances could be attributed to the formation of Lewis acid YF<sub>3</sub>, converted from the YPO<sub>4</sub> coating layer in the LiPF<sub>6</sub>-based electrolyte, which not only scavenges the surface insulating alkaline species with a high acidity but also accelerates ion exchange on the material surface and thus helps to generate the solid solution Li–Ni–Mn–Y–O on the surface of YPO<sub>4</sub>-coated LNMO
    corecore