66 research outputs found

Integration and visualization of systems biology data in context of the genome

Author: Baliga Nitin S
Bare J Christopher
Koide Tie
Reiss David J
Tenenbaum Dan
Publication venue: BioMed Central
Publication date: 01/01/2010

Abstract. Background: High-density tiling arrays and new sequencing technologies are generating rapidly increasing volumes of transcriptome and protein-DNA interaction data. Visualization and exploration of this data is critical to understanding the regulatory logic encoded in the genome by which the cell dynamically affects its physiology and interacts with its environment. Results: The Gaggle Genome Browser is a cross-platform desktop program for interactively visualizing high-throughput data in the context of the genome. Important features include dynamic panning and zooming, keyword search and open interoperability through the Gaggle framework. Users may bookmark locations on the genome with descriptive annotations and share these bookmarks with other users. The program handles large sets of user-generated data using an in-process database and leverages the facilities of SQL and the R environment for importing and manipulating data. A key aspect of the Gaggle Genome Browser is interoperability. By connecting to the Gaggle framework, the genome browser joins a suite of interconnected bioinformatics tools for analysis and visualization with connectivity to major public repositories of sequences, interactions and pathways. To this flexible environment for exploring and combining data, the Gaggle Genome Browser adds the ability to visualize diverse types of data in relation to its coordinates on the genome. Conclusions: Genomic coordinates function as a common key by which disparate biological data types can be related to one another. In the Gaggle Genome Browser, heterogeneous data are joined by their location on the genome to create information-rich visualizations yielding insight into genome organization, transcription and its regulation and, ultimately, a better understanding of the mechanisms that enable the cell to dynamically respond to its environment.

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Repositório da Produção USP (Univ. de São Paulo)

The Firegoose: two-way integration of diverse data from different bioinformatics web resources with desktop applications

Author: Baliga Nitin S
Bare J Christopher
Schmid Amy K
Shannon Paul T
Publication venue: BioMed Central
Publication date: 01/01/2007

Abstract. Background: Information resources on the World Wide Web play an indispensable role in modern biology. But integrating data from multiple sources is often encumbered by the need to reformat data files, convert between naming systems, or perform ongoing maintenance of local copies of public databases. Opportunities for new ways of combining and re-using data are arising as a result of the increasing use of web protocols to transmit structured data. Results: The Firegoose, an extension to the Mozilla Firefox web browser, enables data transfer between web sites and desktop tools. As a component of the Gaggle integration framework, Firegoose can also exchange data with Cytoscape, the R statistical package, Multiexperiment Viewer (MeV), and several other popular desktop software tools. Firegoose adds the capability to easily use local data to query KEGG, EMBL STRING, DAVID, and other widely-used bioinformatics web sites. Query results from these web sites can be transferred to desktop tools for further analysis with a few clicks. Firegoose acquires data from the web by screen scraping, microformats, embedded XML, or web services. We define a microformat, which allows structured information compatible with the Gaggle to be embedded in HTML documents. We demonstrate the capabilities of this software by performing an analysis of the genes activated in the microbe Halobacterium salinarum NRC-1 in response to anaerobic environments. Starting with microarray data, we explore functions of differentially expressed genes by combining data from several public web resources and construct an integrated view of the cellular processes involved. Conclusion: The Firegoose incorporates Mozilla Firefox into the Gaggle environment and enables interactive sharing of data between diverse web resources and desktop software tools without maintaining local copies. Additional web sites can be incorporated easily into the framework using the scripting platform of the Firefox browser. Performing data integration in the browser allows the excellent search and navigation capabilities of the browser to be used in combination with powerful desktop tools.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Accurate Crystal Structure Prediction of New 2D Hybrid Organic Inorganic Perovskites

Author: Baldwin William J.
Bare Zachary J. L.
Csányi Gábor
Karimitari Nima
Kennedy W. Joshua
Muller Evan W.
Sutton Christopher
Publication date: 11/03/2024

Low-dimensional hybrid organic-inorganic perovskites (HOIPs) represent a promising class of electronically active materials for both light absorption and emission. The design space of HOIPs is extremely large, since a diverse space of organic cations can be combined with different inorganic frameworks. This immense design space allows for tunable electronic and mechanical properties, but also necessitates the development of new tools for in silico high-throughput analysis of candidate structures. In this work, we present an accurate, efficient, transferable and widely applicable machine learning interatomic potential (MLIP) for predicting the structure of new 2D HOIPs. Using the MACE architecture, an MLIP is trained on 86 diverse experimentally reported HOIP structures. The model is tested on 73 unseen perovskite compositions, and achieves chemical accuracy with respect to the reference electronic structure method. Our model is then combined with a simple random structure search algorithm to predict the structure of hypothetical HOIPs given only the proposed composition. Success is demonstrated by correctly and reliably recovering the crystal structure of a set of experimentally known 2D perovskites. Such a random structure search is impossible with ab initio methods due to the associated computational cost, but is relatively inexpensive with the MACE potential. Finally, the procedure is used to predict the structure formed by a new organic cation with no previously known corresponding perovskite. Laboratory synthesis of the new hybrid perovskite confirms the accuracy of our prediction. This capability, applied at scale, enables efficient screening of thousands of combinations of organic cations and inorganic layers.

Comment: 14 pages and 9 figures in the main text. Supplementary included in pd

arXiv.org e-Print Archive

Leveraging Domain Adaptation for Accurate Machine Learning Predictions of New Halide Perovskites

Author: Adhikari Santosh
Bare Zachary J. L.
DeCost Brian
Gupta Dipannoy Das
Musgrave Charles
Sutton Christopher
Yew Suxuen
Zhang Qi
Publication date: 19/01/2024

We combine graph neural networks (GNN) with an inexpensive and reliable structure generation approach based on the bond-valence method (BVM) to train accurate machine learning models for screening 222,960 halide perovskites using statistical estimates of the DFT/PBE formation energy (Ef), and the PBE and HSE band gaps (Eg). The GNNs were fine-tuned using domain adaptation (DA) from a source model, which yields a factor of 1.8 times improvement in Ef and 1.2 - 1.35 times improvement in HSE Eg compared to direct training (i.e., without DA). Using these two ML models, 48 compounds out of 222,960 candidates were identified as both stable and having an HSE Eg relevant for photovoltaic applications. Of this subset, only 8 have been reported to date, indicating that 40 compounds remain unexplored to the best of our knowledge and therefore offer opportunities for potential experimental examination.

arXiv.org e-Print Archive

Niche adaptation by expansion and reprogramming of general transcription factors

Author: Christopher L Plaisier
DasSarma S
David J Reiss
Goodwin Gibbins
J Christopher Bare
Jukes T
Kjelleberg S
Min Pan
Nitin S Baliga
Ohno S
Pekkonen M
Rodriguez‐Valera F
Serdar Turkarslan
Wan Lin Su
Publication venue: Nature Publishing Group

Experimental analysis of TFB family proteins in a halophilic archaeon reveals complex environment-dependent fitness contributions. Gene conversion events among these proteins can generate novel niche adaptation capabilities, a process that may have contributed to archaeal adaptation to extreme environments

Crossref

PubMed Central

Genetic variants in the KIF6 region and coronary event reduction from statin therapy

Author: AF Stewart
Carmen H. Tong
Charles M. Rowland
Christopher J. Packard
CJ Packard
CP Cannon
D Shiffman
DJ Schaid
FM Sacks
G Schmitz
GR Abecasis
Ian Ford
J Shepherd
James J. Devlin
James Shepherd
LA Bare
Lance A. Bare
LM Mangravite
MA Iannone
Marc S. Sabatine
Michele Robertson
OA Iakoubova
Olga A. Iakoubova
PI Bakker de
S Germer
SL Lake
Todd G. Kirchgessner
Wellcome Trust Case Control Consortium
Yonghong Li
Publication venue: Springer-Verlag
Publication date: 01/01/2010

A single nucleotide polymorphism (SNP) in KIF6, a member of the KIF9 family of kinesins, is associated with differential coronary event reduction from statin therapy in four randomized controlled trials; this SNP (rs20455) is also associated with the risk for coronary heart disease (CHD) in multiple prospective studies. We investigated whether other common SNPs in the KIF6 region were associated with event reduction from statin therapy. Of the 170 SNPs in the KIF6 region investigated in the Cholesterol and Recurrent Events trial (CARE), 28 were associated with differential event reduction from statin therapy (P_interaction < 0.1 in Caucasians, adjusted for age and sex) and were further investigated in the Pravastatin or Atorvastatin Evaluation and Infection Therapy-Thrombolysis In Myocardial Infarction 22 (PROVE IT-TIMI22) and West of Scotland Coronary Prevention Study (WOSCOPS). These analyses revealed that two SNPs (rs9462535 and rs9471077), in addition to rs20455, were associated with event reduction from statin therapy (P_interaction < 0.1 in each of the three studies). The relative risk reduction ranged from 37 to 50% (P < 0.01) in carriers of the minor alleles of these SNPs and from −4 to 13% (P > 0.4) in non-carriers. These three SNPs are in high linkage disequilibrium with one another (r² > 0.84). Functional studies of these variants may help to understand the role of KIF6 in the pathogenesis of CHD and differential response to statin therapy.

Crossref

Springer - Publisher Connector

PubMed Central

Enlighten

Prevalence of transcription promoters within archaeal operons and coding sequences

Author: Abhishek Pratap
Adhya S
Amelia Peterson
Amy K Schmid
Baliga NS
Breiman L
Bruz Marzolf
Dan Martin
David J Reiss
Eric W Deutsch
Fang‐Yin Lo
J Christopher Bare
Marc T Facciotti
Min Pan
Nitin S Baliga
Phu T Van
Reeve JN
Tie Koide
Wyming Lee Pang
Publication venue: Nature Publishing Group
Publication date: 01/01/2009

Despite the knowledge of complex prokaryotic-transcription mechanisms, generalized rules, such as the simplified organization of genes into operons with well-defined promoters and terminators, have had a significant role in systems analysis of regulatory logic in both bacteria and archaea. Here, we have investigated the prevalence of alternate regulatory mechanisms through genome-wide characterization of transcript structures of ∼64% of all genes, including putative non-coding RNAs in Halobacterium salinarum NRC-1. Our integrative analysis of transcriptome dynamics and protein–DNA interaction data sets showed widespread environment-dependent modulation of operon architectures, transcription initiation and termination inside coding sequences, and extensive overlap in 3′ ends of transcripts for many convergently transcribed genes. A significant fraction of these alternate transcriptional events correlate to binding locations of 11 transcription factors and regulators (TFs) inside operons and annotated genes—events usually considered spurious or non-functional. Using experimental validation, we illustrate the prevalence of overlapping genomic signals in archaeal transcription, casting doubt on the general perception of rigid boundaries between coding sequences and regulatory elements

Crossref

PubMed Central

eScholarship - University of California

Motion for a Resolution tabled by Mr Richard Balfe for entry in the register pursuant to Rule 49 of the Rules of Procedure on supply of military equipment to states where basic human rights are not respected. Working Documents 1982-83, Document 1-265/82, 19 May 1982

Author: Adam A. Margolin
Adam D. Ewing
Cristian Caloian
Dorota H. Sendorek
J. Christopher Bare
Joshua M. Stuart
Kathleen E. Houlahan
Kyle Ellrott
Paul C. Boutros
Takafumi N. Yamaguchi
Thea C. Norman
Publication date: 01/01/1982

Abstract. Background: The clinical sequencing of cancer genomes to personalize therapy is becoming routine across the world. However, concerns over patient re-identification from these data lead to questions about how tightly access should be controlled. It is not thought to be possible to re-identify patients from somatic variant data. However, somatic variant detection pipelines can mistakenly identify germline variants as somatic ones, a process called “germline leakage”. The rate of germline leakage across different somatic variant detection pipelines is not well-understood, and it is uncertain whether or not somatic variant calls should be considered re-identifiable. To fill this gap, we quantified germline leakage across 259 sets of whole-genome somatic single nucleotide variant (SNV) predictions made by 21 teams as part of the ICGC-TCGA DREAM Somatic Mutation Calling Challenge. Results: The median somatic SNV prediction set contained 4325 somatic SNVs and leaked one germline polymorphism. The level of germline leakage was inversely correlated with somatic SNV prediction accuracy and positively correlated with the amount of infiltrating normal cells. The specific germline variants leaked differed by tumour and algorithm. To aid in quantitation and correction of leakage, we created a tool, called GermlineFilter, for use in public-facing somatic SNV databases. Conclusions: The potential for patient re-identification from leaked germline variants in somatic SNV predictions has led to divergent open data access policies, based on different assessments of the risks. Indeed, a single, well-publicized re-identification event could reshape public perceptions of the values of genomic data sharing. We find that modern somatic SNV prediction pipelines have low germline-leakage rates, which can be further reduced, especially for cloud-sharing, using pre-filtering software.

University of Toronto Research Repository

Crossref

Archive of European Integration

Directory of Open Access Journals

eScholarship - University of California

UQ eSpace (University of Queensland)

The Francis Crick Institute

Astrometry and geodesy with radio interferometry: experiments, models, results

Author: Aoki S.
Argus D. F.
Baader H.-R.
Bare C.
Bartel N.
Bassiri S.
Batty M. J.
Bolton J. G.
Bowring B. R.
Brosche P.
Broten N. W.
Burke B. F.
Carr T. D.
Carter W. E.
Cartwright D. E.
Chao B. F.
Charlot P.
Chen G.
Christopher S. Jacobs
Clark T. A.
Cohen M. H.
Davidson J. M.
Davis J. L.
DeMets C.
Dreyer J. L. E.
Edge D. O.
Einstein A.
Elgered G.
Fairhead L.
Fallon F. W.
Fang M.
Fey A. L.
Fich M.
Folkner W. M.
Freedman A. P.
Fu L. L.
Fukushima T.
Gardner C. S.
Gaspar P.
Gilbert F.
Gipson J. M.
Gross R. S.
Gwinn C. R.
Haas R.
Hartmann T.
Hellings R. W.
Henstock D. R.
Herring T. A.
Hinteregger H. F.
Hjellming R. M.
Ho C. M.
Hosokawa M.
Hubble E. P.
Jacobs C. S.
Jansky K. G.
John L. Fanselow
Johnston K. J.
Kellermann K. I.
Kerr F. J.
Kinoshita H.
Kogut A.
Kovalevsky J.
Le Provost C.
Lebach D. E.
Lestrade J.-F.
Lieske J. H.
Linfield R. P.
Ma C.
MacDoran P. F.
MacMillan D. S.
Marcaide J. M.
Marini J. W.
Mathews P. M.
May J.
Minster J. B.
Mitrovica J. X.
Moran J. M.
Morgan P. J.
Moyer T. D.
Napier P. J.
Naudet C. J.
Niell A. E.
Ojars J. Sovers
Ong K. M.
Pagiatakis S. D.
Patnaik A.
Peltier W. R.
Piner B. G.
Polatidis A. G.
Pyne T.
Rabbel W.
Ray J. R.
Ray R. D.
Reber G.
Reid M. J.
Rius A.
Robertson D. S.
Rogers A. E. E.
Rosen R. D.
Scherneck H. G.
Schmidt M.
Seidelmann P. K.
Seiler U.
Shahid-Saless B.
Shapiro I. I.
Shapiro I. I.
Smith E. K.
Sonett C. P.
Souchay J.
Sovers O. J.
Standish E. M.
Taylor G. B.
Teitelbaum L. P.
Thakkar D. D.
Thayer G. D.
Treuhaft R. N.
Tushingham A. M.
van Dam T. M.
van Vleck J. H.
Vigue Y.
Wade C. M.
Wahr J. M.
Walter H. G.
Ward S. N.
Watkins M. M.
Webb F. H.
Williams J. G.
Wilson B. D.
Yahil A.
Yoder C. F.
Zebker H. A.
Zhu S. Y.
Publication venue: American Physical Society (APS)
Publication date: 01/01/1997

Summarizes current status of radio interferometry at radio frequencies between Earth-based receivers, for astrometric and geodetic applications. Emphasizes theoretical models of VLBI observables that are required to extract results at the present accuracy levels of 1 cm and 1 nanoradian. Highlights the achievements of VLBI during the past two decades in reference frames, Earth orientation, atmospheric effects on microwave propagation, and relativity.

Comment: 83 pages, 19 Postscript figures. To be published in Rev. Mod. Phys., Vol. 70, Oct. 199

arXiv.org e-Print Archive

CiteSeerX

Crossref

A US perspective on closing the carbon cycle to defossilize difficult-to-electrify segments of our economy

Author: Arenholz Elke
Autrey Tom
Bare Simon R
Biddinger Elizabeth J
Boettcher Shannon
Bowden Mark E
Britt Phillip F
Brown Robert C
Bullock R Morris
Chen Jingguang G
Daniel Claus
Delferro Massimiliano
Dorhout Peter K
Efroymson Rebecca A
Gaffney Kelly J
Gagliardi Laura
Harper Aaron S
Heldebrant David J
Helms Brett A
Huang Wenyu
Jordahl James L
Karakaya Canan
Kian Kourosh Cyrus
Kidder Michelle K
Kothandaraman Jotheeswari
Lercher Johannes
Liu Ping
Luca Oana R
Lyubovsky Maxim
Male Jonathan L
Malhotra Deepika
Miller Daniel J
Morris James R
Mueller Karl T
O’Brien Casey P
Palomino Robert M
Prozorov Tanya
Qi Long
Rallo Robert
Rana Rachita
Rioux Robert M
Rodriguez José A
Rousseau Roger
Russell Jake C
Sadow Aaron D
Sarazen Michele L
Schaidle Joshua A
Schulte Lisa A
Senanayake Sanjaya D
Shaw Wendy J
Sholl David S
Smith Emily A
Stevens Michaela Burke
Surendranath Yogesh
Tarpeh William A
Tassone Christopher J
Toma Francesca M
Tran Ba
Tumas William
Vlachos Dionisios G
Vogt Bryan D
Walton Krista S
Weber Robert S
Yang Jenny Y
Publication venue: eScholarship, University of California
Publication date: 01/05/2024

Electrification to reduce or eliminate greenhouse gas emissions is essential to mitigate climate change. However, a substantial portion of our manufacturing and transportation infrastructure will be difficult to electrify and/or will continue to use carbon as a key component, including areas in aviation, heavy-duty and marine transportation, and the chemical industry. In this Roadmap, we explore how multidisciplinary approaches will enable us to close the carbon cycle and create a circular economy by defossilizing these difficult-to-electrify areas and those that will continue to need carbon. We discuss two approaches for this: developing carbon alternatives and improving our ability to reuse carbon, enabled by separations. Furthermore, we posit that co-design and use-driven fundamental science are essential to reach aggressive greenhouse gas reduction targets

eScholarship - University of California