134 research outputs found

    GEMINI: Integrative Exploration of Genetic Variation and Genome Annotations

    Get PDF
    Modern DNA sequencing technologies enable geneticists to rapidly identify genetic variation among many human genomes. However, isolating the minority of variants underlying disease remains an important, yet formidable challenge for medical genetics. We have developed GEMINI (GEnome MINIng), a flexible software package for exploring all forms of human genetic variation. Unlike existing tools, GEMINI integrates genetic variation with a diverse and adaptable set of genome annotations (e.g., dbSNP, ENCODE, UCSC, ClinVar, KEGG) into a unified database to facilitate interpretation and data exploration. Whereas other methods provide an inflexible set of variant filters or prioritization methods, GEMINI allows researchers to compose complex queries based on sample genotypes, inheritance patterns, and both pre-installed and custom genome annotations. GEMINI also provides methods for ad hoc queries and data exploration, a simple programming interface for custom analyses that leverage the underlying database, and both command line and graphical tools for common analyses. We demonstrate GEMINI's utility for exploring variation in personal genomes and family based genetic studies, and illustrate its ability to scale to studies involving thousands of human samples. GEMINI is designed for reproducibility and flexibility and our goal is to provide researchers with a standard framework for medical genomics

    Engineering Enzyme Specificity Using Computational Design of a Defined-Sequence Library

    Get PDF
    Engineered biosynthetic pathways have the potential to produce high-value molecules from inexpensive feedstocks, but a key limitation is engineering enzymes with high activity and specificity for new reactions. Here, we developed a method for combining structure-based computational protein design with library-based enzyme screening, in which inter-residue correlations favored by the design are encoded into a defined-sequence library. We validated this approach by engineering a glucose 6-oxidase enzyme for use in a proposed pathway to convert D-glucose into D-glucaric acid. The most active variant, identified after only one round of diversification and screening of only 10,000 wells, is approximately 400-fold more active on glucose than is the wild-type enzyme. We anticipate that this strategy will be broadly applicable to the discovery of new enzymes for engineered biological pathways.United States. Office of Naval Research. Young Investigator Program (Grant N000140510656)National Science Foundation (U.S.) (Synthetic Biology Engineering Research Center. Grant EEC-0540879)MIT Faculty Start-up FundCodon Devices, Inc

    T Cell Responses to Human Endogenous Retroviruses in HIV-1 Infection

    Get PDF
    Human endogenous retroviruses (HERVs) are remnants of ancient infectious agents that have integrated into the human genome. Under normal circumstances, HERVs are functionally defective or controlled by host factors. In HIV-1-infected individuals, intracellular defense mechanisms are compromised. We hypothesized that HIV-1 infection would remove or alter controls on HERV activity. Expression of HERV could potentially stimulate a T cell response to HERV antigens, and in regions of HIV-1/HERV similarity, these T cells could be cross-reactive. We determined that the levels of HERV production in HIV-1-positive individuals exceed those of HIV-1-negative controls. To investigate the impact of HERV activity on specific immunity, we examined T cell responses to HERV peptides in 29 HIV-1-positive and 13 HIV-1-negative study participants. We report T cell responses to peptides derived from regions of HERV detected by ELISPOT analysis in the HIV-1-positive study participants. We show an inverse correlation between anti-HERV T cell responses and HIV-1 plasma viral load. In HIV-1-positive individuals, we demonstrate that HERV-specific T cells are capable of killing cells presenting their cognate peptide. These data indicate that HIV-1 infection leads to HERV expression and stimulation of a HERV-specific CD8+ T cell response. HERV-specific CD8+ T cells have characteristics consistent with an important role in the response to HIV-1 infection: a phenotype similar to that of T cells responding to an effectively controlled virus (cytomegalovirus), an inverse correlation with HIV-1 plasma viral load, and the ability to lyse cells presenting their target peptide. These characteristics suggest that elicitation of anti-HERV-specific immune responses is a novel approach to immunotherapeutic vaccination. As endogenous retroviral sequences are fixed in the human genome, they provide a stable target, and HERV-specific T cells could recognize a cell infected by any HIV-1 viral variant. HERV-specific immunity is an important new avenue for investigation in HIV-1 pathogenesis and vaccine design

    Pairwise selection assembly for sequence-independent construction of long-length DNA

    Get PDF
    The engineering of biological components has been facilitated by de novo synthesis of gene-length DNA. Biological engineering at the level of pathways and genomes, however, requires a scalable and cost-effective assembly of DNA molecules that are longer than ∼10 kb, and this remains a challenge. Here we present the development of pairwise selection assembly (PSA), a process that involves hierarchical construction of long-length DNA through the use of a standard set of components and operations. In PSA, activation tags at the termini of assembly sub-fragments are reused throughout the assembly process to activate vector-encoded selectable markers. Marker activation enables stringent selection for a correctly assembled product in vivo, often obviating the need for clonal isolation. Importantly, construction via PSA is sequence-independent, and does not require primary sequence modification (e.g. the addition or removal of restriction sites). The utility of PSA is demonstrated in the construction of a completely synthetic 91-kb chromosome arm from Saccharomyces cerevisiae

    The Stem Cell Discovery Engine: an integrated repository and analysis system for cancer stem cell comparisons

    Get PDF
    Mounting evidence suggests that malignant tumors are initiated and maintained by a subpopulation of cancerous cells with biological properties similar to those of normal stem cells. However, descriptions of stem-like gene and pathway signatures in cancers are inconsistent across experimental systems. Driven by a need to improve our understanding of molecular processes that are common and unique across cancer stem cells (CSCs), we have developed the Stem Cell Discovery Engine (SCDE)—an online database of curated CSC experiments coupled to the Galaxy analytical framework. The SCDE allows users to consistently describe, share and compare CSC data at the gene and pathway level. Our initial focus has been on carefully curating tissue and cancer stem cell-related experiments from blood, intestine and brain to create a high quality resource containing 53 public studies and 1098 assays. The experimental information is captured and stored in the multi-omics Investigation/Study/Assay (ISA-Tab) format and can be queried in the data repository. A linked Galaxy framework provides a comprehensive, flexible environment populated with novel tools for gene list comparisons against molecular signatures in GeneSigDB and MSigDB, curated experiments in the SCDE and pathways in WikiPathways. The SCDE is available at http://discovery.hsci.harvard.edu

    Community-driven development for computational biology at Sprints, Hackathons and Codefests

    Get PDF
    Background: Computational biology comprises a wide range of technologies and approaches. Multiple technologies can be combined to create more powerful workflows if the individuals contributing the data or providing tools for its interpretation can find mutual understanding and consensus. Much conversation and joint investigation are required in order to identify and implement the best approaches. Traditionally, scientific conferences feature talks presenting novel technologies or insights, followed up by informal discussions during coffee breaks. In multi-institution collaborations, in order to reach agreement on implementation details or to transfer deeper insights in a technology and practical skills, a representative of one group typically visits the other. However, this does not scale well when the number of technologies or research groups is large. Conferences have responded to this issue by introducing Birds-of-a-Feather (BoF) sessions, which offer an opportunity for individuals with common interests to intensify their interaction. However, parallel BoF sessions often make it hard for participants to join multiple BoFs and find common ground between the different technologies, and BoFs are generally too short to allow time for participants to program together. Results: This report summarises our experience with computational biology Codefests, Hackathons and Sprints, which are interactive developer meetings. They are structured to reduce the limitations of traditional scientific meetings described above by strengthening the interaction among peers and letting the participants determine the schedule and topics. These meetings are commonly run as loosely scheduled "unconferences" (self-organized identification of participants and topics for meetings) over at least two days, with early introductory talks to welcome and organize contributors, followed by intensive collaborative coding sessions. We summarise some prominent achievements of those meetings and describe differences in how these are organised, how their audience is addressed, and their outreach to their respective communities. Conclusions: Hackathons, Codefests and Sprints share a stimulating atmosphere that encourages participants to jointly brainstorm and tackle problems of shared interest in a self-driven proactive environment, as well as providing an opportunity for new participants to get involved in collaborative projects

    Incorporating Genomics and Bioinformatics across the Life Sciences Curriculum

    Get PDF
    Undergraduate life sciences education needs an overhaul, as clearly described in the National Research Council of the National Academies’ publication BIO 2010: Transforming Undergraduate Education for Future Research Biologists. Among BIO 2010’s top recommendations is the need to involve students in working with real data and tools that reflect the nature of life sciences research in the 21st century [1]. Education research studies support the importance of utilizing primary literature, designing and implementing experiments, and analyzing results in the context of a bona fide scientific question [1–12] in cultivating the analytical skills necessary to become a scientist. Incorporating these basic scientific methodologies in undergraduate education leads to increased undergraduate and post-graduate retention in the sciences [13–16]. Toward this end, many undergraduate teaching organizations offer training and suggestions for faculty to update and improve their teaching approaches to help students learn as scientists, through design and discovery (e.g., Council of Undergraduate Research [www.cur.org] and Project Kaleidoscope [ www.pkal.org])

    Tim-3 expression defines a novel population of dysfunctional T cells with highly elevated frequencies in progressive HIV-1 infection

    Get PDF
    Progressive loss of T cell functionality is a hallmark of chronic infection with human immunodeficiency virus 1 (HIV-1). We have identified a novel population of dysfunctional T cells marked by surface expression of the glycoprotein Tim-3. The frequency of this population was increased in HIV-1–infected individuals to a mean of 49.4 ± SD 12.9% of CD8+ T cells expressing Tim-3 in HIV-1–infected chronic progressors versus 28.5 ± 6.8% in HIV-1–uninfected individuals. Levels of Tim-3 expression on T cells from HIV-1–infected inviduals correlated positively with HIV-1 viral load and CD38 expression and inversely with CD4+ T cell count. In progressive HIV-1 infection, Tim-3 expression was up-regulated on HIV-1–specific CD8+ T cells. Tim-3–expressing T cells failed to produce cytokine or proliferate in response to antigen and exhibited impaired Stat5, Erk1/2, and p38 signaling. Blocking the Tim-3 signaling pathway restored proliferation and enhanced cytokine production in HIV-1–specific T cells. Thus, Tim-3 represents a novel target for the therapeutic reversal of HIV-1–associated T cell dysfunction
    corecore