100 research outputs found

    What Difference Does Quantity Make? On the Epistemology of Big Data Biology

    Get PDF
    publication-status: Acceptedtypes: ArticleIs Big Data science a whole new way of doing research? And what difference does data quantity make to knowledge production strategies and their outputs? I argue that the novelty of Big Data science does not lie in the sheer quantity of data involved, but rather in (1) the prominence and status acquired by data as commodity and recognised output, both within and outside of the scientific community and (2) the methods, infrastructures, technologies, skills and knowledge developed to handle data. These developments generate the impression that data-intensive research is a new mode of doing science, with its own epistemology and norms. To assess this claim, one needs to consider the ways in which data are actually disseminated and used to generate knowledge. Accordingly, this article reviews the development of sophisticated ways to disseminate, integrate and re-use data acquired on model organisms over the last three decades of work in experimental biology. I focus on online databases as prominent infrastructures set up to organise and interpret such data and examine the wealth and diversity of expertise, resources and conceptual scaffolding that such databases draw upon. This illuminates some of the conditions under which Big Data needs to be curated to support processes of discovery across biological subfields, which in turn highlights the difficulties caused by the lack of adequate curation for the vast majority of data in the life sciences. In closing, I reflect on the difference that data quantity is making to contemporary biology, the methodological and epistemic challenges of identifying and analysing data given these developments, and the opportunities and worries associated with Big Data discourse and methods.Economic and Social Research CouncilES/F028180/1Leverhulme TrustRPG-2013-153European Union’s Seventh Framework Programme (FP7/2007-2013ERC grant agreement number 335925

    Frozen magma lenses below the oceanic crust

    Get PDF
    Author Posting. Β© The Authors, 2005. This is the author's version of the work. It is posted here by permission of Nature Publishing Group for personal use, not for redistribution. The definitive version was published in Nature 436 (2005): 1149-1152, doi:10.1038/nature03944.The Earth's oceanic crust crystallizes from magmatic systems generated at mid-ocean ridges. Whereas a single magma body residing within the mid-crust is thought to be responsible for the generation of the upper oceanic crust, it remains unclear if the lower crust is formed from the same magma body, or if it mainly crystallizes from magma lenses located at the base of the crust. Thermal modelling, tomography, compliance and wide-angle seismic studies, supported by geological evidence, suggest the presence of gabbroic-melt accumulations within the Moho transition zone in the vicinity of fast- to intermediate-spreading centres. Until now, however, no reflection images have been obtained of such a structure within the Moho transition zone. Here we show images of groups of Moho transition zone reflection events that resulted from the analysis of approximately 1,500 km of multichannel seismic data collected across the intermediate-spreading-rate Juan de Fuca ridge. From our observations we suggest that gabbro lenses and melt accumulations embedded within dunite or residual mantle peridotite are the most probable cause for the observed reflectivity, thus providing support for the hypothesis that the crust is generated from multiple magma bodies

    Extragalactic Radio Continuum Surveys and the Transformation of Radio Astronomy

    Full text link
    Next-generation radio surveys are about to transform radio astronomy by discovering and studying tens of millions of previously unknown radio sources. These surveys will provide new insights to understand the evolution of galaxies, measuring the evolution of the cosmic star formation rate, and rivalling traditional techniques in the measurement of fundamental cosmological parameters. By observing a new volume of observational parameter space, they are also likely to discover unexpected new phenomena. This review traces the evolution of extragalactic radio continuum surveys from the earliest days of radio astronomy to the present, and identifies the challenges that must be overcome to achieve this transformational change.Comment: To be published in Nature Astronomy 18 Sept 201

    e-Science and biological pathway semantics

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The development of e-Science presents a major set of opportunities and challenges for the future progress of biological and life scientific research. Major new tools are required and corresponding demands are placed on the high-throughput data generated and used in these processes. Nowhere is the demand greater than in the semantic integration of these data. Semantic Web tools and technologies afford the chance to achieve this semantic integration. Since pathway knowledge is central to much of the scientific research today it is a good test-bed for semantic integration. Within the context of biological pathways, the BioPAX initiative, part of a broader movement towards the standardization and integration of life science databases, forms a necessary prerequisite for its successful application of e-Science in health care and life science research. This paper examines whether BioPAX, an effort to overcome the barrier of disparate and heterogeneous pathway data sources, addresses the needs of e-Science.</p> <p>Results</p> <p>We demonstrate how BioPAX pathway data can be used to ask and answer some useful biological questions. We find that BioPAX comes close to meeting a broad range of e-Science needs, but certain semantic weaknesses mean that these goals are missed. We make a series of recommendations for re-modeling some aspects of BioPAX to better meet these needs.</p> <p>Conclusion</p> <p>Once these semantic weaknesses are addressed, it will be possible to integrate pathway information in a manner that would be useful in e-Science.</p

    Adaptive Introgression across Species Boundaries in Heliconius Butterflies

    Get PDF
    It is widely documented that hybridisation occurs between many closely related species, but the importance of introgression in adaptive evolution remains unclear, especially in animals. Here, we have examined the role of introgressive hybridisation in transferring adaptations between mimetic Heliconius butterflies, taking advantage of the recent identification of a gene regulating red wing patterns in this genus. By sequencing regions both linked and unlinked to the red colour locus, we found a region that displays an almost perfect genotype by phenotype association across four species, H. melpomene, H. cydno, H. timareta, and H. heurippa. This particular segment is located 70 kb downstream of the red colour specification gene optix, and coalescent analysis indicates repeated introgression of adaptive alleles from H. melpomene into the H. cydno species clade. Our analytical methods complement recent genome scale data for the same region and suggest adaptive introgression has a crucial role in generating adaptive wing colour diversity in this group of butterflies

    Clinical Phenotypes and Prognosis of Dilated Cardiomyopathy Caused by Truncating Variants in the TTN Gene

    Get PDF
    BACKGROUND: Truncating variants in the TTN gene (TTNtv) are the commonest cause of heritable dilated cardiomyopathy. This study aimed to study the phenotypes and outcomes of TTNtv carriers. METHODS: Five hundred thirty-seven individuals (61% men; 317 probands) with TTNtv were recruited in 14 centers (372 [69%] with baseline left ventricular systolic dysfunction [LVSD]). Baseline and longitudinal clinical data were obtained. The primary end point was a composite of malignant ventricular arrhythmia and end-stage heart failure. The secondary end point was left ventricular reverse remodeling (left ventricular ejection fraction increase by β‰₯10% or normalization to β‰₯50%). RESULTS: Median follow-up was 49 (18-105) months. Men developed LVSD more frequently and earlier than women (45Β±14 versus 49Β±16 years, respectively; P=0.04). By final evaluation, 31%, 45%, and 56% had atrial fibrillation, frequent ventricular ectopy, and nonsustained ventricular tachycardia, respectively. Seventy-six (14.2%) individuals reached the primary end point (52 [68%] end-stage heart failure events, 24 [32%] malignant ventricular arrhythmia events). Malignant ventricular arrhythmia end points most commonly occurred in patients with severe LVSD. Male sex (hazard ratio, 1.89 [95% CI, 1.04-3.44]; P=0.04) and left ventricular ejection fraction (per 10% decrement from left ventricular ejection fraction, 50%; hazard ratio, 1.63 [95% CI, 1.30-2.04]; P<0.001) were independent predictors of the primary end point. Two hundred seven of 300 (69%) patients with LVSD had evidence of left ventricular reverse remodeling. In a subgroup of 29 of 74 (39%) patients with initial left ventricular reverse remodeling, there was a subsequent left ventricular ejection fraction decrement. TTNtv location was not associated with statistically significant differences in baseline clinical characteristics, left ventricular reverse remodeling, or outcomes on multivariable analysis (P=0.07). CONCLUSIONS: TTNtv is characterized by frequent arrhythmia, but malignant ventricular arrhythmias are most commonly associated with severe LVSD. Male sex and LVSD are independent predictors of outcomes. Mutation location does not impact clinical phenotype or outcomes

    Genetic diversity and demographic instability in Riftia pachyptila tubeworms from eastern Pacific hydrothermal vents

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Deep-sea hydrothermal vent animals occupy patchy and ephemeral habitats supported by chemosynthetic primary production. Volcanic and tectonic activities controlling the turnover of these habitats contribute to demographic instability that erodes genetic variation within and among colonies of these animals. We examined DNA sequences from one mitochondrial and three nuclear gene loci to assess genetic diversity in the siboglinid tubeworm, <it>Riftia pachyptila</it>, a widely distributed constituent of vents along the East Pacific Rise and GalΓ‘pagos Rift.</p> <p>Results</p> <p>Genetic differentiation (<it>F</it><sub><it>ST</it></sub>) among populations increased with geographical distances, as expected under a linear stepping-stone model of dispersal. Low levels of DNA sequence diversity occurred at all four loci, allowing us to exclude the hypothesis that an idiosyncratic selective sweep eliminated mitochondrial diversity alone. Total gene diversity declined with tectonic spreading rates. The southernmost populations, which are subjected to superfast spreading rates and high probabilities of extinction, are relatively homogenous genetically.</p> <p>Conclusions</p> <p>Compared to other vent species, DNA sequence diversity is extremely low in <it>R. pachyptila</it>. Though its dispersal abilities appear to be effective, the low diversity, particularly in southern hemisphere populations, is consistent with frequent local extinction and (re)colonization events.</p

    The removal of multiplicative, systematic bias allows integration of breast cancer gene expression datasets – improving meta-analysis and prediction of prognosis

    Get PDF
    BACKGROUND: The number of gene expression studies in the public domain is rapidly increasing, representing a highly valuable resource. However, dataset-specific bias precludes meta-analysis at the raw transcript level, even when the RNA is from comparable sources and has been processed on the same microarray platform using similar protocols. Here, we demonstrate, using Affymetrix data, that much of this bias can be removed, allowing multiple datasets to be legitimately combined for meaningful meta-analyses. RESULTS: A series of validation datasets comparing breast cancer and normal breast cell lines (MCF7 and MCF10A) were generated to examine the variability between datasets generated using different amounts of starting RNA, alternative protocols, different generations of Affymetrix GeneChip or scanning hardware. We demonstrate that systematic, multiplicative biases are introduced at the RNA, hybridization and image-capture stages of a microarray experiment. Simple batch mean-centering was found to significantly reduce the level of inter-experimental variation, allowing raw transcript levels to be compared across datasets with confidence. By accounting for dataset-specific bias, we were able to assemble the largest gene expression dataset of primary breast tumours to-date (1107), from six previously published studies. Using this meta-dataset, we demonstrate that combining greater numbers of datasets or tumours leads to a greater overlap in differentially expressed genes and more accurate prognostic predictions. However, this is highly dependent upon the composition of the datasets and patient characteristics. CONCLUSION: Multiplicative, systematic biases are introduced at many stages of microarray experiments. When these are reconciled, raw data can be directly integrated from different gene expression datasets leading to new biological findings with increased statistical power

    LAP2 Is Widely Overexpressed in Diverse Digestive Tract Cancers and Regulates Motility of Cancer Cells

    Get PDF
    BACKGROUND: Lamina-associated polypeptides 2 (LAP2) is a nuclear protein that connects the nuclear lamina with chromatin. Although its critical roles in genetic disorders and hematopoietic malignancies have been described, its expression and roles in digestive tract cancers have been poorly characterized. METHODS: To examine the expression of LAP2 in patient tissues, we performed immunohistochemistry and real-time PCR. To examine motility of cancer cells, we employed Boyden chamber, wound healing and Matrigel invasion assays. To reveal its roles in metastasis in vivo, we used a liver metastasis xenograft model. To investigate the underlying mechanism, a cDNA microarray was conducted. RESULTS: Immunohistochemistry in patient tissues showed widespread expression of LAP2 in diverse digestive tract cancers including stomach, pancreas, liver, and bile duct cancers. Real-time PCR confirmed that LAP2Ξ² is over-expressed in gastric cancer tissues. Knockdown of LAP2Ξ² did not affect proliferation of most digestive tract cancer cells except pancreatic cancer cells. However, knockdown of LAP2Ξ² decreased motility of all tested cancer cells. Moreover, overexpression of LAP2Ξ² increased motility of gastric and pancreatic cancer cells. In the liver metastasis xenograft model, LAP2Ξ² increased metastatic efficacy of gastric cancer cells and mortality in tested mice. cDNA microarrays showed the possibility that myristoylated alanine-rich C kinase substrate (MARCKS) and interleukin6 (IL6) may mediate LAP2Ξ²-regulated motility of cancer cells. CONCLUSIONS: From the above results, we conclude that LAP2 is widely overexpressed in diverse digestive tract cancers and LAP2Ξ² regulates motility of cancer cells and suggest that LAP2Ξ² may have utility for diagnostics and therapeutics in digestive tract cancers

    The Druze: A Population Genetic Refugium of the Near East

    Get PDF
    BACKGROUND: Phylogenetic mitochondrial DNA haplogroups are highly partitioned across global geographic regions. A unique exception is the X haplogroup, which has a widespread global distribution without major regions of distinct localization. PRINCIPAL FINDINGS: We have examined mitochondrial DNA sequence variation together with Y-chromosome-based haplogroup structure among the Druze, a religious minority with a unique socio-demographic history residing in the Near East. We observed a striking overall pattern of heterogeneous parental origins, consistent with Druze oral tradition, together with both a high frequency and a high diversity of the mitochondrial DNA (mtDNA) X haplogroup within a confined regional subpopulation. Furthermore demographic modeling indicated low migration rates with nearby populations. CONCLUSIONS: These findings were enabled through the use of a paternal kindred based sampling approach, and suggest that the Galilee Druze represent a population isolate, and that the combination of a high frequency and diversity of the mtDNA X haplogroup signifies a phylogenetic refugium, providing a sample snapshot of the genetic landscape of the Near East prior to the modern age
    • …
    corecore