Search CORE

4,128 research outputs found

An Introduction to Programming for Bioscientists: A Python-based Primer

Author: Ekmekci Berk
McAnany Charles E.
Mura Cameron
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 17/05/2016
Field of study

Computing has revolutionized the biological sciences over the past several decades, such that virtually all contemporary research in the biosciences utilizes computer programs. The computational advances have come on many fronts, spurred by fundamental developments in hardware, software, and algorithms. These advances have influenced, and even engendered, a phenomenal array of bioscience fields, including molecular evolution and bioinformatics; genome-, proteome-, transcriptome- and metabolome-wide experimental studies; structural genomics; and atomistic simulations of cellular-scale molecular assemblies as large as ribosomes and intact viruses. In short, much of post-genomic biology is increasingly becoming a form of computational biology. The ability to design and write computer programs is among the most indispensable skills that a modern researcher can cultivate. Python has become a popular programming language in the biosciences, largely because (i) its straightforward semantics and clean syntax make it a readily accessible first language; (ii) it is expressive and well-suited to object-oriented programming, as well as other modern paradigms; and (iii) the many available libraries and third-party toolkits extend the functionality of the core language into virtually every biological domain (sequence and structure analyses, phylogenomics, workflow management systems, etc.). This primer offers a basic introduction to coding, via Python, and it includes concrete examples and exercises to illustrate the language's usage and capabilities; the main text culminates with a final project in structural bioinformatics. A suite of Supplemental Chapters is also provided. Starting with basic concepts, such as that of a 'variable', the Chapters methodically advance the reader to the point of writing a graphical user interface to compute the Hamming distance between two DNA sequences.Comment: 65 pages total, including 45 pages text, 3 figures, 4 tables, numerous exercises, and 19 pages of Supporting Information; currently in press at PLOS Computational Biolog

arXiv.org e-Print Archive

Directory of Open Access Journals

PubMed Central

FigShare

A Data Transformation System for Biological Data Sources

Author: Buneman Peter
Davidson Susan
Hart Kyle
Overton Chris
Wong L.
Publication venue
Publication date: 01/01/1995
Field of study

Scientific data of importance to biologists in the Human Genome Project resides not only in conventional databases, but in structured files maintained in a number of different formats (e.g. ASN.1 and ACE) as well a.s sequence analysis packages (e.g. BLAST and FASTA). These formats and packages contain a number of data types not found in conventional databases, such as lists and variants, and may be deeply nested. We present in this paper techniques for querying and transforming such data, and illustrate their use in a prototype system developed in conjunction with the Human Genome Center for Chromosome 22. We also describe optimizations performed by the system, a crucial issue for bulk data

CiteSeerX

Edinburgh Research Explorer

ScholarlyCommons@Penn

Bio++: a set of C++ libraries for sequence analysis, phylogenetics, molecular evolution and population genetics

Author: Bazin Eric
Belkhir Khalid
Dutheil Julien
Gaillard Sylvain
Galtier Nicolas
Glémin Sylvain
Ranwez Vincent
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: A large number of bioinformatics applications in the fields of bio-sequence analysis, molecular evolution and population genetics typically share input/ouput methods, data storage requirements and data analysis algorithms. Such common features may be conveniently bundled into re-usable libraries, which enable the rapid development of new methods and robust applications. RESULTS: We present Bio++, a set of Object Oriented libraries written in C++. Available components include classes for data storage and handling (nucleotide/amino-acid/codon sequences, trees, distance matrices, population genetics datasets), various input/output formats, basic sequence manipulation (concatenation, transcription, translation, etc.), phylogenetic analysis (maximum parsimony, markov models, distance methods, likelihood computation and maximization), population genetics/genomics (diversity statistics, neutrality tests, various multi-locus analyses) and various algorithms for numerical calculus. CONCLUSION: Implementation of methods aims at being both efficient and user-friendly. A special concern was given to the library design to enable easy extension and new methods development. We defined a general hierarchy of classes that allow the developer to implement its own algorithms while remaining compatible with the rest of the libraries. Bio++ source code is distributed free of charge under the CeCILL general public licence from its website

Springer - Publisher Connector

Directory of Open Access Journals

Computational and Mathematical Modelling of the EGF Receptor System

Author: A Citri
B Schoeberl
BN Kholodenko
BS Hendricks
BS Hendricks
C Lemerle
CG Johnson
CV Suresh Babu
CV Suresh Babu
D Walker
D-Y Lee
FA Brightman
H Kitano
HS Wiley
IV Maly
JJ Hornberg
JM Haugh
JP Goldman
L Rendell
M Hucka
M Tomita
ML Blinov
MP Cary
P. Mendes
R Breitling
RB Jones
RJ Orton
SS Andrews
SY Shvartsman
W Kolch
WJ Gullick
WWA Schamel
Publication venue: Humana Press
Publication date: 01/01/2008
Field of study

This chapter gives an overview of computational and mathematical modelling of the EGF receptor system. It begins with a survey of motivations for producing such models, then describes the main approaches that are taken to carrying out such modelling, viz. differential equations and individual-based modelling. Finally, a number of projects that applying modelling and simulation techniques to various aspects of the EGF receptor system are described

Crossref

Kent Academic Repository

Recommended from our members

Computational Strategies for Scalable Genomics Analysis.

Author: Shi Lizhen
Wang Zhong
Publication venue: eScholarship, University of California
Publication date: 01/01/2019
Field of study

The revolution in next-generation DNA sequencing technologies is leading to explosive data growth in genomics, posing a significant challenge to the computing infrastructure and software algorithms for genomics analysis. Various big data technologies have been explored to scale up/out current bioinformatics solutions to mine the big genomics data. In this review, we survey some of these exciting developments in the applications of parallel distributed computing and special hardware to genomics. We comment on the pros and cons of each strategy in the context of ease of development, robustness, scalability, and efficiency. Although this review is written for an audience from the genomics and bioinformatics fields, it may also be informative for the audience of computer science with interests in genomics applications

eScholarship - University of California

Scientific iconoclasm and active imagination: synthetic cells as techo-schientific mandalas

Author: Zwart Hub
Publication venue
Publication date: 01/01/2018
Field of study

Metaphors allow us to come to terms with abstract and complex information, by comparing it to something which is structured, familiar and concrete. Although modern science is “iconoclastic”, as Gaston Bachelard phrases it, scientists are at the same time prolific producers of metaphoric images themselves. Synthetic biology is an outstanding example of a technoscientific discourse replete with metaphors, including textual metaphors such as the “Morse code” of life, the “barcode” of life and the “book” of life. This paper focuses on a different type of metaphor, however, namely on the archetypal metaphor of the mandala as a symbol of restored unity and wholeness. Notably, mandala images emerge in textual materials related to one of the new “frontiers” of contemporary technoscience, namely the building of a synthetic cell: a laboratory artefact that functions like a cell and is even able to replicate itself. The mandala symbol suggests that, after living systems have been successfully reduced to the elementary building blocks and barcodes of life, the time has now come to put these fragments together again. We can only claim to understand life, synthetic cell experts argue, if we are able to technically reproduce a fully functioning cell. This holistic turn towards the cell as a meaningful whole also requires convergence at the “subject pole”: the building of a synthetic cell as a practice of the self, representing a turn towards integration, of multiple perspectives and various forms of expertise

PhilPapers

Crossref

Directory of Open Access Journals

Radboud Repository

Erasmus University Digital Repository

The Five Hundred and Thirty-Eighth Report of the Curricular Affairs Committee: New Bioinformatics Track in Cell and Molecular Biology.

Author: University of Rhode Island Faculty Senate
Publication venue: DigitalCommons@URI
Publication date: 23/02/2017
Field of study

DigitalCommons@URI