1,000 research outputs found

    Gene regulatory network modelling with evolutionary algorithms -an integrative approach

    Get PDF
    Building models for gene regulation has been an important aim of Systems Biology over the past years, driven by the large amount of gene expression data that has become available. Models represent regulatory interactions between genes and transcription factors and can provide better understanding of biological processes, and means of simulating both natural and perturbed systems (e.g. those associated with disease). Gene regulatory network (GRN) quantitative modelling is still limited, however, due to data issues such as noise and restricted length of time series, typically used for GRN reverse engineering. These issues create an under-determination problem, with many models possibly fitting the data. However, large amounts of other types of biological data and knowledge are available, such as cross-platform measurements, knockout experiments, annotations, binding site affinities for transcription factors and so on. It has been postulated that integration of these can improve model quality obtained, by facilitating further filtering of possible models. However, integration is not straightforward, as the different types of data can provide contradictory information, and are intrinsically noisy, hence large scale integration has not been fully explored, to date. Here, we present an integrative parallel framework for GRN modelling, which employs evolutionary computation and different types of data to enhance model inference. Integration is performed at different levels. (i) An analysis of cross-platform integration of time series microarray data, discussing the effects on the resulting models and exploring crossplatform normalisation techniques, is presented. This shows that time-course data integration is possible, and results in models more robust to noise and parameter perturbation, as well as reduced noise over-fitting. (ii) Other types of measurements and knowledge, such as knock-out experiments, annotated transcription factors, binding site affinities and promoter sequences are integrated within the evolutionary framework to obtain more plausible GRN models. This is performed by customising initialisation, mutation and evaluation of candidate model solutions. The different data types are investigated and both qualitative and quantitative improvements are obtained. Results suggest that caution is needed in order to obtain improved models from combined data, and the case study presented here provides an example of how this can be achieved. Furthermore, (iii), RNA-seq data is studied in comparison to microarray experiments, to identify overlapping features and possibilities of integration within the framework. The extension of the framework to this data type is straightforward and qualitative improvements are obtained when combining predicted interactions from single-channel and RNA-seq datasets

    Organization and evolution of information within eukaryotic genomes.

    Get PDF

    Development of a Plasmonic On-Chip System to Characterize Changes from External Perturbations in Cardiomyocytes

    Get PDF
    Today’s heart-on-a-chip devices are hoped to be the state-of-the-art cell and tissue characterizing tool, in clinically applicable regenerative medicine and cardiac tissue engineering. Due to the coupled electromechanical activity of cardiomyocytes (CM), a comprehensive heart-on-a-chip device as a cell characterizing tool must encompass the capability to quantify cellular contractility, conductivity, excitability, and rhythmicity. This dissertation focuses on developing a successful and statistically relevant surface plasmon resonance (SPR) biosensor for simultaneous recording of neonatal rat cardiomyocytes’ electrophysiological profile and mechanical motion under normal and perturbed conditions. The surface plasmon resonance technique can quantify (1) molecular binding onto a metal film, (2) bulk refractive index changes of the medium near (nm) the metal film, and (3) dielectric property changes of the metal film. We used thin gold metal films (also called chips) as our plasmonic sensor and obtained a periodic signal from spontaneously contracting CMs on the chip. Furthermore, we took advantage of a microfluidic module for controlled drug delivery to CMs on-chip, inhibiting and promoting their signaling pathways under dynamic flow. We identified that ionic channel activity of each contraction period of a live CM syncytium on a gold metal sensor would account for the non-specific ion adsorption onto the metal surface in a periodic manner. Moreover, the contraction of cardiomyocytes following their ion channel activity displaces the medium, changing its bulk refractive index near the metal surface. Hence, the real-time electromechanical activity of CMs using SPR sensors may be extracted as a time series we call the Plasmonic Cardio-Eukaryography Signal (P-CeG). The P-CeG signal render opportunities, where state-of-the-art heart-on-a-chip device complexities may subside to a simpler, faster and cheaper platform for label-free, non-invasive, and high throughput cellular characterization

    Integrated mining of feature spaces for bioinformatics domain discovery

    Get PDF
    One of the major challenges in the field of bioinformatics is the elucidation of protein folding for the functional annotation of proteins. The factors that govern protein folding include the chemical, physical, and environmental conditions of the protein\u27s surroundings, which can be measured and exploited for computational discovery purposes. These conditions enable the protein to transform from a sequence of amino acids to a globular three-dimensional structure. Information concerning the folded state of a protein has significant potential to explain biochemical pathways and their involvement in disorders and diseases. This information impacts the ways in which genetic diseases are characterized and cured and in which designer drugs are created. With the exponential growth of protein databases and the limitations of experimental protein structure determination, sophisticated computational methods have been developed and applied to search for, detect, and compare protein homology. Most computational tools developed for protein structure prediction are primarily based on sequence similarity searches. These approaches have improved the prediction accuracy of high sequence similarity proteins but have failed to perform well with proteins of low sequence similarity. Data mining offers unique algorithmic computational approaches that have been used widely in the development of automatic protein structure classification and prediction. In this dissertation, we present a novel approach for the integration of physico-chemical properties and effective feature extraction techniques for the classification of proteins. Our approaches overcome one of the major obstacles of data mining in protein databases, the encapsulation of different hydrophobicity residue properties into a much reduced feature space that possess high degrees of specificity and sensitivity in protein structure classification. We have developed three unique computational algorithms for coherent feature extraction on selected scale properties of the protein sequence. When plagued by the problem of the unequal cardinality of proteins, our proposed integration scheme effectively handles the varied sizes of proteins and scales well with increasing dimensionality of these sequences. We also detail a two-fold methodology for protein functional annotation. First, we exhibit our success in creating an algorithm that provides a means to integrate multiple physico-chemical properties in the form of a multi-layered abstract feature space, with each layer corresponding to a physico-chemical property. Second, we discuss a wavelet-based segmentation approach that efficiently detects regions of property conservation across all layers of the created feature space. Finally, we present a unique graph-theory based algorithmic framework for the identification of conserved hydrophobic residue interaction patterns using identified scales of hydrophobicity. We report that these discriminatory features are specific to a family of proteins, which consist of conserved hydrophobic residues that are then used for structural classification. We also present our rigorously tested validation schemes, which report significant degrees of accuracy to show that homologous proteins exhibit the conservation of physico-chemical properties along the protein backbone. We conclude our discussion by summarizing our results and contributions and by listing our goals for future research

    Computational Methods on Study of Differentially Expressed Proteins in Maize Proteomes Associated with Resistance to Aflatoxin Accumulation

    Get PDF
    Plant breeders have focused on improving maize resistance to Aspergillus flavus infection and aflatoxin accumulation by breeding with genotypes having the desirable traits. Various maize inbred lines have been developed for the breeding of resistance. Identification of differentially expressed proteins among such maize inbred lines will facilitate the development of gene markers and expedite the breeding process. Computational biology and proteomics approaches on the investigation of differentially expressed proteins were explored in this research. The major research objectives included 1) application of computational methods in homology and comparative modeling to study 3D protein structures and identify single nucleotide polymorphisms (SNPs) involved in changes of protein structures and functions, which can in turn increase the efficiency of the development of DNA markers; 2) investigation of methods on total protein profiling including purification, separation, visualization, and computational analysis at the proteome level. Special research goals were set on the development of open source computational methods using Matlab image processing tools to quantify and compare protein expression levels visualized by 2D protein electrophoresis gel techniques

    Biometrics

    Get PDF
    Biometrics uses methods for unique recognition of humans based upon one or more intrinsic physical or behavioral traits. In computer science, particularly, biometrics is used as a form of identity access management and access control. It is also used to identify individuals in groups that are under surveillance. The book consists of 13 chapters, each focusing on a certain aspect of the problem. The book chapters are divided into three sections: physical biometrics, behavioral biometrics and medical biometrics. The key objective of the book is to provide comprehensive reference and text on human authentication and people identity verification from both physiological, behavioural and other points of view. It aims to publish new insights into current innovations in computer systems and technology for biometrics development and its applications. The book was reviewed by the editor Dr. Jucheng Yang, and many of the guest editors, such as Dr. Girija Chetty, Dr. Norman Poh, Dr. Loris Nanni, Dr. Jianjiang Feng, Dr. Dongsun Park, Dr. Sook Yoon and so on, who also made a significant contribution to the book

    Associative Pattern Recognition for Biological Regulation Data

    Get PDF
    In the last decade, bioinformatics data has been accumulated at an unprecedented rate, thanks to the advancement in sequencing technologies. Such rapid development poses both challenges and promising research topics. In this dissertation, we propose a series of associative pattern recognition algorithms in biological regulation studies. In particular, we emphasize efficiently recognizing associative patterns between genes, transcription factors, histone modifications and functional labels using heterogeneous data sources (numeric, sequences, time series data and textual labels). In protein-DNA associative pattern recognition, we introduce an efficient algorithm for affinity test by searching for over-represented DNA sequences using a hash function and modulo addition calculation. This substantially improves the efficiency of \textit{next generation sequencing} data analysis. In gene regulatory network inference, we propose a framework for refining weak networks based on transcription factor binding sites, thus improved the precision of predicted edges by up to 52%. In histone modification code analysis, we propose an approach to genome-wide combinatorial pattern recognition for histone code to function associative pattern recognition, and achieved improvement by up to 38.1%38.1\%. We also propose a novel shape based modification pattern analysis approach, using this to successfully predict sub-classes of genes in flowering-time category. We also propose a combination to combination associative pattern recognition, and achieved better performance compared against multi-label classification and bidirectional associative memory methods. Our proposed approaches recognize associative patterns from different types of data efficiently, and provides a useful toolbox for biological regulation analysis. This dissertation presents a road-map to associative patterns recognition at genome wide level
    corecore