330 research outputs found

    Similarity-Detection and Localization

    Full text link
    The detection of similarities between long DNA and protein sequences is studied using concepts of statistical physics. It is shown that mutual similarities can be detected by sequence alignment methods only if their amount exceeds a threshold value. The onset of detection is a continuous phase transition which can be viewed as a localization-delocalization transition. The ``fidelity'' of the alignment is the order parameter of that transition; it leads to criteria for the selection of optimal alignment parameters.Comment: 4 pages including 4 figures (308kb post-script file

    Sequence Alignment with Matched Sections

    Get PDF
    In molecular biology, two finite sequences are compared by displaying one sequence written over another in an alignment. The number of alignments of two sequences is related to the Stanton-Cowan numbers. This paper gives asymptotics for the number of alignments of two sequences of length n with matching sections of size at least b

    Integrative missing value estimation for microarray data

    Get PDF
    BACKGROUND: Missing value estimation is an important preprocessing step in microarray analysis. Although several methods have been developed to solve this problem, their performance is unsatisfactory for datasets with high rates of missing data, high measurement noise, or limited numbers of samples. In fact, more than 80% of the time-series datasets in Stanford Microarray Database contain less than eight samples. RESULTS: We present the integrative Missing Value Estimation method (iMISS) by incorporating information from multiple reference microarray datasets to improve missing value estimation. For each gene with missing data, we derive a consistent neighbor-gene list by taking reference data sets into consideration. To determine whether the given reference data sets are sufficiently informative for integration, we use a submatrix imputation approach. Our experiments showed that iMISS can significantly and consistently improve the accuracy of the state-of-the-art Local Least Square (LLS) imputation algorithm by up to 15% improvement in our benchmark tests. CONCLUSION: We demonstrated that the order-statistics-based integrative imputation algorithms can achieve significant improvements over the state-of-the-art missing value estimation approaches such as LLS and is especially good for imputing microarray datasets with a limited number of samples, high rates of missing data, or very noisy measurements. With the rapid accumulation of microarray datasets, the performance of our approach can be further improved by incorporating larger and more appropriate reference datasets

    An integrative modular approach to systematically predict gene-phenotype associations

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Complex human diseases are often caused by multiple mutations, each of which contributes only a minor effect to the disease phenotype. To study the basis for these complex phenotypes, we developed a network-based approach to identify coexpression modules specifically activated in particular phenotypes. We integrated these modules, protein-protein interaction data, Gene Ontology annotations, and our database of gene-phenotype associations derived from literature to predict novel human gene-phenotype associations. Our systematic predictions provide us with the opportunity to perform a global analysis of human gene pleiotropy and its underlying regulatory mechanisms.</p> <p>Results</p> <p>We applied this method to 338 microarray datasets, covering 178 phenotype classes, and identified 193,145 phenotype-specific coexpression modules. We trained random forest classifiers for each phenotype and predicted a total of 6,558 gene-phenotype associations. We showed that 40.9% genes are pleiotropic, highlighting that pleiotropy is more prevalent than previously expected. We collected 77 ChIP-chip datasets studying 69 transcription factors binding over 16,000 targets under various phenotypic conditions. Utilizing this unique data source, we confirmed that dynamic transcriptional regulation is an important force driving the formation of phenotype specific gene modules.</p> <p>Conclusion</p> <p>We created a genome-wide gene to phenotype mapping that has many potential implications, including providing potential new drug targets and uncovering the basis for human disease phenotypes. Our analysis of these phenotype-specific coexpression modules reveals a high prevalence of gene pleiotropy, and suggests that phenotype-specific transcription factor binding may contribute to phenotypic diversity. All resources from our study are made freely available on our online Phenotype Prediction Database <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>.</p

    Sequence information signal processor

    Get PDF
    An electronic circuit is used to compare two sequences, such as genetic sequences, to determine which alignment of the sequences produces the greatest similarity. The circuit includes a linear array of series-connected processors, each of which stores a single element from one of the sequences and compares that element with each successive element in the other sequence. For each comparison, the processor generates a scoring parameter that indicates which segment ending at those two elements produces the greatest degree of similarity between the sequences. The processor uses the scoring parameter to generate a similar scoring parameter for a comparison between the stored element and the next successive element from the other sequence. The processor also delivers the scoring parameter to the next processor in the array for use in generating a similar scoring parameter for another pair of elements. The electronic circuit determines which processor and alignment of the sequences produce the scoring parameter with the highest value

    Sequence information signal processor for local and global string comparisons

    Get PDF
    A sequence information signal processing integrated circuit chip designed to perform high speed calculation of a dynamic programming algorithm based upon the algorithm defined by Waterman and Smith. The signal processing chip of the present invention is designed to be a building block of a linear systolic array, the performance of which can be increased by connecting additional sequence information signal processing chips to the array. The chip provides a high speed, low cost linear array processor that can locate highly similar global sequences or segments thereof such as contiguous subsequences from two different DNA or protein sequences. The chip is implemented in a preferred embodiment using CMOS VLSI technology to provide the equivalent of about 400,000 transistors or 100,000 gates. Each chip provides 16 processing elements, and is designed to provide 16 bit, two's compliment operation for maximum score precision of between -32,768 and +32,767. It is designed to provide a comparison between sequences as long as 4,194,304 elements without external software and between sequences of unlimited numbers of elements with the aid of external software. Each sequence can be assigned different deletion and insertion weight functions. Each processor is provided with a similarity measure device which is independently variable. Thus, each processor can contribute to maximum value score calculation using a different similarity measure

    Gene Aging Nexus: a web database and data mining platform for microarray data on aging

    Get PDF
    The recent development of microarray technology provided unprecedented opportunities to understand the genetic basis of aging. So far, many microarray studies have addressed aging-related expression patterns in multiple organisms and under different conditions. The number of relevant studies continues to increase rapidly. However, efficient exploitation of these vast data is frustrated by the lack of an integrated data mining platform or other unifying bioinformatic resource to enable convenient cross-laboratory searches of array signals. To facilitate the integrative analysis of microarray data on aging, we developed a web database and analysis platform ‘Gene Aging Nexus’ (GAN) that is freely accessible to the research community to query/analyze/visualize cross-platform and cross-species microarray data on aging. By providing the possibility of integrative microarray analysis, GAN should be useful in building the systems-biology understanding of aging. GAN is accessible at

    Feasibility of trial procedures for a randomised controlled trial of a community based group exercise intervention for falls prevention for visually impaired older people: the VIOLET study

    Get PDF
    Background Visually impaired older people (VIOP) have a higher risk of falling than their sighted peers, and are likely to avoid physical activity. The aim was to adapt the existing Falls Management Exercise (FaME) programme for VIOP, delivered in the community, and to investigate the feasibility of conducting a definitive randomised controlled trial (RCT) of this adapted intervention. Methods Two-centre randomised mixed methods pilot trial and economic evaluation of the adapted group-based FaME programme for VIOP versus usual care. A one hour exercise programme ran weekly over 12 weeks at the study sites (Newcastle and Glasgow), delivered by third sector (voluntary and community) organisations. Participants were advised to exercise at home for an additional two hours over the week. Those randomised to the usual activities group received no intervention. Outcome measures were completed at baseline, 12 and 24 weeks. The potential primary outcome was the Short Form Falls Efficacy Scale – International (SFES-I). Participants’ adherence was assessed by reviewing attendance records and self-reported compliance to the home exercises. Adherence with the course content (fidelity) by instructors was assessed by a researcher. Adverse events were collected in a weekly phone call. Results Eighteen participants, drawn from community-living VIOP were screened; 68 met the inclusion criteria; 64 participants were randomised with 33 allocated to the intervention and 31 to the usual activities arm. 94% of participants provided data at the 12 week visit and 92% at 24 weeks. Adherence was high. The intervention was found to be safe with 76% attending nine or more classes. Median time for home exercise was 50 min per week. There was little or no evidence that fear of falling, balance and falls risk, physical activity, emotional, attitudinal or quality of life outcomes differed between trial arms at follow-up. Conclusions The intervention, FaME, was implemented successfully for VIOP and all progression criteria for a main trial were met. The lack of difference between groups on fear of falling was unsurprising given it was a pilot study but there may have been other contributory factors including suboptimal exercise dose and apparent low risk of falls in participants. These issues need addressing for a future trial

    Integrating the promotion of physical activity within a smoking cessation programme: Findings from collaborative action research in UK Stop Smoking Services

    Get PDF
    Background: Within the framework of collaborative action research, the aim was to explore the feasibility of developing and embedding physical activity promotion as a smoking cessation aid within UK 6/7-week National Health Service (NHS) Stop Smoking Services. Methods: In Phase 1 three initial cycles of collaborative action research (observation, reflection, planning, implementation and re-evaluation), in an urban Stop Smoking Service, led to the development of an integrated intervention in which physical activity was promoted as a cessation aid, with the support of a theoretically based self-help guide, and self monitoring using pedometers. In Phase 2 advisors underwent training and offered the intervention, and changes in physical activity promoting behaviour and beliefs were monitored. Also, changes in clients’ stage of readiness to use physical activity as a cessation aid, physical activity beliefs and behaviour and physical activity levels were assessed, among those who attended the clinic at 4-week post-quit. Qualitative data were collected, in the form of clinic observation, informal interviews with advisors and field notes. Results: The integrated intervention emerged through cycles of collaboration as something quite different to previous practice. Based on field notes, there were many positive elements associated with the integrated intervention in Phase 2. Self-reported advisors’ physical activity promoting behaviour increased as a result of training and adapting to the intervention. There was a significant advancement in clients’ stage of readiness to use physical activity as a smoking cessation aid. Conclusions: Collaboration with advisors was key in ensuring that a feasible intervention was developed as an aid to smoking cessation. There is scope to further develop tailored support to increasing physical activity and smoking cessation, mediated through changes in perceptions about the benefits of, and confidence to do physical activity
    • …
    corecore