27 research outputs found

    Scanner calibration revisited

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Calibration of a microarray scanner is critical for accurate interpretation of microarray results. Shi et al. (<it>BMC Bioinformatics</it>, 2005, <b>6</b>, Art. No. S11 Suppl. 2.) reported usage of a Full Moon BioSystems slide for calibration. Inspired by the Shi et al. work, we have calibrated microarray scanners in our previous research. We were puzzled however, that most of the signal intensities from a biological sample fell below the sensitivity threshold level determined by the calibration slide. This conundrum led us to re-investigate the quality of calibration provided by the Full Moon BioSystems slide as well as the accuracy of the analysis performed by Shi et al.</p> <p>Methods</p> <p>Signal intensities were recorded on three different microarray scanners at various photomultiplier gain levels using the same calibration slide from Full Moon BioSystems. Data analysis was conducted on raw signal intensities without normalization or transformation of any kind. Weighted least-squares method was used to fit the data.</p> <p>Results</p> <p>We found that initial analysis performed by Shi et al. did not take into account autofluorescence of the Full Moon BioSystems slide, which led to a grossly distorted microarray scanner response. Our analysis revealed that a power-law function, which is explicitly accounting for the slide autofluorescence, perfectly described a relationship between signal intensities and fluorophore quantities.</p> <p>Conclusions</p> <p>Microarray scanners respond in a much less distorted fashion than was reported by Shi et al. Full Moon BioSystems calibration slides are inadequate for performing calibration. We recommend against using these slides.</p

    Molecular taxonomy. Bioinformatics and practical evaluation

    Get PDF
    Summary Molecular taxonomy is a field that studies the diversity of organisms based on molecular markers. This work is devoted to develop a methodology of molecular taxonomy of small organisms. The ribosomal RNA (rRNA) is used as a molecular marker since its nucleotide sequence includes stretches of various levels of conservation, which can be used as species, genus and taxa specific regions. The organisms live in complex communities. To discover the composition of these communities, a hybridization assay employing oligonucleotide microarrays is developed to indicate the presence of a certain rRNA, in a sample under investigation. An additional method based on the pyrosequencing process is proposed here. In this case the mixture of rRNA genes is directly sequenced and the proportion of individual sequences is then calculated from the obtained pyrogram. The work comprises two parts: theoretical bioinformatics and practical evaluation. The first part tackles the problem of DNA-RNA duplex stability prediction. As a result, an ad hoc stability function is proposed. An algorithm and a program are developed for the design of oligonucleotides employed in the microarray approach. The kinetics of DNA-RNA duplex dissociation is considered as well. In addition, the formalism of the pyrosequencing approach is elaborated theoretically. The experimental part deals with the issues of oligonucleotide microarray establishment, including fabrication, immobilization, hybridization and scanning. A real-time kinetic setup for observing the RNA-DNA duplex dissociation was developed. The theoretical findings and quality of the oligonucleotide design are practically evaluated. The theory is found to be in a good accordance with experiment. The pyrosequencing approach is tested as well and is demonstrated to have enough power to discover the composition of a complex mixture of rRNA genes

    An algorithm for the determination and quantification of components of nucleic acid mixtures based on single sequencing reactions

    Get PDF
    BACKGROUND: Determination and quantification of nucleic acid components in a mixture is usually accomplished by microarray approaches, where the mixtures are hybridized against specific probes. As an alternative, we propose here that a single sequencing reaction from a mixture of nucleic acids holds enough information to potentially distinguish the different components, provided it is known which components can occur in the mixture. RESULTS: We describe an algorithm that is based on a set of linear equations which can be solved when the sequencing profiles of the individual components are known and when the number of sequenced nucleotides is larger than the number of components in the mixture. We have implemented the procedure for one type of sequencing approach, pyrosequencing, which produces a stepwise output of peaks that is particularly suitable for the procedure. As an example we use signature sequences from ribosomal RNA to distinguish and quantify several different species in a mixture. Using simulations, we show that the procedure may also be applicable for dideoxy sequencing on capillary sequencers, requiring only some instrument specific adaptations of protocols and software. CONCLUSION: The parallel sequencing approach described here may become a simple and cheap alternative to microarray experiments which aim at routine re-determination and quantification of known nucleic acid components from environmental samples or tissue samples

    Beyond Affymetrix Arrays: Expanding the Set of Known Hybridization Isotherms and Observing Pre-Wash Signal Intensities

    Get PDF
    Microarray hybridization studies have attributed the nonlinearity of hybridization isotherms to probe saturation and post-hybridization washing. Both processes are thought to distort \u27true\u27 target abundance because immobilized probes are saturated with excess target and stringent washing removes loosely bound targets. Yet the paucity of studies aimed at understanding hybridization and dissociation makes it difficult to align physicochemical theory to microarray results. To fill the void, we first examined hybridization isotherms generated on different microarray platforms using a ribosomal RNA target and then investigated hybridization signals at equilibrium and after stringent wash. Hybridization signal at equilibrium was achieved by treating the microarray with isopropanol, which prevents nucleic acids from dissolving into solution. Our results suggest that (i) the shape of hybridization isotherms varied by microarray platform with some being hyperbolic or linear, and others following a power-law; (ii) at equilibrium, fluorescent signal of different probes hybridized to the same target were not similar even with excess of target and (iii) the amount of target removed by stringent washing depended upon the hybridization time, the probe sequence and the presence/absence of nonspecific targets. Possible physicochemical interpretations of the results and future studies are discussed

    Generation, Analysis and Functional Annotation of Expressed Sequence Tags From the Sheepshead Minnow (\u3ci\u3eCyprinodon variegatus\u3c/i\u3e)

    Get PDF
    Background: Sheepshead minnow (Cyprinodon variegatus) are small fish capable of withstanding exposure to very low levels of dissolved oxygen, as well as extreme temperatures and salinities. It is an important model in understanding the impacts and biological response to hypoxia and co-occurring compounding stressors such as polycyclic aromatic hydrocarbons, endocrine disrupting chemicals, metals and herbicides. Here, we initiated a project to sequence and analyze over 10,000 ESTs generated from the Sheepshead minnow (Cyprinodon variegatus) as a resource for investigating stressor responses. Results: We sequenced 10,858 EST clones using a normalized cDNA library made from larval, embryonic and adult suppression subtractive hybridization-PCR (SSH) libraries. Post- sequencing processing led to 8,099 high quality sequences. Clustering analysis of these ESTs indentified 4,223 unique sequences containing 1,053 contigs and 3,170 singletons. BLASTX searches produced 1,394 significant (E-value \u3c 10(-5)) hits and further Gene Ontology (GO) analysis annotated 388 of these genes. All the EST sequences were deposited by Expressed Sequence Tags database (dbEST) in GenBank (GenBank: GE329585 to GE337683). Gene discovery and annotations are presented and discussed. This set of ESTs represents a significant proportion of the Sheepshead minnow (Cyprinodon variegatus) transcriptome, and provides a material basis for the development of microarrays useful for further gene expression studies in association with stressors such as hypoxia, cadmium, chromium and pyrene

    Constructing a fish metabolic network model

    Get PDF
    We report the construction of a genome-wide fish metabolic network model, MetaFishNet, and its application to analyzing high throughput gene expression data. This model is a stepping stone to broader applications of fish systems biology, for example by guiding study design through comparison with human metabolism and the integration of multiple data types. MetaFishNet resources, including a pathway enrichment analysis tool, are accessible at http://metafishnet.appspot.com

    An algorithm and program for finding sequence specific oligo-nucleotide probes for species identification

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The identification of species or species groups with specific oligo-nucleotides as molecular signatures is becoming increasingly popular for bacterial samples. However, it shows also great promise for other small organisms that are taxonomically difficult to tract.</p> <p>Results</p> <p>We have devised here an algorithm that aims to find the optimal probes for any given set of sequences. The program requires only a crude alignment of these sequences as input and is optimized for performance to deal also with very large datasets. The algorithm is designed such that the position of mismatches in the probes influences the selection and makes provision of single nucleotide outloops. Program implementations are available for Linux and Windows.</p

    Cryptic sequence features in the active postmortem transcriptome

    No full text
    Abstract Background Our previous study found that more than 500 transcripts significantly increased in abundance in the zebrafish and mouse several hours to days postmortem relative to live controls. The current literature suggests that most mRNAs are post-transcriptionally regulated in stressful conditions. We rationalized that the postmortem transcripts must contain sequence features (3- to 9- mers) that are unique from those in the rest of the transcriptome and that these features putatively serve as binding sites for proteins and/or non-coding RNAs involved in post-transcriptional regulation. Results We identified 5117 and 2245 over-represented sequence features in the mouse and zebrafish, respectively, which represents less than 1.5% of all possible features. Some of these features were disproportionately distributed along the transcripts with high densities in the 3′ untranslated regions of the zebrafish (0.3 mers/nt) and the open reading frames of the mouse (0.6 mers/nt). Yet, the highest density (2.3 mers/nt) occurred in the open reading frames of 11 mouse transcripts that lacked 3′ or 5′ untranslated regions. These results suggest the transcripts with high density of features might serve as ‘molecular sponges’ that sequester RNA binding proteins and/or microRNAs, and thus indirectly increase the stability and gene expression of other transcripts. In addition, some of the features were identified as binding sites for Rbfox and Hud proteins that are also involved in increasing transcript stability and gene expression. Conclusions Our results are consistent with the hypothesis that transcripts involved in responding to extreme stress, such as organismal death, have sequence features that make them different from the rest of the transcriptome. Some of these features serve as putative binding sites for proteins and non-coding RNAs that determine transcript stability and fate. A small number of the transcripts have high density sequence features, which are presumably involved in sequestering RNA binding proteins and microRNAs and thus preventing regulatory interactions among other transcripts. Our results provide baseline data on post-transcriptional regulation in stressful conditions that has implications for regulation in disease, starvation, and cancer
    corecore