248 research outputs found

    Genome Majority Vote Improves Gene Predictions

    Get PDF
    Recent studies have noted extensive inconsistencies in gene start sites among orthologous genes in related microbial genomes. Here we provide the first documented evidence that imposing gene start consistency improves the accuracy of gene start-site prediction. We applied an algorithm using a genome majority vote (GMV) scheme to increase the consistency of gene starts among orthologs. We used a set of validated Escherichia coli genes as a standard to quantify accuracy. Results showed that the GMV algorithm can correct hundreds of gene prediction errors in sets of five or ten genomes while introducing few errors. Using a conservative calculation, we project that GMV would resolve many inconsistencies and errors in publicly available microbial gene maps. Our simple and logical solution provides a notable advance toward accurate gene maps

    Gene prediction in metagenomic fragments: A large scale machine learning approach

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Metagenomics is an approach to the characterization of microbial genomes via the direct isolation of genomic sequences from the environment without prior cultivation. The amount of metagenomic sequence data is growing fast while computational methods for metagenome analysis are still in their infancy. In contrast to genomic sequences of single species, which can usually be assembled and analyzed by many available methods, a large proportion of metagenome data remains as unassembled anonymous sequencing reads. One of the aims of all metagenomic sequencing projects is the identification of novel genes. Short length, for example, Sanger sequencing yields on average 700 bp fragments, and unknown phylogenetic origin of most fragments require approaches to gene prediction that are different from the currently available methods for genomes of single species. In particular, the large size of metagenomic samples requires fast and accurate methods with small numbers of false positive predictions.</p> <p>Results</p> <p>We introduce a novel gene prediction algorithm for metagenomic fragments based on a two-stage machine learning approach. In the first stage, we use linear discriminants for monocodon usage, dicodon usage and translation initiation sites to extract features from DNA sequences. In the second stage, an artificial neural network combines these features with open reading frame length and fragment GC-content to compute the probability that this open reading frame encodes a protein. This probability is used for the classification and scoring of gene candidates. With large scale training, our method provides fast single fragment predictions with good sensitivity and specificity on artificially fragmented genomic DNA. Additionally, this method is able to predict translation initiation sites accurately and distinguishes complete from incomplete genes with high reliability.</p> <p>Conclusion</p> <p>Large scale machine learning methods are well-suited for gene prediction in metagenomic DNA fragments. In particular, the combination of linear discriminants and neural networks is promising and should be considered for integration into metagenomic analysis pipelines. The data sets can be downloaded from the URL provided (see Availability and requirements section).</p

    Vinorelbine/carboplatin vs gemcitabine/carboplatin in advanced NSCLC shows similar efficacy, but different impact of toxicity

    Get PDF
    This randomised phase III study in advanced non-small cell lung cancer (NSCLC) patients was conducted to compare vinorelbine/carboplatin (VC) and gemcitabine/carboplatin (GC) regarding efficacy, health-related quality of life (HRQOL) and toxicity. Chemonaive patients with NSCLC stage IIIB/IV and WHO performance status 0–2 were eligible. No upper age limit was defined. Patients received vinorelbine 25 mg m−2 or gemcitabine 1000 mg m−2 on days 1 and 8 and carboplatin AUC4 on day 1 and three courses with 3-week cycles. HRQOL questionnaires were completed at baseline, before chemotherapy and every 8 weeks until 49 weeks. During 14 months, 432 patients were included (VC, n=218; GC, n=214). Median survival was 7.3 vs 6.4 months, 1-year survival 28 vs 30% and 2-year survival 7 vs 7% in the VC and GC arm, respectively (P=0.89). HRQOL, represented by global QOL, nausea/vomiting, dyspnoea and pain, showed no significant differences. More grade 3–4 anaemia (P<0.01), thrombocytopenia (P<0.01) and transfusions of blood (P<0.01) or platelets (P<0.01) were observed in the GC arm. There was more grade 3–4 leucopoenia (P<0.01) in the VC arm, but the rate of neutropenic infections was the same (P=0.87). In conclusion, overall survival and HRQOL are similar, while grade 3–4 toxicity requiring interventions are less frequent when VC is compared to GC in advanced NSCLC

    Pathway Projector: Web-Based Zoomable Pathway Browser Using KEGG Atlas and Google Maps API

    Get PDF
    BACKGROUND: Biochemical pathways provide an essential context for understanding comprehensive experimental data and the systematic workings of a cell. Therefore, the availability of online pathway browsers will facilitate post-genomic research, just as genome browsers have contributed to genomics. Many pathway maps have been provided online as part of public pathway databases. Most of these maps, however, function as the gateway interface to a specific database, and the comprehensiveness of their represented entities, data mapping capabilities, and user interfaces are not always sufficient for generic usage. METHODOLOGY/PRINCIPAL FINDINGS: We have identified five central requirements for a pathway browser: (1) availability of large integrated maps showing genes, enzymes, and metabolites; (2) comprehensive search features and data access; (3) data mapping for transcriptomic, proteomic, and metabolomic experiments, as well as the ability to edit and annotate pathway maps; (4) easy exchange of pathway data; and (5) intuitive user experience without the requirement for installation and regular maintenance. According to these requirements, we have evaluated existing pathway databases and tools and implemented a web-based pathway browser named Pathway Projector as a solution. CONCLUSIONS/SIGNIFICANCE: Pathway Projector provides integrated pathway maps that are based upon the KEGG Atlas, with the addition of nodes for genes and enzymes, and is implemented as a scalable, zoomable map utilizing the Google Maps API. Users can search pathway-related data using keywords, molecular weights, nucleotide sequences, and amino acid sequences, or as possible routes between compounds. In addition, experimental data from transcriptomic, proteomic, and metabolomic analyses can be readily mapped. Pathway Projector is freely available for academic users at (http://www.g-language.org/PathwayProjector/)

    Encoding optical control in LCK kinase to quantitatively investigate its activity in live cells.

    Get PDF
    LCK is a tyrosine kinase that is essential for initiating T-cell antigen receptor (TCR) signaling. A complete understanding of LCK function is constrained by a paucity of methods to quantitatively study its function within live cells. To address this limitation, we generated LCK*, in which a key active-site lysine is replaced by a photocaged equivalent, using genetic code expansion. This strategy enabled fine temporal and spatial control over kinase activity, thus allowing us to quantify phosphorylation kinetics in situ using biochemical and imaging approaches. We find that autophosphorylation of the LCK active-site loop is indispensable for its catalytic activity and that LCK can stimulate its own activation by adopting a more open conformation, which can be modulated by point mutations. We then show that CD4 and CD8, T-cell coreceptors, can enhance LCK activity, thereby helping to explain their effect in physiological TCR signaling. Our approach also provides general insights into SRC-family kinase dynamics

    Differential hypoglycaemic, anorectic, autonomic and emetic effects of the glucagon-like peptide receptor agonist, exendin-4, in the conscious telemetered ferret.

    Get PDF
    Background: Rodents are incapable of emesis and consequently the emetic potential of glucagon-like peptide-1 receptor (GLP-1R) agonists in studies designed to assess a potential blood glucose lowering action of the compound was missed. Therefore, we investigated if the ferret, a carnivore with demonstrated translation capability in emesis research, would identify the emetic potential of the GLP-1R agonist, exendin-4, and any associated effects on gastric motor function, appetite and cardiovascular homeostasis. Methods: The biological activity of the GLP-1R ligands was investigated in vivo using a glucose tolerance test in pentobarbitone-anesthetised ferrets and in vitro using organ bath studies. Radiotelemetry was used to investigate the effect of exendin-4 on gastric myoelectric activity (GMA) and cardiovascular function in conscious ferrets; behaviour was also simultaneously assessed. Western blot was used to characterize GLP-1R distribution in the gastrointestinal and brain tissues. Results: In anesthetised ferrets, exendin-4 (30 nmol/kg, s.c.) reduced experimentally elevated blood glucose levels by 36.3%, whereas the GLP-1R antagonist, exendin (9–39) (300 nmol/kg, s.c.) antagonised the effect and increased AUC0–120 by 31.0% when injected alone (P < 0.05). In animals with radiotelemetry devices, exendin-4 (100 nmol/kg, s.c.) induced emesis in 1/9 ferrets, but inhibited food intake and decreased heart rate variability (HRV) in all animals (P < 0.05). In the animals not exhibiting emesis, there was no effect on GMA, mean arterial blood pressure, heart rate, or core body temperature. In the ferret exhibiting emesis, there was a shift in the GMA towards bradygastria with a decrease in power, and a concomitant decrease in HRV. Western blot revealed GLP-1R throughout the gastrointestinal tract but exendin-4 (up to 300 nM) and exendin (9–39), failed to contract or relax isolated ferret gut tissues. GLP-1R were found in all major brain regions and the levels were comparable those in the vagus nerve. Conclusions: Peripherally administered exendin-4 reduced blood glucose and inhibited feeding with a low emetic potential similar to that in humans (11% vs 12.8%). A disrupted GMA only occurred in the animal exhibiting emesis raising the possibility that disruption of the GMA may influence the probability of emesis occurring in response to treatment with GLP-1R agonists

    Comparative Microbial Modules Resource: Generation and Visualization of Multi-species Biclusters

    Get PDF
    The increasing abundance of large-scale, high-throughput datasets for many closely related organisms provides opportunities for comparative analysis via the simultaneous biclustering of datasets from multiple species. These analyses require a reformulation of how to organize multi-species datasets and visualize comparative genomics data analyses results. Recently, we developed a method, multi-species cMonkey, which integrates heterogeneous high-throughput datatypes from multiple species to identify conserved regulatory modules. Here we present an integrated data visualization system, built upon the Gaggle, enabling exploration of our method's results (available at http://meatwad.bio.nyu.edu/cmmr.html). The system can also be used to explore other comparative genomics datasets and outputs from other data analysis procedures – results from other multiple-species clustering programs or from independent clustering of different single-species datasets. We provide an example use of our system for two bacteria, Escherichia coli and Salmonella Typhimurium. We illustrate the use of our system by exploring conserved biclusters involved in nitrogen metabolism, uncovering a putative function for yjjI, a currently uncharacterized gene that we predict to be involved in nitrogen assimilation

    Clinical use of biomarkers of survival in pulmonary fibrosis

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Biologic predictors or biomarkers of survival in pulmonary fibrosis with a worse prognosis, more specifically in idiopathic pulmonary fibrosis would help the clinician in deciding whether or not to treat since treatment carries a potential risk for adverse events. These decisions are made easier if accurate and objective measurements of the patients' clinical status can predict the risk of progression to death.</p> <p>Method</p> <p>A literature review is given on different biomarkers of survival in interstitial lung disease, mainly in IPF, since this disease has the worst prognosis.</p> <p>Conclusion</p> <p>Serum biomarkers, and markers measured by medical imaging as HRCT, pertechnegas, DTPA en FDG-PET are not ready for clinical use to predict mortality in different forms of ILD. A baseline FVC, a change of FVC of more than 10%, and change in 6MWD are clinically helpful predictors of survival.</p
    corecore