443 research outputs found

    Domino Tatami Covering is NP-complete

    Full text link
    A covering with dominoes of a rectilinear region is called \emph{tatami} if no four dominoes meet at any point. We describe a reduction from planar 3SAT to Domino Tatami Covering. As a consequence it is NP-complete to decide whether there is a perfect matching of a graph that meets every 4-cycle, even if the graph is restricted to be an induced subgraph of the grid-graph. The gadgets used in the reduction were discovered with the help of a SAT-solver.Comment: 10 pages, accepted at The International Workshop on Combinatorial Algorithms (IWOCA) 201

    MIPS: analysis and annotation of genome information in 2007

    Get PDF
    The Munich Information Center for Protein Sequences (MIPS-GSF, Neuherberg, Germany) combines automatic processing of large amounts of sequences with manual annotation of selected model genomes. Due to the massive growth of the available data, the depth of annotation varies widely between independent databases. Also, the criteria for the transfer of information from known to orthologous sequences are diverse. To cope with the task of global in-depth genome annotation has become unfeasible. Therefore, our efforts are dedicated to three levels of annotation: (i) the curation of selected genomes, in particular from fungal and plant taxa (e.g. CYGD, MNCDB, MatDB), (ii) the comprehensive, consistent, automatic annotation employing exhaustive methods for the computation of sequence similarities and sequence-related attributes as well as the classification of individual sequences (SIMAP, PEDANT and FunCat) and (iii) the compilation of manually curated databases for protein interactions based on scrutinized information from the literature to serve as an accepted set of reliable annotated interaction data (MPACT, MPPI, CORUM). All databases and tools described as well as the detailed descriptions of our projects can be accessed through the MIPS web server (http://mips.gsf.de)

    Impact of smoking status on the relative efficacy of the EGFR TKI/angiogenesis inhibitor combination therapy in advanced NSCLC-a systematic review and meta-analysis.

    Get PDF
    BACKGROUND The ETOP 10-16 BOOSTER trial failed to demonstrate a progression-free survival (PFS) benefit for adding bevacizumab to osimertinib in second line. An exploratory subgroup analysis, however, suggested a PFS benefit of the combination in patients with a smoking history and prompted us to do this study. METHODS A systematic review and meta-analysis to evaluate the differential effect of smoking status on the benefit of adding an angiogenesis inhibitor to epidermal growth factor receptor (EGFR)-tyrosine kinase inhibitor therapy was carried out. All relevant randomized controlled trials appearing in main oncology congresses or in PubMed as of 1 November 2021 were used according to the Preferred Reporting Items for Systematic Review and Meta-Analyses statement. Primarily PFS according to smoking status, and secondarily overall survival (OS) were of interest. Pooled and interaction hazard ratios (HRs) were estimated by fixed or random effects models, depending on the detected degree of heterogeneity. Bias was assessed using the revised Cochrane tool for randomized controlled trials (RoB 2). RESULTS Information by smoking was available for 1291 patients for PFS (seven studies) and 678 patients for OS (four studies). The risk of bias was low for all studies. Combination treatment significantly prolonged PFS for smokers [n = 502, HR = 0.55, 95% confidence interval (CI): 0.44-0.69] but not for nonsmokers (n = 789, HR = 0.92, 95% CI: 0.66-1.27; treatment-by-smoking interaction P = 0.02). Similarly, a significant OS benefit was found for smokers (n = 271, HR = 0.66, 95% CI: 0.47-0.93) but not for nonsmokers (n = 407, HR = 1.07, 95% CI: 0.82-1.42; treatment-by-smoking interaction P = 0.03). CONCLUSION In advanced EGFR-non-small-cell lung cancer patients, the addition of an angiogenesis inhibitor to EGFR-tyrosine kinase inhibitor therapy provides a statistically significant PFS and OS benefit in smokers, but not in non-smokers. The biological basis for this observation should be pursued and could determine whether this might be due to a specific co-mutational pattern produced by tobacco exposure

    Evaluation of clustering algorithms for gene expression data

    Get PDF
    BACKGROUND: Cluster analysis is an integral part of high dimensional data analysis. In the context of large scale gene expression data, a filtered set of genes are grouped together according to their expression profiles using one of numerous clustering algorithms that exist in the statistics and machine learning literature. A closely related problem is that of selecting a clustering algorithm that is "optimal" in some sense from a rather impressive list of clustering algorithms that currently exist. RESULTS: In this paper, we propose two validation measures each with two parts: one measuring the statistical consistency (stability) of the clusters produced and the other representing their biological functional congruence. Smaller values of these indices indicate better performance for a clustering algorithm. We illustrate this approach using two case studies with publicly available gene expression data sets: one involving a SAGE data of breast cancer patients and the other involving a time course cDNA microarray data on yeast. Six well known clustering algorithms UPGMA, K-Means, Diana, Fanny, Model-Based and SOM were evaluated. CONCLUSION: No single clustering algorithm may be best suited for clustering genes into functional groups via expression profiles for all data sets. The validation measures introduced in this paper can aid in the selection of an optimal algorithm, for a given data set, from a collection of available clustering algorithms

    Extending CATH: increasing coverage of the protein structure universe and linking structure with function

    Get PDF
    CATH version 3.3 (class, architecture, topology, homology) contains 128 688 domains, 2386 homologous superfamilies and 1233 fold groups, and reflects a major focus on classifying structural genomics (SG) structures and transmembrane proteins, both of which are likely to add structural novelty to the database and therefore increase the coverage of protein fold space within CATH. For CATH version 3.4 we have significantly improved the presentation of sequence information and associated functional information for CATH superfamilies. The CATH superfamily pages now reflect both the functional and structural diversity within the superfamily and include structural alignments of close and distant relatives within the superfamily, annotated with functional information and details of conserved residues. A significantly more efficient search function for CATH has been established by implementing the search server Solr (http://lucene.apache.org/solr/). The CATH v3.4 webpages have been built using the Catalyst web framework

    Experimental Demonstration of Multidimensional Switching Nodes for All-Optical Data Center Networks

    Get PDF
    This paper reports on a novel ring-based data center architecture composed of multidimensional switching nodes. The nodes are interconnected with multicore fibers and can provide switching in three different physical, hierarchically overlaid dimensions (space, wavelength, and time). The proposed architecture allows for scaling in different dimensions while at the same time providing support for connections with different granularity. The ring topology reduces the number of different physical links required, leading to simplified cabling and easier link management, while optical bypass holds the prospect of low latency and low-power consumption. The performance of the multidimensional switching nodes has been investigated in an experimental demonstration comprising three network nodes connected with multicore fibers. Both high capacity wavelength connections and time-shared subwavelength connections have been established for connecting different nodes by switching in different physical dimensions. Error-free performance (BER < 10-9) has been achieved for all the connections with various granularity in all the investigated switching scenarios. The scalability of the system has been studied by increasing the transmission capacity to 1 Tbit/s/core equivalent to 7 Tbit/s total throughput in a single seven-core multicore fiber. The error-free performance (BER < 10-9) for all the connections confirms that the proposed architecture can meet the existing demands in data centers and accommodate the future traffic growth

    Enrichment of homologs in insignificant BLAST hits by co-complex network alignment

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Homology is a crucial concept in comparative genomics. The algorithm probably most widely used for homology detection in comparative genomics, is BLAST. Usually a stringent score cutoff is applied to distinguish putative homologs from possible false positive hits. As a consequence, some BLAST hits are discarded that are in fact homologous.</p> <p>Results</p> <p>Analogous to the use of the genomics context in genome alignments, we test whether conserved functional context can be used to select candidate homologs from insignificant BLAST hits. We make a co-complex network alignment between complex subunits in yeast and human and find that proteins with an insignificant BLAST hit that are part of homologous complexes, are likely to be homologous themselves. Further analysis of the distant homologs we recovered using the co-complex network alignment, shows that a large majority of these distant homologs are in fact ancient paralogs.</p> <p>Conclusions</p> <p>Our results show that, even though evolution takes place at the sequence and genome level, co-complex networks can be used as circumstantial evidence to improve confidence in the homology of distantly related sequences.</p

    Fungal Virulence and Development Is Regulated by Alternative Pre-mRNA 3′End Processing in Magnaporthe oryzae

    Get PDF
    RNA-binding proteins play a central role in post-transcriptional mechanisms that control gene expression. Identification of novel RNA-binding proteins in fungi is essential to unravel post-transcriptional networks and cellular processes that confer identity to the fungal kingdom. Here, we carried out the functional characterisation of the filamentous fungus-specific RNA-binding protein RBP35 required for full virulence and development in the rice blast fungus. RBP35 contains an N-terminal RNA recognition motif (RRM) and six Arg-Gly-Gly tripeptide repeats. Immunoblots identified two RBP35 protein isoforms that show a steady-state nuclear localisation and bind RNA in vitro. RBP35 coimmunoprecipitates in vivo with Cleavage Factor I (CFI) 25 kDa, a highly conserved protein involved in polyA site recognition and cleavage of pre-mRNAs. Several targets of RBP35 have been identified using transcriptomics including 14-3-3 pre-mRNA, an important integrator of environmental signals. In Magnaporthe oryzae, RBP35 is not essential for viability but regulates the length of 3′UTRs of transcripts with developmental and virulence-associated functions. The Δrbp35 mutant is affected in the TOR (target of rapamycin) signaling pathway showing significant changes in nitrogen metabolism and protein secretion. The lack of clear RBP35 orthologues in yeast, plants and animals indicates that RBP35 is a novel auxiliary protein of the polyadenylation machinery of filamentous fungi. Our data demonstrate that RBP35 is the fungal equivalent of metazoan CFI 68 kDa and suggest the existence of 3′end processing mechanisms exclusive to the fungal kingdom

    AlignNemo: A Local Network Alignment Method to Integrate Homology and Topology

    Get PDF
    Local network alignment is an important component of the analysis of protein-protein interaction networks that may lead to the identification of evolutionary related complexes. We present AlignNemo, a new algorithm that, given the networks of two organisms, uncovers subnetworks of proteins that relate in biological function and topology of interactions. The discovered conserved subnetworks have a general topology and need not to correspond to specific interaction patterns, so that they more closely fit the models of functional complexes proposed in the literature. The algorithm is able to handle sparse interaction data with an expansion process that at each step explores the local topology of the networks beyond the proteins directly interacting with the current solution. To assess the performance of AlignNemo, we ran a series of benchmarks using statistical measures as well as biological knowledge. Based on reference datasets of protein complexes, AlignNemo shows better performance than other methods in terms of both precision and recall. We show our solutions to be biologically sound using the concept of semantic similarity applied to Gene Ontology vocabularies. The binaries of AlignNemo and supplementary details about the algorithms and the experiments are available at: sourceforge.net/p/alignnemo
    corecore