347,238 research outputs found

    Infantile Convulsions with Paroxysmal Dyskinesia (ICCA Syndrome) and Copy Number Variation at Human Chromosome 16p11

    Get PDF
    BACKGROUND: Benign infantile convulsions and paroxysmal dyskinesia are episodic cerebral disorders that can share common genetic bases. They can be co-inherited as one single autosomal dominant trait (ICCA syndrome); the disease ICCA gene maps at chromosome 16p12-q12. Despite intensive and conventional mutation screening, the ICCA gene remains unknown to date. The critical area displays highly complicated genomic architecture and is the site of deletions and duplications associated with various diseases. The possibility that the ICCA syndrome is related to the existence of large-scale genomic alterations was addressed in the present study. METHODOLOGY/PRINCIPAL FINDINGS: A combination of whole genome and dedicated oligonucleotide array comparative genomic hybridization coupled with quantitative polymerase chain reaction was used. Low copy number of a region corresponding to a genomic variant (Variation_7105) located at 16p11 nearby the centromere was detected with statistical significance at much higher frequency in patients from ICCA families than in ethnically matched controls. The genomic variant showed no apparent difference in size and copy number between patients and controls, making it very unlikely that the genomic alteration detected here is ICCA-specific. Furthermore, no other genomic alteration that would directly cause the ICCA syndrome in those nine families was detected in the ICCA critical area. CONCLUSIONS/SIGNIFICANCE: Our data excluded that inherited genomic deletion or duplication events directly cause the ICCA syndrome; rather, they help narrowing down the critical ICCA region dramatically and indicate that the disease ICCA genetic defect lies very close to or within Variation_7105 and hence should now be searched in the corresponding genomic area and its surrounding regions

    Elite Bases Regression: A Real-time Algorithm for Symbolic Regression

    Full text link
    Symbolic regression is an important but challenging research topic in data mining. It can detect the underlying mathematical models. Genetic programming (GP) is one of the most popular methods for symbolic regression. However, its convergence speed might be too slow for large scale problems with a large number of variables. This drawback has become a bottleneck in practical applications. In this paper, a new non-evolutionary real-time algorithm for symbolic regression, Elite Bases Regression (EBR), is proposed. EBR generates a set of candidate basis functions coded with parse-matrix in specific mapping rules. Meanwhile, a certain number of elite bases are preserved and updated iteratively according to the correlation coefficients with respect to the target model. The regression model is then spanned by the elite bases. A comparative study between EBR and a recent proposed machine learning method for symbolic regression, Fast Function eXtraction (FFX), are conducted. Numerical results indicate that EBR can solve symbolic regression problems more effectively.Comment: The 2017 13th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD 2017

    FVQA: Fact-based Visual Question Answering

    Full text link
    Visual Question Answering (VQA) has attracted a lot of attention in both Computer Vision and Natural Language Processing communities, not least because it offers insight into the relationships between two important sources of information. Current datasets, and the models built upon them, have focused on questions which are answerable by direct analysis of the question and image alone. The set of such questions that require no external information to answer is interesting, but very limited. It excludes questions which require common sense, or basic factual knowledge to answer, for example. Here we introduce FVQA, a VQA dataset which requires, and supports, much deeper reasoning. FVQA only contains questions which require external information to answer. We thus extend a conventional visual question answering dataset, which contains image-question-answerg triplets, through additional image-question-answer-supporting fact tuples. The supporting fact is represented as a structural triplet, such as . We evaluate several baseline models on the FVQA dataset, and describe a novel model which is capable of reasoning about an image on the basis of supporting facts.Comment: 16 page

    Science and Technology Cooperation in Cross-border Regions::A Proximity Approach with Evidence for Northern Europe

    Get PDF
    Given the sheer number of cross-border regions (CBRs) within the EU, their socio-economic importance has been recognized both by policy-makers and academics. Recently, the novel concept of cross-border regional innovation system has been introduced to guide the assessment of integration processes in CBRs. A central focus of this concept is set on analyzing the impact of varying types of proximity (cognitive, technological, etc.) on cross-border cooperation. Previous empirical applications of the concept have, however, relied on individual case studies and varying methodologies, thus complicating and constraining comparisons between different CBRs. Here a broader view is provided by comparing 28 Northern European CBRs. The empirical analysis utilizes economic, science and technology (S&T) statistics to construct proximity indicators and measures S&T integration in the context of cross-border cooperation. The findings from descriptive statistics and exploratory count data regressions show that technological and cognitive proximity measures are significantly related to S&T cooperation activities (cross-border co-publications and co-patents). Taken together, our empirical approach underlines the feasibility of utilizing the proximity approach for comparative analyses in CBR settings

    Identifying statistical dependence in genomic sequences via mutual information estimates

    Get PDF
    Questions of understanding and quantifying the representation and amount of information in organisms have become a central part of biological research, as they potentially hold the key to fundamental advances. In this paper, we demonstrate the use of information-theoretic tools for the task of identifying segments of biomolecules (DNA or RNA) that are statistically correlated. We develop a precise and reliable methodology, based on the notion of mutual information, for finding and extracting statistical as well as structural dependencies. A simple threshold function is defined, and its use in quantifying the level of significance of dependencies between biological segments is explored. These tools are used in two specific applications. First, for the identification of correlations between different parts of the maize zmSRp32 gene. There, we find significant dependencies between the 5' untranslated region in zmSRp32 and its alternatively spliced exons. This observation may indicate the presence of as-yet unknown alternative splicing mechanisms or structural scaffolds. Second, using data from the FBI's Combined DNA Index System (CODIS), we demonstrate that our approach is particularly well suited for the problem of discovering short tandem repeats, an application of importance in genetic profiling.Comment: Preliminary version. Final version in EURASIP Journal on Bioinformatics and Systems Biology. See http://www.hindawi.com/journals/bsb

    A comparative study on global wavelet and polynomial models for nonlinear regime-switching systems

    Get PDF
    A comparative study of wavelet and polynomial models for non-linear Regime-Switching (RS) systems is carried out. RS systems, considered in this study, are a class of severely non-linear systems, which exhibit abrupt changes or dramatic breaks in behaviour, due to RS caused by associated events. Both wavelet and polynomial models are used to describe discontinuous dynamical systems, where it is assumed that no a priori information about the inherent model structure and the relative regime switches of the underlying dynamics is known, but only observed input-output data are available. An Orthogonal Least Squares (OLS) algorithm interfered with by an Error Reduction Ratio (ERR) index and regularised by an Approximate Minimum Description Length (AMDL) criterion, is used to construct parsimonious wavelet and polynomial models. The performance of the resultant wavelet models is compared with that of the relative polynomial models, by inspecting the predictive capability of the associated representations. It is shown from numerical results that wavelet models are superior to polynomial models, in respect of generalisation properties, for describing severely non-linear RS systems
    corecore