Search CORE

953 research outputs found

Designing labeled graph classifiers by exploiting the R\'enyi entropy of the dissimilarity representation

Author: Livi Lorenzo
Publication venue: 'MDPI AG'
Publication date: 20/04/2017
Field of study

Representing patterns as labeled graphs is becoming increasingly common in the broad field of computational intelligence. Accordingly, a wide repertoire of pattern recognition tools, such as classifiers and knowledge discovery procedures, are nowadays available and tested for various datasets of labeled graphs. However, the design of effective learning procedures operating in the space of labeled graphs is still a challenging problem, especially from the computational complexity viewpoint. In this paper, we present a major improvement of a general-purpose classifier for graphs, which is conceived on an interplay between dissimilarity representation, clustering, information-theoretic techniques, and evolutionary optimization algorithms. The improvement focuses on a specific key subroutine devised to compress the input data. We prove different theorems which are fundamental to the setting of the parameters controlling such a compression operation. We demonstrate the effectiveness of the resulting classifier by benchmarking the developed variants on well-known datasets of labeled graphs, considering as distinct performance indicators the classification accuracy, computing time, and parsimony in terms of structural complexity of the synthesized classification models. The results show state-of-the-art standards in terms of test set accuracy and a considerable speed-up for what concerns the computing time.Comment: Revised versio

arXiv.org e-Print Archive

Multidisciplinary Digital Publishing Institute

Directory of Open Access Journals

Uncertainty shocks of Trump election in an interval model of stock market

Author: Qiao Kenan
Sun Yuying
Wang Shouyang
Publication venue: 'Informa UK Limited'
Publication date: 01/01/2021
Field of study

This paper proposes a new class of nonlinear interval models for interval-valued time series. By matching the interval model with interval observations, we develop a nonlinear minimum-distance estimation method for the proposed models, and establish the asymptotic theory for the proposed estimators. Superior to traditional point-based methods, the proposed interval modelling approach can assess the change in both the trend and volatility simultaneously. Within the proposed interval framework, this paper examines the impact of the 2016 US presidential election (henceforth Trump election) on the US stock market as a case study. Considering the validity of daily high-low range as a proxy of market efficiency, we employ an interval-valued return to jointly measure the fundamental value movement and market efficiency simultaneously. Empirical results suggest a strong evidence that the Trump election has increased the level/trend and lowered the volatility of the S&P 500 index in both ex ante and ex post analysis. Furthermore, a longer half-life period for the impact on fundamental value (62.4 days) than high-low range (15.9 days) has shown that the impact of Trump's victory on fundamental value is more persistent than its impact on market efficiency

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

A precise bare simulation approach to the minimization of some distances. Foundations

Author: Broniatowski Michel
Stummer Wolfgang
Publication venue
Publication date: 04/07/2021
Field of study

In information theory -- as well as in the adjacent fields of statistics, machine learning, artificial intelligence, signal processing and pattern recognition -- many flexibilizations of the omnipresent Kullback-Leibler information distance (relative entropy) and of the closely related Shannon entropy have become frequently used tools. To tackle corresponding constrained minimization (respectively maximization) problems by a newly developed dimension-free bare (pure) simulation method, is the main goal of this paper. Almost no assumptions (like convexity) on the set of constraints are needed, within our discrete setup of arbitrary dimension, and our method is precise (i.e., converges in the limit). As a side effect, we also derive an innovative way of constructing new useful distances/divergences. To illustrate the core of our approach, we present numerous examples. The potential for widespread applicability is indicated, too; in particular, we deliver many recent references for uses of the involved distances/divergences and entropies in various different research fields (which may also serve as an interdisciplinary interface)

arXiv.org e-Print Archive

Models of Integration Given Multiple Sources of Information

Author: D. Friedman
D. Massaro
Publication venue
Publication date
Field of study

Research Papers in Economics

ISIPTA'07: Proceedings of the Fifth International Symposium on Imprecise Probability: Theories and Applications

Author: De Cooman Gert
Vejnarová Jirina
Zaffalon Marco
Publication venue: SIPTA - International Society for Imprecise Probability: Theories and Applications
Publication date: 01/01/2007
Field of study

Ghent University Academic Bibliography

Archivsystem Ask23

Dependence methods for financial time series with application to portfolio diversification

Author: WANG HAO
Publication venue
Publication date: 14/03/2015
Field of study

Pubblicazioni Aperte Digitali Interateneo Sapienza

Archivio della ricerca- Università di Roma La Sapienza

Dependence methods for financial time series with application to portfolio diversification

Author: WANG HAO
Publication venue
Publication date: 14/03/2015
Field of study

Archivio della ricerca- Università di Roma La Sapienza

Recommended from our members

Statistical aspects of credit scoring

Author: Henley William Edward
Publication venue
Publication date: 01/01/1995
Field of study

This thesis is concerned with statistical aspects of credit scoring, the process of determining how likely an applicant for credit is to default with repayments. In Chapters 1-4 a detailed introduction to credit scoring methodology is presented, including evaluation of previous published work on credit scoring and a review of discrimination and classification techniques. In Chapter 5 we describe different approaches to measuring the absolute and relative performance of credit scoring models. Two significance tests are proposed for comparing the bad rate amongst the accepts (or the error rate) from two classifiers. In Chapter 6 we consider different approaches to reject inference, the procedure of allocating class membership probabilities to the rejects. One reason for needing reject inference is to reduce the sample selection bias that results from using a sample consisting only of accepted applicants to build new scorecards. We show that the characteristic vectors for the rejects do not contain information about the parameters of the observed data likelihood, unless extra information or assumptions are included. Methods of reject inference which incorporate additional information are proposed. In Chapter 7 we make comparisons of a range of different parametric and nonparametric classification techniques for credit scoring: linear regression, logistic regression, projection pursuit regression, Poisson regression, decision trees and decision graphs. We conclude that classifier performance is fairly insensitive to the particular technique adopted. In Chapter 8 we describe the application of the k-NN method to credit scoring. We propose using an adjusted version of the Eucidean distance metric, which is designed to incorporate knowledge of class separation contained in the data. We evaluate properties of the k-NN classifier through empirical studies and make comparisons with existing techniques

Open Research Online (The Open University)