191 research outputs found
Correlation, hierarchies, and networks in financial markets
We discuss some methods to quantitatively investigate the properties of
correlation matrices. Correlation matrices play an important role in portfolio
optimization and in several other quantitative descriptions of asset price
dynamics in financial markets. Specifically, we discuss how to define and
obtain hierarchical trees, correlation based trees and networks from a
correlation matrix. The hierarchical clustering and other procedures performed
on the correlation matrix to detect statistically reliable aspects of the
correlation matrix are seen as filtering procedures of the correlation matrix.
We also discuss a method to associate a hierarchically nested factor model to a
hierarchical tree obtained from a correlation matrix. The information retained
in filtering procedures and its stability with respect to statistical
fluctuations is quantified by using the Kullback-Leibler distance.Comment: 37 pages, 9 figures, 3 table
When do improved covariance matrix estimators enhance portfolio optimization? An empirical comparative study of nine estimators
The use of improved covariance matrix estimators as an alternative to the
sample estimator is considered an important approach for enhancing portfolio
optimization. Here we empirically compare the performance of 9 improved
covariance estimation procedures by using daily returns of 90 highly
capitalized US stocks for the period 1997-2007. We find that the usefulness of
covariance matrix estimators strongly depends on the ratio between estimation
period T and number of stocks N, on the presence or absence of short selling,
and on the performance metric considered. When short selling is allowed,
several estimation methods achieve a realized risk that is significantly
smaller than the one obtained with the sample covariance method. This is
particularly true when T/N is close to one. Moreover many estimators reduce the
fraction of negative portfolio weights, while little improvement is achieved in
the degree of diversification. On the contrary when short selling is not
allowed and T>N, the considered methods are unable to outperform the sample
covariance in terms of realized risk but can give much more diversified
portfolios than the one obtained with the sample covariance. When T<N the use
of the sample covariance matrix and of the pseudoinverse gives portfolios with
very poor performance.Comment: 30 page
Correlation filtering in financial time series
We apply a method to filter relevant information from the correlation
coefficient matrix by extracting a network of relevant interactions. This
method succeeds to generate networks with the same hierarchical structure of
the Minimum Spanning Tree but containing a larger amount of links resulting in
a richer network topology allowing loops and cliques. In Tumminello et al.
\cite{TumminielloPNAS05}, we have shown that this method, applied to a
financial portfolio of 100 stocks in the USA equity markets, is pretty
efficient in filtering relevant information about the clustering of the system
and its hierarchical structure both on the whole system and within each
cluster. In particular, we have found that triangular loops and 4 element
cliques have important and significant relations with the market structure and
properties. Here we apply this filtering procedure to the analysis of
correlation in two different kind of interest rate time series (16 Eurodollars
and 34 US interest rates).Comment: 10 pages 7 figure
Economic sector identification in a set of stocks traded at the New York Stock Exchange: a comparative analysis
We review some methods recently used in the literature to detect the
existence of a certain degree of common behavior of stock returns belonging to
the same economic sector. Specifically, we discuss methods based on random
matrix theory and hierarchical clustering techniques. We apply these methods to
a set of stocks traded at the New York Stock Exchange. The investigated time
series are recorded at a daily time horizon.
All the considered methods are able to detect economic information and the
presence of clusters characterized by the economic sector of stocks. However,
different methodologies provide different information about the considered set.
Our comparative analysis suggests that the application of just a single method
could not be able to extract all the economic information present in the
correlation coefficient matrix of a set of stocks.Comment: 13 pages, 8 figures, 2 Table
Sector identification in a set of stock return time series traded at the London Stock Exchange
We compare some methods recently used in the literature to detect the
existence of a certain degree of common behavior of stock returns belonging to
the same economic sector. Specifically, we discuss methods based on random
matrix theory and hierarchical clustering techniques. We apply these methods to
a portfolio of stocks traded at the London Stock Exchange. The investigated
time series are recorded both at a daily time horizon and at a 5-minute time
horizon. The correlation coefficient matrix is very different at different time
horizons confirming that more structured correlation coefficient matrices are
observed for long time horizons. All the considered methods are able to detect
economic information and the presence of clusters characterized by the economic
sector of stocks. However different methods present a different degree of
sensitivity with respect to different sectors. Our comparative analysis
suggests that the application of just a single method could not be able to
extract all the economic information present in the correlation coefficient
matrix of a stock portfolio.Comment: 28 pages, 13 figures, 3 Tables. Proceedings of the conference on
"Applications of Random Matrices to Economy and other Complex Systems",
Krakow (Poland), May 25-28 2005. Submitted for pubblication to Acta Phys. Po
Generation of hierarchically correlated multivariate symbolic sequences
We introduce an algorithm to generate multivariate series of symbols from a
finite alphabet with a given hierarchical structure of similarities. The target
hierarchical structure of similarities is arbitrary, for instance the one
obtained by some hierarchical clustering procedure as applied to an empirical
matrix of Hamming distances. The algorithm can be interpreted as the finite
alphabet equivalent of the recently introduced hierarchically nested factor
model (M. Tumminello et al. EPL 78 (3) 30006 (2007)). The algorithm is based on
a generating mechanism that is different from the one used in the mutation rate
approach. We apply the proposed methodology for investigating the relationship
between the bootstrap value associated with a node of a phylogeny and the
probability of finding that node in the true phylogeny.Comment: 7 pages, 6 figures, 1 tabl
Community characterization of heterogeneous complex systems
We introduce an analytical statistical method to characterize the communities
detected in heterogeneous complex systems. By posing a suitable null
hypothesis, our method makes use of the hypergeometric distribution to assess
the probability that a given property is over-expressed in the elements of a
community with respect to all the elements of the investigated set. We apply
our method to two specific complex networks, namely a network of world movies
and a network of physics preprints. The characterization of the elements and of
the communities is done in terms of languages and countries for the movie
network and of journals and subject categories for papers. We find that our
method is able to characterize clearly the identified communities. Moreover our
method works well both for large and for small communities.Comment: 8 pages, 1 figure and 2 table
Statistically validated networks in bipartite complex systems
Many complex systems present an intrinsic bipartite nature and are often
described and modeled in terms of networks [1-5]. Examples include movies and
actors [1, 2, 4], authors and scientific papers [6-9], email accounts and
emails [10], plants and animals that pollinate them [11, 12]. Bipartite
networks are often very heterogeneous in the number of relationships that the
elements of one set establish with the elements of the other set. When one
constructs a projected network with nodes from only one set, the system
heterogeneity makes it very difficult to identify preferential links between
the elements. Here we introduce an unsupervised method to statistically
validate each link of the projected network against a null hypothesis taking
into account the heterogeneity of the system. We apply our method to three
different systems, namely the set of clusters of orthologous genes (COG) in
completely sequenced genomes [13, 14], a set of daily returns of 500 US
financial stocks, and the set of world movies of the IMDb database [15]. In all
these systems, both different in size and level of heterogeneity, we find that
our method is able to detect network structures which are informative about the
system and are not simply expression of its heterogeneity. Specifically, our
method (i) identifies the preferential relationships between the elements, (ii)
naturally highlights the clustered structure of investigated systems, and (iii)
allows to classify links according to the type of statistically validated
relationships between the connected nodes.Comment: Main text: 13 pages, 3 figures, and 1 Table. Supplementary
information: 15 pages, 3 figures, and 2 Table
Networks in biological systems: An investigation of the Gene Ontology as an evolving network
Many biological systems can be described as networks where different elements interact, in order to perform biological processes. We introduce a network associated with the Gene Ontology. Specifically, we construct a correlation-based
network where the vertices are the terms of the Gene Ontology and the link between each two terms is weighted on the basis of the number of genes that they have in common. We analyze a filtered network obtained from the correlation-based network and we characterize its evolution over different releases of the Gene Ontology
A tool for filtering information in complex systems
We introduce a technique to filter out complex data-sets by extracting a
subgraph of representative links. Such a filtering can be tuned up to any
desired level by controlling the genus of the resulting graph. We show that
this technique is especially suitable for correlation based graphs giving
filtered graphs which preserve the hierarchical organization of the minimum
spanning tree but containing a larger amount of information in their internal
structure. In particular in the case of planar filtered graphs (genus equal to
0) triangular loops and 4 element cliques are formed. The application of this
filtering procedure to 100 stocks in the USA equity markets shows that such
loops and cliques have important and significant relations with the market
structure and properties.Comment: 8 pages, 3 figures, 4 table
- …