35,185 research outputs found
Mesoscopic Community Structure of Financial Markets Revealed by Price and Sign Fluctuations
The mesoscopic organization of complex systems, from financial markets to the
brain, is an intermediate between the microscopic dynamics of individual units
(stocks or neurons, in the mentioned cases), and the macroscopic dynamics of
the system as a whole. The organization is determined by "communities" of units
whose dynamics, represented by time series of activity, is more strongly
correlated internally than with the rest of the system. Recent studies have
shown that the binary projections of various financial and neural time series
exhibit nontrivial dynamical features that resemble those of the original data.
This implies that a significant piece of information is encoded into the binary
projection (i.e. the sign) of such increments. Here, we explore whether the
binary signatures of multiple time series can replicate the same complex
community organization of the financial market, as the original weighted time
series. We adopt a method that has been specifically designed to detect
communities from cross-correlation matrices of time series data. Our analysis
shows that the simpler binary representation leads to a community structure
that is almost identical with that obtained using the full weighted
representation. These results confirm that binary projections of financial time
series contain significant structural information.Comment: 15 pages, 7 figure
Breast Cancer: Modelling and Detection
This paper reviews a number of the mathematical models used in cancer modelling and then chooses a specific cancer, breast carcinoma, to illustrate how the modelling can be used in aiding detection. We then discuss mathematical models that underpin mammographic image analysis, which complements models of tumour growth and facilitates diagnosis and treatment of cancer. Mammographic images are notoriously difficult to interpret, and we give an overview of the primary image enhancement technologies that have been introduced, before focusing on a more detailed description of some of our own recent work on the use of physics-based modelling in mammography. This theoretical approach to image analysis yields a wealth of information that could be incorporated into the mathematical models, and we conclude by describing how current mathematical models might be enhanced by use of this information, and how these models in turn will help to meet some of the major challenges in cancer detection
On the Feasibility of Transfer-learning Code Smells using Deep Learning
Context: A substantial amount of work has been done to detect smells in
source code using metrics-based and heuristics-based methods. Machine learning
methods have been recently applied to detect source code smells; however, the
current practices are considered far from mature. Objective: First, explore the
feasibility of applying deep learning models to detect smells without extensive
feature engineering, just by feeding the source code in tokenized form. Second,
investigate the possibility of applying transfer-learning in the context of
deep learning models for smell detection. Method: We use existing metric-based
state-of-the-art methods for detecting three implementation smells and one
design smell in C# code. Using these results as the annotated gold standard, we
train smell detection models on three different deep learning architectures.
These architectures use Convolution Neural Networks (CNNs) of one or two
dimensions, or Recurrent Neural Networks (RNNs) as their principal hidden
layers. For the first objective of our study, we perform training and
evaluation on C# samples, whereas for the second objective, we train the models
from C# code and evaluate the models over Java code samples. We perform the
experiments with various combinations of hyper-parameters for each model.
Results: We find it feasible to detect smells using deep learning methods. Our
comparative experiments find that there is no clearly superior method between
CNN-1D and CNN-2D. We also observe that performance of the deep learning models
is smell-specific. Our transfer-learning experiments show that
transfer-learning is definitely feasible for implementation smells with
performance comparable to that of direct-learning. This work opens up a new
paradigm to detect code smells by transfer-learning especially for the
programming languages where the comprehensive code smell detection tools are
not available
Measuring relative opinion from location-based social media: A case study of the 2016 U.S. presidential election
Social media has become an emerging alternative to opinion polls for public
opinion collection, while it is still posing many challenges as a passive data
source, such as structurelessness, quantifiability, and representativeness.
Social media data with geotags provide new opportunities to unveil the
geographic locations of users expressing their opinions. This paper aims to
answer two questions: 1) whether quantifiable measurement of public opinion can
be obtained from social media and 2) whether it can produce better or
complementary measures compared to opinion polls. This research proposes a
novel approach to measure the relative opinion of Twitter users towards public
issues in order to accommodate more complex opinion structures and take
advantage of the geography pertaining to the public issues. To ensure that this
new measure is technically feasible, a modeling framework is developed
including building a training dataset by adopting a state-of-the-art approach
and devising a new deep learning method called Opinion-Oriented Word Embedding.
With a case study of the tweets selected for the 2016 U.S. presidential
election, we demonstrate the predictive superiority of our relative opinion
approach and we show how it can aid visual analytics and support opinion
predictions. Although the relative opinion measure is proved to be more robust
compared to polling, our study also suggests that the former can advantageously
complement the later in opinion prediction
Community detection for correlation matrices
A challenging problem in the study of complex systems is that of resolving,
without prior information, the emergent, mesoscopic organization determined by
groups of units whose dynamical activity is more strongly correlated internally
than with the rest of the system. The existing techniques to filter
correlations are not explicitly oriented towards identifying such modules and
can suffer from an unavoidable information loss. A promising alternative is
that of employing community detection techniques developed in network theory.
Unfortunately, this approach has focused predominantly on replacing network
data with correlation matrices, a procedure that tends to be intrinsically
biased due to its inconsistency with the null hypotheses underlying the
existing algorithms. Here we introduce, via a consistent redefinition of null
models based on random matrix theory, the appropriate correlation-based
counterparts of the most popular community detection techniques. Our methods
can filter out both unit-specific noise and system-wide dependencies, and the
resulting communities are internally correlated and mutually anti-correlated.
We also implement multiresolution and multifrequency approaches revealing
hierarchically nested sub-communities with `hard' cores and `soft' peripheries.
We apply our techniques to several financial time series and identify
mesoscopic groups of stocks which are irreducible to a standard, sectorial
taxonomy, detect `soft stocks' that alternate between communities, and discuss
implications for portfolio optimization and risk management.Comment: Final version, accepted for publication on PR
Resolving Structure in Human Brain Organization: Identifying Mesoscale Organization in Weighted Network Representations
Human brain anatomy and function display a combination of modular and
hierarchical organization, suggesting the importance of both cohesive
structures and variable resolutions in the facilitation of healthy cognitive
processes. However, tools to simultaneously probe these features of brain
architecture require further development. We propose and apply a set of methods
to extract cohesive structures in network representations of brain connectivity
using multi-resolution techniques. We employ a combination of soft
thresholding, windowed thresholding, and resolution in community detection,
that enable us to identify and isolate structures associated with different
weights. One such mesoscale structure is bipartivity, which quantifies the
extent to which the brain is divided into two partitions with high connectivity
between partitions and low connectivity within partitions. A second,
complementary mesoscale structure is modularity, which quantifies the extent to
which the brain is divided into multiple communities with strong connectivity
within each community and weak connectivity between communities. Our methods
lead to multi-resolution curves of these network diagnostics over a range of
spatial, geometric, and structural scales. For statistical comparison, we
contrast our results with those obtained for several benchmark null models. Our
work demonstrates that multi-resolution diagnostic curves capture complex
organizational profiles in weighted graphs. We apply these methods to the
identification of resolution-specific characteristics of healthy weighted graph
architecture and altered connectivity profiles in psychiatric disease.Comment: Comments welcom
Complex networks in climate dynamics - Comparing linear and nonlinear network construction methods
Complex network theory provides a powerful framework to statistically
investigate the topology of local and non-local statistical interrelationships,
i.e. teleconnections, in the climate system. Climate networks constructed from
the same global climatological data set using the linear Pearson correlation
coefficient or the nonlinear mutual information as a measure of dynamical
similarity between regions, are compared systematically on local, mesoscopic
and global topological scales. A high degree of similarity is observed on the
local and mesoscopic topological scales for surface air temperature fields
taken from AOGCM and reanalysis data sets. We find larger differences on the
global scale, particularly in the betweenness centrality field. The global
scale view on climate networks obtained using mutual information offers
promising new perspectives for detecting network structures based on nonlinear
physical processes in the climate system.Comment: 24 pages, 10 figure
- …