7,957 research outputs found
Country corruption analysis with self organizing maps and support vector machines.
During recent years, the empirical research on corruption has grown considerably. Possible links between government corruption and terrorism have attracted an increasing interest in this research field. Most of the existing literature discusses the topic from a socio-economical perspective and only few studies tackle this research field from a data mining point of view. In this paper, we apply data mining techniques onto a cross-country database linking macro-economical variables to perceived levels of corruption. In the first part, self organizing maps are applied to study the interconnections between these variables. Afterwards, support vector machines are trained on part of the data and used to forecast corruption for other countries. Large deviations for specific countries between these models' predictions and the actual values can prove useful for further research. Finally, projection of the forecasts onto a self organizing map allows a detailed comparison between the different models' behavior.cross-country;
A new perspective on the competitiveness of nations
The capability of firms to survive and to have a competitive advantage in global markets depends on, amongst other things, the efficiency of public institutions, the excellence of educational, health and communications infrastructures, as well as on the political and economic stability of their home country. The measurement of competitiveness and strategy development is thus an important issue for policy-makers. Despite many attempts to provide objectivity in the development of measures of national competitiveness, there are inherently subjective judgments that involve, for example, how data sets are aggregated and importance weights are applied. Generally, either equal weighting is assumed in calculating a final index, or subjective weights are specified. The same problem also occurs in the subjective assignment of countries to different clusters. Developed as such, the value of these type indices may be questioned by users. The aim of this paper is to explore methodological transparency as a viable solution to problems created by existing aggregated indices. For this purpose, a methodology composed of three steps is proposed. To start, a hierarchical clustering analysis is used to assign countries to appropriate clusters. In current methods, country clustering is generally based on GDP. However, we suggest that GDP alone is insufficient for purposes of country clustering. In the proposed methodology, 178 criteria are used for this purpose. Next, relationships between the criteria and classification of the countries are determined using artificial neural networks (ANNs). ANN provides an objective method for determining the attribute/criteria weights, which are, for the most part, subjectively specified in existing methods. Finally, in our third step, the countries of interest are ranked based on weights generated in the previous step. Beyond the ranking of countries, the proposed methodology can also be used to identify those attributes that a given country should focus on in order to improve its position relative to other countries, i.e., to transition from its current cluster to the next higher one
SOM-based algorithms for qualitative variables
It is well known that the SOM algorithm achieves a clustering of data which
can be interpreted as an extension of Principal Component Analysis, because of
its topology-preserving property. But the SOM algorithm can only process
real-valued data. In previous papers, we have proposed several methods based on
the SOM algorithm to analyze categorical data, which is the case in survey
data. In this paper, we present these methods in a unified manner. The first
one (Kohonen Multiple Correspondence Analysis, KMCA) deals only with the
modalities, while the two others (Kohonen Multiple Correspondence Analysis with
individuals, KMCA\_ind, Kohonen algorithm on DISJonctive table, KDISJ) can take
into account the individuals, and the modalities simultaneously.Comment: Special Issue apr\`{e}s WSOM 03 \`{a} Kitakiush
COMPLEX-IT: A Case-Based Modelling and Scenario Simulation Platform for Social Inquiry
COMPLEX-IT is a case-based, mixed-methods platform for applied social inquiry into complex data/systems, designed to increase non-expert access to the tools of computational social science (i.e., cluster analysis, artificial intelligence, data visualization, data forecasting, and scenario simulation). In particular, COMPLEX-IT aids applied social inquiry though a heavy emphasis on learning about the complex data/system under study, which it does by (a) identifying and forecasting major and minor clusters/trends; (b) visualizing their complex causality; and (c) simulating scenarios for potential interventions. COMPLEX-IT is accessible through the web or can be run locally and is powered by R and the Shiny web framework
Contemporary disaster management framework quantification of flood risk in rural Lower Shire Valley, Malawi
Despite floods and droughts accounting for 80% and 70% disaster related deaths and
economic loss respectively in Sub-Saharan Africa (SSA), there have been very few
attempts in SSA to quantify flood-related vulnerability and risk, especially as they relate
to the rural poor. This thesis quantifies and profiles the flood risk of rural communities
in SSA focusing on the Lower Shire Valley, Malawi. Given the challenge of hydrometeorological
data quality in SSA to support quantitative flood risk assessments, the
work first reconstructs and extends hydro-meteorological data using Artificial Neural
Networks (ANNs). These data then formed the input to a coupled IPCC-Sustainable
Development Frameworks for quantifying flood vulnerability and risk. Flood risk was
obtained by integrating hazard and vulnerability. Flood hazard was characterised in
terms of flood depth and inundation area obtained through hydraulic modelling of the
catchment with Lisflood-FP, while the vulnerability was indexed through analysis of
exposure, susceptibility and capacity and linked to social, economic, environmental and
physical perspectives. Data on these were collected through structured interviews
carried out with the communities and stakeholders in the valley and later analysed. The
implementation of the entire analysis within a GIS environment enabled the
visualisation of spatial variability in flood risk in the valley. The results show
predominantly medium levels in hazardousness, vulnerability and risk. The
vulnerability is dominated by a high to very high susceptibility component largely
because of the high to very high socio-economic and environmental vulnerability.
Economic and physical capacities tend to be predominantly low but social capacity is
significantly high, resulting in overall medium levels of capacity-induced vulnerability.
Exposure manifests as medium. Both the vulnerability and risk showed marginal spatial
variability. Given all this, the thesis argues for the need to mainstream disaster reduction
in the rather plethoric conventional socio-economic developmental programmes in SSA.
Additionally, the low spatial variability in both the risk and vulnerability in the valley
suggests that any such interventions need to be valley-wide to be effective
- …