2,197 research outputs found
Coalitions’ Weights in a Dispersed System with Pawlak Conflict Model
The article addresses the issues related to making decisions by an ensemble of classifiers.
Classifiers are built based on local tables, the set of local tables is called a
dispersed knowledge. The paper discusses a novel application of Pawlak analysis
model to examine the relations between classifiers and to create coalitions of classifiers.
Each coalition has access to some aggregated knowledge on the basis of which
joint decisions are made. Various types of coalitions are formed—a strong coalitions
consisting of a large number and significant classifiers, and a weak coalitions consisting
of insignificant classifiers. The new contributions of the paper is a systematical
investigation of the weights of coalitions that influence the final decision. Four
different method of calculating the strength of the coalitions have been applied. Each
of these methods consider another aspect of the structure of the coalitions. Generally,
it has been experimentally confirmed that, for a method that correctly identifies
the relations between base classifiers, the use of coalitions weights improves the
quality of classification. More specifically, it has been statistically confirmed that the
best results are generated by the weighting method that is based on the size of the
coalitions and the method based on the unambiguous of the decisions
Combination of linear classifiers using score function -- analysis of possible combination strategies
In this work, we addressed the issue of combining linear classifiers using
their score functions. The value of the scoring function depends on the
distance from the decision boundary. Two score functions have been tested and
four different combination strategies were investigated. During the
experimental study, the proposed approach was applied to the heterogeneous
ensemble and it was compared to two reference methods -- majority voting and
model averaging respectively. The comparison was made in terms of seven
different quality criteria. The result shows that combination strategies based
on simple average, and trimmed average are the best combination strategies of
the geometrical combination
Using the ISO/IEC 9126 product quality model to classify defects : a Controlled Experiment
Background: Existing software defect classification schemes support multiple tasks, such as root cause analysis and process improvement guidance. However, existing schemes do not assist in assigning defects to a broad range of high level software goals, such as software quality characteristics like functionality, maintainability, and usability. Aim: We investigate whether a classification based on the ISO/IEC 9126 software product quality model is reliable and useful to link defects to quality aspects impacted. Method: Six different subjects, divided in two groups with respect to their expertise, classified 78 defects from an industrial web application using the ISO/IEC 9126 quality main characteristics and sub-characteristics, and a set of proposed extended guidelines. Results: The ISO/IEC 9126 model is reasonably reliable when used to classify defects, even using incomplete defect reports. Reliability and variability is better for the six high level main characteristics of the model than for the 22 sub- characteristics. Conclusions: The ISO/IEC 9126 software quality model provides a solid foundation for defect classification. We also recommend, based on the follow up qualitative analysis performed, to use more complete defect reports and tailor the quality model to the context of us
Argumentation dialogues in web-based GDSS: an approach using machine learning techniques
Tese de doutoramento em InformaticsA tomada de decisão está presente no dia a dia de qualquer pessoa, mesmo que muitas vezes ela
não tenha consciência disso. As decisões podem estar relacionadas com problemas quotidianos, ou
podem estar relacionadas com questões mais complexas, como é o caso das questões organizacionais.
Normalmente, no contexto organizacional, as decisões são tomadas em grupo.
Os Sistemas de Apoio à Decisão em Grupo têm sido estudados ao longo das últimas décadas com o
objetivo de melhorar o apoio prestado aos decisores nas mais diversas situações e/ou problemas a resolver.
Existem duas abordagens principais à implementação de Sistemas de Apoio à Decisão em Grupo:
a abordagem clássica, baseada na agregação matemática das preferências dos diferentes elementos do
grupo e as abordagens baseadas na negociação automática (e.g. Teoria dos Jogos, Argumentação, entre
outras).
Os atuais Sistemas de Apoio à Decisão em Grupo baseados em argumentação podem gerar uma
enorme quantidade de dados. O objetivo deste trabalho de investigação é estudar e desenvolver modelos
utilizando técnicas de aprendizagem automática para extrair conhecimento dos diálogos argumentativos
realizados pelos decisores, mais concretamente, pretende-se criar modelos para analisar, classificar e
processar esses dados, potencializando a geração de novo conhecimento que será utilizado tanto por
agentes inteligentes, como por decisiores reais. Promovendo desta forma a obtenção de consenso entre
os membros do grupo. Com base no estudo da literatura e nos desafios em aberto neste domĂnio,
formulou-se a seguinte hipĂłtese de investigação - É possĂvel usar tĂ©cnicas de aprendizagem automática
para apoiar diálogos argumentativos em Sistemas de Apoio à Decisão em Grupo baseados na web.
No âmbito dos trabalhos desenvolvidos, foram aplicados algoritmos de classificação supervisionados
a um conjunto de dados contendo argumentos extraĂdos de debates online, criando um classificador
de frases argumentativas que pode classificar automaticamente (A favor/Contra) frases argumentativas
trocadas no contexto da tomada de decisão. Foi desenvolvido um modelo de clustering dinâmico para
organizar as conversas com base nos argumentos utilizados. Além disso, foi proposto um Sistema de
Apoio Ă DecisĂŁo em Grupo baseado na web que possibilita apoiar grupos de decisores independentemente
de sua localização geográfica. O sistema permite a criação de problemas multicritério e a configuração
das preferências, intenções e interesses de cada decisor. Este sistema de apoio à decisão baseado na
web inclui os dashboards de relatórios inteligentes que são gerados através dos resultados dos trabalhos
alcançados pelos modelos anteriores já referidos. A concretização de cada um dos objetivos permitiu
validar as questões de investigação identificadas e assim responder positivamente à hipótese definida.Decision-making is present in anyone’s daily life, even if they are often unaware of it. Decisions can be
related to everyday problems, or they can be related to more complex issues, such as organizational
issues. Normally, in the organizational context, decisions are made in groups.
Group Decision Support Systems have been studied over the past decades with the aim of improving
the support provided to decision-makers in the most diverse situations and/or problems to be solved.
There are two main approaches to implementing Group Decision Support Systems: the classical approach,
based on the mathematical aggregation of the preferences of the different elements of the group, and the
approaches based on automatic negotiation (e.g. Game Theory, Argumentation, among others).
Current argumentation-based Group Decision Support Systems can generate an enormous amount
of data. The objective of this research work is to study and develop models using automatic learning techniques
to extract knowledge from argumentative dialogues carried out by decision-makers, more specifically,
it is intended to create models to analyze, classify and process these data, enhancing the generation
of new knowledge that will be used both by intelligent agents and by real decision-makers. Promoting in
this way the achievement of consensus among the members of the group. Based on the literature study
and the open challenges in this domain, the following research hypothesis was formulated - It is possible
to use machine learning techniques to support argumentative dialogues in web-based Group Decision
Support Systems.
As part of the work developed, supervised classification algorithms were applied to a data set containing
arguments extracted from online debates, creating an argumentative sentence classifier that can
automatically classify (For/Against) argumentative sentences exchanged in the context of decision-making.
A dynamic clustering model was developed to organize conversations based on the arguments used. In
addition, a web-based Group Decision Support System was proposed that makes it possible to support
groups of decision-makers regardless of their geographic location. The system allows the creation of multicriteria
problems and the configuration of preferences, intentions, and interests of each decision-maker.
This web-based decision support system includes dashboards of intelligent reports that are generated
through the results of the work achieved by the previous models already mentioned. The achievement of
each objective allowed validation of the identified research questions and thus responded positively to the
defined hypothesis.I also thank to Fundação para a Ciência e a Tecnologia, for the Ph.D. grant funding with the reference: SFRH/BD/137150/2018
Are coalitions needed when classifiers make decisions?
Cooperation and coalitions’ formation are usually the preferred behavior when conflict situation occurs in real life. The question arises: is this approach should also be used when an ensemble of classifiers makes decisions? In this paper different approaches to classification based on dispersed knowledge are analysed and compared. The first group of approaches does not generate coalitions. Each local classifier generate a classification vector based on the local table, and then one of the most popular fusion methods is used (the sum method or the maximum method). In addition, the approach in which the final classification is made by the strongest classifier is analysed. The second group of approaches uses a coalitions creating method. The final classification is generated based on the coalitions’ predictions by using the two, mentioned above, fusion methods. In addition, the approach is analysed in which the final classification is made by the strongest coalition. For both groups of approaches, with and without coalitions, methods based on the maximum correlation and methods based on the covering rules are considered. The main conclusion that is made in this article is as follows. When classifiers generate fair and rational classification vectors, it is better to consider a coalition-based approach and the fusion method that collectively takes into account all vectors generated by classifiers
Combination of linear classifiers using score function -- analysis of possible combination strategies
In this work, we addressed the issue of combining linear classifiers using
their score functions. The value of the scoring function depends on the
distance from the decision boundary. Two score functions have been tested and
four different combination strategies were investigated. During the
experimental study, the proposed approach was applied to the heterogeneous
ensemble and it was compared to two reference methods -- majority voting and
model averaging respectively. The comparison was made in terms of seven
different quality criteria. The result shows that combination strategies based
on simple average, and trimmed average are the best combination strategies of
the geometrical combination
Social Media for Cities, Counties and Communities
Social media (i.e., Twitter, Facebook, Flickr, YouTube) and other tools and services with user- generated content have made a staggering amount of information (and misinformation) available. Some government officials seek to leverage these resources to improve services and communication with citizens, especially during crises and emergencies. Yet, the sheer volume of social data streams generates substantial noise that must be filtered. Potential exists to rapidly identify issues of concern for emergency management by detecting meaningful patterns or trends in the stream of messages and information flow. Similarly, monitoring these patterns and themes over time could provide officials with insights into the perceptions and mood of the community that cannot be collected through traditional methods (e.g., phone or mail surveys) due to their substantive costs, especially in light of reduced and shrinking budgets of governments at all levels. We conducted a pilot study in 2010 with government officials in Arlington, Virginia (and to a lesser extent representatives of groups from Alexandria and Fairfax, Virginia) with a view to contributing to a general understanding of the use of social media by government officials as well as community organizations, businesses and the public. We were especially interested in gaining greater insight into social media use in crisis situations (whether severe or fairly routine crises, such as traffic or weather disruptions)
Cascade ligand- and structure-based virtual screening to identify new trypanocidal compounds inhibiting putrescine uptake
Chagas disease is a neglected tropical disease endemic to Latin America, though migratory movements have recently spread it to other regions. Here, we have applied a cascade virtual screening campaign combining ligand- and structure-based methods. In order to find novel inhibitors of putrescine uptake in Trypanosoma cruzi, an ensemble of linear ligand-based classifiers obtained by has been applied as initial screening filter, followed by docking into a homology model of the putrescine permease TcPAT12. 1,000 individual linear classifiers were inferred from a balanced dataset. Subsequently, different schemes were tested to combine the individual classifiers: MIN operator, average ranking, average score, average voting, with MIN operator leading to the best performance. The homology model was based on the arginine/agmatine antiporter (AdiC) from Escherichia coli as template. It showed 64% coverage of the entire query sequence and it was selected based on the normalized Discrete Optimized Protein Energy parameter and the GA341 score. The modeled structure had 96% in the allowed area of Ramachandran's plot, and none of the residues located in non-allowed regions were involved in the active site of the transporter. Positivity Predictive Value surfaces were applied to optimize the score thresholds to be used in the ligand-based virtual screening step: for that purpose Positivity Predictive Value was charted as a function of putative yields of active in the range 0.001-0.010 and the Se/Sp ratio. With a focus on drug repositioning opportunities, DrugBank and Sweetlead databases were subjected to screening. Among 8 hits, cinnarizine, a drug frequently prescribed for motion sickness and balance disorder, was tested against T. cruzi epimastigotes and amastigotes, confirming its trypanocidal effects and its inhibitory effects on putrescine uptake. Furthermore, clofazimine, an antibiotic with already proven trypanocidal effects, also displayed inhibitory effects on putrescine uptake. Two other hits, meclizine and butoconazole, also displayed trypanocidal effects (in the case of meclizine, against both epimastigotes and amastigotes), without inhibiting putrescine uptake.Facultad de Ciencias ExactasLaboratorio de InvestigaciĂłn y Desarrollo de Bioactivo
- …