Automatic offensive language detection from Twitter data using machine learning and feature selection of metadata
The popularity of social networks has only increased
in recent years. In theory, social media was proposed
so that we could share our views online, keep in contact with loved
ones, or share good moments of life. However, the reality is
less ideal: people share hate speech-related
messages, use these platforms to bully specific individuals,
or even create bots whose only goal is to target specific
situations or people. Identifying who wrote such text is not easy,
and there are several possible ways of doing it, such as using
natural language processing or machine learning algorithms
that can investigate and make predictions using the metadata
associated with it. In this work, we present an initial
investigation of which machine learning techniques are best suited
to detecting offensive language in tweets. After an analysis of
current trends in the literature on recent text classification
techniques, we selected the Linear SVM and Naive Bayes
algorithms for our initial tests. For data preprocessing,
we used different attribute selection techniques, which
are justified in the literature section. In our experiments,
we obtained 92% accuracy and 95% recall in detecting
offensive language with Naive Bayes, and 90% accuracy and
92% recall with Linear SVM. To our understanding, these
results surpass those in the related literature and are a good indication
of the importance of the data description approach we have used.
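As a reading aid for the accuracy and recall figures quoted in this abstract, the following minimal sketch shows how the two metrics are computed for a binary offensive/non-offensive classifier. The function name and the toy labels are hypothetical and are not taken from the paper; the paper's own data and pipeline are not reproduced here.

```python
def accuracy_and_recall(y_true, y_pred, positive=1):
    """Return (accuracy, recall) for binary labels.

    `positive` is the label of the class of interest (here: offensive).
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("label lists must be non-empty and of equal length")
    pairs = list(zip(y_true, y_pred))
    # Accuracy: fraction of all examples that were labelled correctly.
    correct = sum(t == p for t, p in pairs)
    # Recall: fraction of truly positive examples that were detected.
    true_pos = sum(t == positive and p == positive for t, p in pairs)
    actual_pos = sum(t == positive for t in y_true)
    accuracy = correct / len(y_true)
    recall = true_pos / actual_pos if actual_pos else 0.0
    return accuracy, recall


# Hypothetical toy labels: 1 = offensive, 0 = not offensive.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 1, 0, 0, 0, 1, 0, 1]
acc, rec = accuracy_and_recall(y_true, y_pred)
print(f"accuracy={acc:.2f} recall={rec:.2f}")
```

A recall higher than the accuracy, as reported for both classifiers, means that a larger share of the offensive tweets was caught than the share of all tweets labelled correctly.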
A Probabilistic Approach for Assessing the Significance of Contextual Variables in Nonparametric Frontier Models: an Application for Brazilian Banks
This article presents an empirical application illustrating the use of a nonparametric frontier model relying on a probabilistic definition of the production frontier. The significance of the variable nonperforming loans in productive efficiency is assessed, for a sample of Brazilian banks, using the concepts of conditional and unconditional efficiency measures, in a context where it is not necessary to impose any particular distribution for the production data. The analysis is robust relative to the assumptions of separability.
Potentials and limits to generate employment and income by the National Programme for Production and Use of Biodiesel
This study analyses the National Programme for Production and Use of Biodiesel (PNPB), launched by the Brazilian Federal Government in 2005 as a public policy to generate sustainable employment and income within the context of developing new alternative sources of energy. It also verifies the impact of the PNPB on the occupation and income of farmers participating in biodiesel production projects, through field research carried out on 93 family farms participating in projects already implemented in the State of Goiás. The producers were chosen at random from a list of all producers who had already gone through a complete cycle of production, spread across 33 municipalities, in the second half of 2007. The survey data were obtained through a closed-ended questionnaire designed to ascertain: 1) the increase in occupation and income of producers participating in the projects, 2) the ways these farmers were included in the programme, 3) the technical assistance offered to them (according to the guidelines of the programme) and 4) the evaluation of the programme by participating farmers. The SPSS software was used for data processing and analysis. The results show that most of the objectives of the programme, such as the generation of occupation and income through family farming, are being achieved.
Keywords: biofuels, biodiesel, family farm, public policy, Agribusiness, Agricultural Finance, Industrial Organization
Information-Entropic for Travelling Solitons in Lorentz and CPT Breaking Systems
In this work we group three research topics apparently disconnected, namely
solitons, Lorentz symmetry breaking and entropy. Following a recent work [Phys.
Lett. B 713 (2012) 304], we show that it is possible to construct in the
context of travelling wave solutions a configurational entropy measure in
functional space, from the field configurations. Thus, we investigate the
existence and properties of travelling solitons in Lorentz and CPT breaking
scenarios for a class of models with two interacting scalar fields. Here, we
obtain a complete set of exact solutions for the model studied which display
both double and single-kink configurations. In fact, such models are very
important in applications that include Bloch branes, Skyrmions, Yang-Mills,
Q-balls, oscillons and various superstring-motivated theories. We find that the
so-called Configurational Entropy (CE) for travelling solitons, which we call
the travelling Configurational Entropy (TCE), shows that the best value of
the parameter responsible for breaking the Lorentz symmetry is the one where the energy
density is distributed equally around the origin. In this way, the
information-theoretical measure of travelling solitons in Lorentz symmetry
violation scenarios opens a new window to probe situations where the parameters
responsible for breaking the symmetries are random. In this case, the TCE
selects the best value
D-Oscillons in the Standard Model-Extension
In this work we investigate the consequences of the Lorentz symmetry
violation on extremely long-living, time-dependent, and spatially localized
field configurations, named oscillons. This is accomplished in $(D+1)$
dimensions for two interacting scalar field theories in the so-called Standard
Model-Extension context. We show that $D$-dimensional scalar field lumps can
present a typical size $R_{\min} \ll R_{KK}$, where $R_{KK}$ is the associated
length scale of extra dimensions in Kaluza-Klein theories. Here, the size
$R_{\min}$ is shown to depend strongly on the terms that control the Lorentz
violation of the theory. This implies either contraction or dilation of the
average radius $R_{\min}$, and likewise a new rule for its composition.
Moreover, we show that the spatial dimensions for existence of oscillating
lumps have an upper limit, opening new possibilities to probe the existence of
$D$-dimensional oscillons at the TeV energy scale. Moreover, in a cosmological
scenario with Lorentz symmetry breaking, we argue that in the early Universe
with an extremely high energy density and a strong Lorentz violation, the
typical size was highly dilated. With the expansion and subsequent
cooling of the Universe, we propose that it passed through a phase transition
towards a Lorentz symmetry, wherein the typical size tends to be compact.
Comment: 8 pages, final version to appear in PR