73,306 research outputs found

    Automatic offensive language detection from Twitter data using machine learning and feature selection of metadata

    Get PDF
    The popularity of social networks has only increased in recent years. In theory, the use of social media was proposed so we could share our views online, keep in contact with loved ones or share good moments of life. However, the reality is not so perfect, so you have people sharing hate speech-related messages, or using it to bully specific individuals, for instance, or even creating robots where their only goal is to target specific situations or people. Identifying who wrote such text is not easy and there are several possible ways of doing it, such as using natural language processing or machine learning algorithms that can investigate and perform predictions using the metadata associated with it. In this work, we present an initial investigation of which are the best machine learning techniques to detect offensive language in tweets. After an analysis of the current trend in the literature about the recent text classification techniques, we have selected Linear SVM and Naive Bayes algorithms for our initial tests. For the preprocessing of data, we have used different techniques for attribute selection that will be justified in the literature section. After our experiments, we have obtained 92% of accuracy and 95% of recall to detect offensive language with Naive Bayes and 90% of accuracy and 92% of recall with Linear SVM. From our understanding, these results overcome our related literature and are a good indicative of the importance of the data description approach we have used

    A Probabilistic Approach for Assessing the Significance of Contextual Variables in Nonparametric Frontier Models: an Application for Brazilian Banks

    Get PDF
    This article presents an empirical application illustrating the use of a nonparametric frontier model relying on a probabilistic definition of the production frontier. The significance of the variable nonperforming loans in productive efficiency is assessed, for a sample of Brazilian banks, using the concepts of condicional and unconditional efficiency measures, in a context where it is not necessary to impose any particular distribution for the production data. The analysis is robust relative to the assumptions of separability.

    Potentials and limits to generate employment and income by the National Programme for Production and Use of Biodiesel

    Get PDF
    This study analyses the National Programme for Production and Use of Biodiesel launched by the Brazil Federal Government in 2005 as a public policy to generate sustainable employment and income within the context of development of new alternative sources of energy. It also verifies the impact of PNPB on occupation and income rate of farmers participating in the projects of production of biodiesel through field research carried out on 93 family farms participating in projects already implemented in the State of Goiás. The choice of producers was made at random from a list of all producers who had already gone through a complete cycle of production and stretched across 33 municipalities in the second half of 2007. The survey data was obtained through a closed-ended questionnaire which was designed to ascertain: 1) the increase of occupation and income regarding producers participating in the projects, 2) ways of including these farmers into the programme, 3) technical assistance offered to them (according to the guidelines of the programme) and 4) the evaluation of the programme by participating farmers. The SPSS software was used for processing and data analysis. The results show that most of the objectives of the programme, such as generation of occupation and income by family farming, are being achieved.biofuels, biodiesel, family farm, public policy, Agribusiness, Agricultural Finance, Industrial Organization,

    Information-Entropic for Travelling Solitons in Lorentz and CPT Breaking Systems

    Full text link
    In this work we group three research topics apparently disconnected, namely solitons, Lorentz symmetry breaking and entropy. Following a recent work [Phys. Lett. B 713 (2012) 304], we show that it is possible to construct in the context of travelling wave solutions a configurational entropy measure in functional space, from the field configurations. Thus, we investigate the existence and properties of travelling solitons in Lorentz and CPT breaking scenarios for a class of models with two interacting scalar fields. Here, we obtain a complete set of exact solutions for the model studied which display both double and single-kink configurations. In fact, such models are very important in applications that include Bloch branes, Skyrmions, Yang-Mills, Q-balls, oscillons and various superstring-motivated theories. We find that the so-called Configurational Entropy (CE) for travelling solitons, which we name as travelling Configurational Entropy (TCE), shows that the best value of parameter responsible to break the Lorentz symmetry is one where the energy density is distributed equally around the origin. In this way, the information-theoretical measure of travelling solitons in Lorentz symmetry violation scenarios opens a new window to probe situations where the parameters responsible for breaking the symmetries are random. In this case, the TCE selects the best value

    D-Oscillons in the Standard Model-Extension

    Full text link
    In this work we investigate the consequences of the Lorentz symmetry violation on extremely long-living, time-dependent, and spatially localized field configurations, named oscillons. This is accomplished in (D+1D+1) dimensions for two interacting scalar field theories in the so-called Standard Model-Extension context. We show that DD-dimensional scalar field lumps can present a typical size RminRKKR_{\min }\ll R_{KK}, where RKKR_{KK} is the associated length scale of extra dimensions in Kaluza-Klein theories. Here, the size RminR_{\min } is shown to strongly depend on the terms that control the Lorentz violation of the theory. This implies either contraction or dilation of the average radius RminR_{\min}, and a new rule for its composition, likewise. Moreover, we show that the spatial dimensions for existence of oscillating lumps have an upper limit, opening new possibilities to probe the existence of a DD -dimensional oscillons at TeV energy scale. Moreover, in a cosmological scenario with Lorentz symmetry breaking, we argue that in the early Universe with an extremely high energy density and a strong Lorentz violation, the typical size RminR_{\min } was highly dilated. With the expansion and subsequent cooling of the Universe, we propose that it passed through a phase transition towards a Lorentz symmetry, wherein RminR_{\min } tends to be compact.Comment: 8 pages, final version to appear in PR
    corecore