Search CORE

15 research outputs found

SICK-BR: A Portuguese Corpus for Inference

Author: Albiero Beatriz
C. S. Câmara Igor
de Oliveira Lima Guilherme
de Paiva Valeria
Guide Bruno
Real Livy
Rodrigues Ana
Silva Cindy
Souza Rodrigo
Stanojevic Milos
Thalenberg Bruna
Vieira e Silva Andressa
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 26/08/2018
Field of study

Crossref

Edinburgh Research Explorer

Computational approach for the matter of stress in Brazilian Portuguese

Author: Guide Bruno Ferrari
Publication venue: 'Universidade de Sao Paulo, Agencia USP de Gestao da Informacao Academica (AGUIA)'
Publication date: 31/08/2016
Field of study

O objetivo central do projeto foi investigar a questão do acento no português brasileiro por meio do uso de ferramentas computacionais, a fim de encontrar possíveis relações entre traços segmentais, prosódicos ou morfológicos com o acento. Tal análise foi realizada a partir do estudo crítico das principais soluções propostas para a questão advindas da Fonologia Teórica. Isso foi considerado o primeiro passo para desenvolver uma abordagem que traga inovação para a área. A discussão teórica foi concluída com a implementação de algoritmos que representam modelizações das propostas para o tratamento da questão do acento. Estas foram, posteriormente, testadas em corpora relevantes do português com o objetivo de analisar tanto os casos considerados como padrão pelas propostas, quanto aqueles que são considerados exceções ao comportamento do idioma. Simultaneamente, foi desenvolvido um corpus anotado de palavras acentuadas do português brasileiro, a partir do qual foram implementados os dois grupos de modelos de natureza probabilística que formam o quadro de abordagens desenhado pelo projeto. O primeiro grupo se baseia na noção de N-gramas, em que a atribuição de acento a uma palavra ocorre a partir da probabilidade das cadeias de tamanho \" que a compõem, configurando-se, assim, um modelo que enxerga padrões simples de coocorrência e que é computacionalmente eficiente. O segundo grupo de modelos foi chamado de classificador bayesiano ingênuo, que é uma abordagem probabilística mais sofisticada e exigente em termos de corpus e que leva em consideração um vetor de traços a serem definidos para, no caso, atribuir o acento de uma palavra. Esses traços englobaram tanto características morfológicas, quanto prosódicas e segmentais das palavras.The main goal of this project was to provide insight into the behavior of stress patterns of Brazilian Portuguese using computational tools in order to find eventual relationships between segmental, prosodic or morphologic features and word stress. Such analysis was based on a critical reading of some of the main proposals from theoretical phonology regarding the matter. This was considered the first step towards an innovative approach for this field of research. Such discussion was concluded by implementing algorithms representing models of the theoretical proposals for treating the behavior of stress. Afterward, those solutions were tested in relevant corpora of Portuguese aiming to analyze both the words which fell inside what was considered standard and the words that should be considered exceptions to the typical behavior in the language. Simultaneously, a noted corpus of Brazilian Portuguese words was compiled, from which were implemented both groups of models that have probabilistic nature that completes the frame of approaches drawn from this project. The first group is composed of models based on the notion of N-grams, in which the attribution of stress to a word happens based on the probability attributed to the `n\' sized chains that compose this word, which results in a model that is sensitive to patterns of co-occurrence and computationally efficient. The second group of models is called Naive Bayes Classifier, which is a more sophisticated probabilistic approach that is more corpus demanding, this approach takes into account a vector of features that was defined in order to attribute stress to a word. Those features were morphological, prosodic and segmental characteristics of the words

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Biblioteca Digital de Teses e Dissertações

Automatic punitivist hate speech detection in social media

Author: Guide Bruno Ferrari
Publication venue: 'Universidade de Sao Paulo, Agencia USP de Gestao da Informacao Academica (AGUIA)'
Publication date: 08/12/2022
Field of study

O propósito deste trabalho é investigar a detecção automática do discurso de ódio punitivista em redes sociais. Para tanto, revisa a literatura sobre a tarefa de detecção automática de discurso de ódio em geral, traz a contextualização social e histórica sobre o que é o discurso de ódio punitivista e, a partir daí, passa por compilar um corpus de postagens de redes sociais, nomeado de Corpus de Discurso de Ódio Punitivista -- DOP -- para testar modelos de aprendizado de máquina dedicados a classificar textos como contendo discurso de ódio. Os modelos selecionados estão entre os mais utilizados nas tarefas de aprendizado de máquina e foram organizadas grades de hiperparâmetros para testar distintas configurações de cada modelo, a fim de gerar uma ampla gama de resultados, que são também comparados com os obtidos por um modelo genérico de detecção baseado em redes transformadores. Os resultados obtidos mostram que esse tipo de discurso de ódio tem comportamento similar ao de outros tipos mais estudados. Alguns modelos de aprendizado de máquina performam bem na tarefa de detecção automática. Os melhores resultados foram obtidos com o modelo de reforço extremo de gradiente (XGB), cuja métrica F1 obtida foi de o,76, contra o baseline de um modelo BERT específico para discurso de ódio em português, cuja métrica F1 foi de 0,49. Além disso, foi possível extrair algumas observações qualitativas sobre o fenômeno observado, que possibilitaram esboçar uma tipologia e alguns argumentos base do discurso de ódio punitivista. Dentro do campo da detecção automática de discurso de ódio, o fenômeno do ódio punitivista ainda não foi especificamente investigado. Além disso, ainda são poucos os trabalhos em português brasileiro sobre detecção automática de discurso de ódio em geral, especialmente dentro do ambiente das redes sociais. Apesar disso, dados de redes sociais são abundantes e cada vez mais o ambiente das redes se torna um espaço inevitável de socialização, ressaltando a importância de poder monitorar, identificar e alertar sobre comportamentos que estimulem o ódio e a violência, de forma que a tarefa de detecção automática de discurso de ódio constitui-se em uma ferramenta importante para o combate da disseminação de conteúdos tóxicos e agressivos.The purpose of this work is to investigate the automatic detection of punitivist hate speech in social media, therefore, it reviews the literature on the task of automatic detection of hate speech in general, brings the social and historical context about what is punitivist hate speech and then goes through compiling a corpus of social media posts, named Punitivist Hate Speech Corpus - Corpus DOP - to test machine learning models dedicated to classify texts as containing hate speech. The selected models are among the most used in machine learning tasks, and hyperparameter grids are organized to test different configurations of each model, in order to generate a wide range of results, which are also compared with those obtained by a generic detection model based on a transformer network. The results obtained show that this type of hate speech has a behavior similar to that of other more studied types and that some machine learning models perform well in the automatic detection task. The best results were obtained with the extreme gradient boost model (XGB), whose F1 metric obtained was 0.76, against the baseline of a specific BERT model for hate speech in Portuguese, whose F1 metric was 0.49. In addition, it was possible to extract some qualitative observations about the observed phenomenon, which made it possible to outline a typology and some basic arguments for punitivist hate speech. Within the field of automatic detection of hate speech, the phenomenon of punitivist hate has not yet been specifically investigated. In addition, there are still few works in Brazilian Portuguese on automatic detection of hate speech in general, especially within the social media environment. Despite this, data from social media is abundant and the network environment is increasingly becoming an inevitable space for socialization, highlighting the importance of being able to monitor, identify and alert about behaviors that encourage hatred and violence, so that the task automatic detection of hate speech constitutes an important tool to combat the dissemination of toxic and aggressive content

Biblioteca Digital de Teses e Dissertações

Computational approach for the matter of stress in Brazilian Portuguese

Author: Bruno Ferrari Guide
Publication venue: 'American Psychological Association (APA)'
Publication date: 31/08/2016
Field of study

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Corpus ABG

Author: Aline de Lima Benevides
Bruno Ferrari Guide
Publication venue: 'Faculdade de Letras da UFMG'
Publication date: 01/06/2017
Field of study

RESUMO:Este artigo apresenta a metodologia empregada na compilação de um corpus linguístico do Português Brasileiro, o qual foi denominado de Corpus ABG, e no desenvolvimento de algumas ferramentas computacionais. O objetivo deste trabalho é reunir uma grande quantidade de textos, escritos e orais, que possa representar o falar brasileiro a fim de ser fonte de extração de dados fonológicos quantificados para duas pesquisas, a saber, Guide (2016) e Benevides (2017). O corpus contabiliza 3.616.625 ocorrências de palavras e 92.602 tipos de palavras, sendo que 1.938.805 ocorrências são provenientes dos corpora de fala e 1.676.820 ocorrências dos corpora escritos. Ancorado na metodologia da Linguística de Corpus e por meio de ferramentas computacionais desenvolvidas em Linguagem Python, o presente artigo divulga e disponibiliza à comunidade científica o Corpus ABG, as ferramentas computacionais (acentuador, categorizador de estruturas fonológicas, silabificador) e algumas informações fonológicas (acentuais e silábicas) já extraídas do corpus. Além disso, faz um convite a novas explorações dos dados a todos os pesquisadores que tiverem interesse. ABSTRACT:The present paper presents the task of compiling a linguistic corpus of Brazilian Portuguese, which was undertaken by the authors. It is called ABG Corpus, and this article is also about the computational tools developed for the task. Our main goal is to reunite a large amount of texts, both from spoken and written language to, in the best way possible, represent the Brazilian language in a way that we could use it as a database for our researches, Guide (2016) and Benevides (2017). The ABG corpus has 3.616.625 word tokens and 92.602 types of words, being that 1.938.805 of those tokens are from spoken language corpora and 1.676.820 tokens come from written corpora. Based on the corpus linguistics framework and through the use of computational tools developed using Python, this article shows and provides access to the ABG Corpus, the computational tools (stress marker, phonological structure identifier, syllabifier), as well as some phonological information (stress and syllable related), already present on the corpus. We end by inviting the community to further expand our findings and explore this new tool

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Directory of Open Access Journals

Viability of remanufacturing practice: a strategic decision making framework for Chinese auto-parts companies

Author: Amighini
Atasu
Atasu
Barker
Bellman
Bras
Bruno
Carpenter
Chang Liu
Chapman
Charan
Chen
Chen
Chengqi Shu
Chung
Copenhagen Economics
Curtis
Dowlatshahi
Dyer
Eisenhardt
Ferrer
Ferrer
Ferrer
Flapper
Fleischmann
Franke
Geyer
Giannetti
Guide
Guide
Guide
Guide
Hambrick
Hammond
Hauser
Hazen
Ijomah
Jayaraman
Junior
Kapetanopoulou
Kleber
Kutta
Lai
Martin
Michaud
Miles
Mitra
Muhammad Dan-Asabe Abdulrahman
Mukherjee
Nachiappan Subramanian
Narasimhan
Pagell
Parlikad
Parlikad
Parlikad
Peng
Pohekar
PwC
Rahman
Ramanathan
Rogers
Saaty
Saaty
Sandvall
Sarkis
Schmitt
Seuring
Srivastava
Steinhilper
Stock
Stuart
Subramanian
Subramoniam
Subramoniam
Subramoniam
Toffel
Um
USITC
Voss
Wang
Webster
Willis
Wu
Wu
Xiang
Zahedi
Zhang
Publication venue: 'Elsevier BV'
Publication date: 01/01/2015
Field of study

Remanufacturing is a sustainable and proven profitable practice in the western world. Research on remanufacturing practices is relatively unexploited in China, despite being the “global factory” and both the world's largest automobile manufacturer and vehicle market. The increasing amount of automotive output and End-of-Life vehicles (ELVs) in China provides Chinese auto-parts companies with significant potential for environmentally conscious manufacturing and product recovery. Using case studies, we have investigated the status of remanufacturing practices, key determinants for strategic decision making to remanufacture in-house, outsource remanufacturing and/or not to engage in remanufacturing in Chinese auto parts firms using an analytical hierarchy process (AHP). This study suggests that Chinese firms are keen to adopt remanufacturing practice in-house compared to outsourcing despite a lack of technical and managerial capabilities

Nottingham ePrints

Nottingham eTheses

Crossref

Repository@Nottingham

RMIT Research Repository

Sussex Research Online

Dystrophin proteolysis: a potential target for MMP-2 and its prevention by ischemic preconditioning

Author: Ali MA
Bruno Buchholz
Committee for the Update of the Guide for the Care and Use of Laboratory Animals
Ehmsen J
Gabriela Berg
Manuel Rodríguez
Martín Donato
Nadezda Siachoque
Ricardo J. Gelpi
Verónica Miksztowicz
Virginia Perez
Publication venue: 'American Physiological Society'
Publication date
Field of study

Crossref

Small Mission Design for Testing In-Orbit an Electrodynamic Tether Deorbiting System

Author: Bruno C.
Carroll J. A.
Dnepr User's Guide
Dobrowolny M.
Dobrowolny M.
Dobrowolny M.
Forward R. L.
Forward R. L.
Hoyt R. P.
L. Iess
L. Somenzi
Licata R.
McCoy J. E.
P. Tortora
Peláez J.
Peláez J.
Peláez J.
Purdy W.
R. Licata
Smith H. F.
Publication venue: 'American Institute of Aeronautics and Astronautics (AIAA)'
Publication date
Field of study

Crossref

Efficiency evaluation of a ductless Archimedes turbine: Laboratory experiments and numerical simulations

Author: Alessandro Brunori
Andersson
ANSYS Fluent Tutorial Guide
Assembly
Beran
Betz
Betz
Bruno Brunori
Chamorro
Fernandes
Fernandes
Fernando Fattore
Gianluca Zitti
Golecha
Gorban
Guney
Hao
Khan
Khan
Khan
Kumar
Kumar
Kusakana
Kusakana
Lei
Lei
Liu
Liu
Maurizio Brocchini
Menter
Menter
Okot
Price
Ragheb
Rostami
Schleicher
Schleicher
Stergiopoulou
Stergiopoulou
Stergiopoulou
Talukdar
Vermaak
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

Validation of registration techniques applied to XRD signals for stress evaluations in titanium alloys

Author: B Beaubier
B. Voillot
BD Cullity
C Leyens
E Aeby-Gautier
F Hild
F Hild
F Hild
F Lefebvre
F. Hild
G Bruno
G Lütjering
H Leclerc
IC Noyan
ISO/IEC guide 99-12:2007
J Réthoré
J Réthoré
J-E Dufour
J-E Dufour
J.-L. Lebrun
M Bertin
M Hill
M Prime
MA Sutton
N Guillemot
P-J Withers
R Boyer
R. Billardon
S Djaziri
S Djaziri
S Freour
S Fréour
S Roux
V Hauk
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

To estimate stresses near specimen surfaces, X-ray diffraction (XRD) is applied to titanium alloys. Some of these alloys are difficult to study since they are composed of various phases of different proportions, shapes and scales. For millimetric probed volumes, such multi-phase microstructures induce shallow and noisy diffraction signals. Two peak registration techniques are introduced and validated thanks to tensile tests performed on two titanium alloy samples

Crossref

Oskar Bordeaux