11,611 research outputs found

Toxic comment classification using convolutional and recurrent neural networks

Author: Blanes Martin Victor
Publication venue: Universitat Politècnica de Catalunya
Publication date: 01/06/2018

This thesis aims to provide a reasonable solution for automatically categorizing sentences into types of toxicity using different types of neural networks. There are six categories: toxic, severe toxic, obscene, threat, insult and identity hate. Three implementations were studied to accomplish this objective: LSTM (Long Short-Term Memory), GRU (Gated Recurrent Unit) and convolutional neural networks. The thesis does not aim to maximize the performance of each individual model but to compare the three models, under the same parameters, in terms of their suitability for this natural language processing task. In addition, a distinguishing aspect of this project is the study of LSTM neuron activations, and thus of the relationship between individual words and the final classification of the sentence. In conclusion, the three models performed almost equally, and the extraction of LSTM activations provided a very accurate and visual understanding of the decisions taken by the network.

UPCommons. Portal del coneixement obert de la UPC

Cyclotomic and simplicial matroids

Author: Martin Jeremy
Reiner Victor
Publication date: 01/01/2004

Two naturally occurring matroids representable over $\mathbb{Q}$ are shown to be dual: the cyclotomic matroid $\mu_n$ represented by the $n^{th}$ roots of unity $1,\zeta,\zeta^2,\ldots,\zeta^{n-1}$ inside the cyclotomic extension $\mathbb{Q}(\zeta)$, and a direct sum of copies of a certain simplicial matroid, considered originally by Bolker in the context of transportation polytopes. A result of Adin leads to an upper bound for the number of $\mathbb{Q}$-bases for $\mathbb{Q}(\zeta)$ among the $n^{th}$ roots of unity, which is tight if and only if $n$ has at most two odd prime factors. In addition, we study the Tutte polynomial of $\mu_n$ in the case that $n$ has two prime factors. Comment: 9 pages, 1 figure

arXiv.org e-Print Archive

CiteSeerX

KU ScholarWorks

Factorizations of some weighted spanning tree enumerators

Author: Martin Jeremy L.
Reiner Victor
Publication date: 02/09/2003

We give factorizations for weighted spanning tree enumerators of Cartesian products of complete graphs, keeping track of fine weights related to degree sequences and edge directions. Our methods combine Kirchhoff's Matrix-Tree Theorem with the technique of identification of factors. Comment: Final version, 12 pages. To appear in the Journal of Combinatorial Theory, Series A. The paper has been reorganized, and the proof of Theorem 4 shortened, in light of a more general result appearing in reference [6].
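As a rough illustration of the Matrix-Tree Theorem mentioned in this abstract, the sketch below counts spanning trees of a Cartesian product of complete graphs by taking the determinant of a reduced graph Laplacian. It covers only the unweighted count, not the weighted enumerators or the factorizations studied in the paper. The function names `complete_graph_product` and `count_spanning_trees` are hypothetical and chosen for this example.

```python
from fractions import Fraction
from itertools import product


def complete_graph_product(sizes):
    """Vertices and edges of the Cartesian product K_{n1} x ... x K_{nk}.

    Vertices are coordinate tuples; two vertices are adjacent exactly
    when they differ in a single coordinate.
    """
    vertices = list(product(*(range(n) for n in sizes)))
    edges = [
        (u, v)
        for i, u in enumerate(vertices)
        for v in vertices[i + 1:]
        if sum(a != b for a, b in zip(u, v)) == 1
    ]
    return vertices, edges


def count_spanning_trees(vertices, edges):
    """Matrix-Tree Theorem: the number of spanning trees equals the
    determinant of the Laplacian with one row and column deleted."""
    index = {v: i for i, v in enumerate(vertices)}
    n = len(vertices)
    lap = [[Fraction(0)] * n for _ in range(n)]
    for u, v in edges:
        i, j = index[u], index[v]
        lap[i][i] += 1
        lap[j][j] += 1
        lap[i][j] -= 1
        lap[j][i] -= 1

    # Reduced Laplacian: drop the last row and the last column.
    m = [row[:-1] for row in lap[:-1]]
    size = n - 1

    # Exact determinant by Gaussian elimination over the rationals.
    det = Fraction(1)
    for col in range(size):
        pivot = next((r for r in range(col, size) if m[r][col] != 0), None)
        if pivot is None:
            return 0  # singular matrix: the graph is disconnected
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            det = -det
        det *= m[col][col]
        for r in range(col + 1, size):
            factor = m[r][col] / m[col][col]
            for c in range(col, size):
                m[r][c] -= factor * m[col][c]
    return int(det)


if __name__ == "__main__":
    for sizes in [(4,), (2, 2), (3, 2), (2, 2, 2)]:
        print(sizes, count_spanning_trees(*complete_graph_product(sizes)))
```

Exact rational arithmetic is used instead of floating point so that the determinant comes out as an exact integer, at the cost of speed on larger graphs.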

arXiv.org e-Print Archive

Elsevier - Publisher Connector

KU ScholarWorks

Unraveling Quantum Annealers using Classical Hardness

Author: Hen Itay
Martin-Mayor Victor
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2015

Recent advances in quantum technology have led to the development and manufacturing of experimental programmable quantum annealing optimizers that contain hundreds of quantum bits. These optimizers, named 'D-Wave' chips, promise to solve practical optimization problems potentially faster than conventional 'classical' computers. Attempts to quantify the quantum nature of these chips have been met with both excitement and skepticism but have also brought up numerous fundamental questions pertaining to the distinguishability of quantum annealers from their classical thermal counterparts. Here, we propose a general method aimed at answering these, and apply it to experimentally study the D-Wave chip. Inspired by spin-glass theory, we generate optimization problems with a wide spectrum of 'classical hardness', which we also define. By investigating the chip's response to classical hardness, we surprisingly find that the chip's performance scales unfavorably as compared to several analogous classical algorithms. We detect, quantify and discuss purely classical effects that possibly mask the quantum behavior of the chip. Comment: 12 pages, 9 figures

arXiv.org e-Print Archive

Docta Complutense

Crossref

Repositorio Universidad de Zaragoza

PubMed Central

Embodying gender, age, ethnicity and power in ‘the field’: Reflections on dress and the presentation of the self in research with older Pakistani Muslims

Author: Martin W
Victor C
Zubair M
Publication venue: Universities of Surrey and Stirling, the British Sociological Association and SAGE Publications Ltd
Publication date: 01/08/2012

This article is available open access through the publisher’s website at the link below. Copyright © 2012 Sociological Research Online. In recent years, there has been an increasing interest in researching people growing older in the South Asian ethnic minority communities in the UK. However, these populations have received comparatively little attention in wide-ranging discussions on culturally and socially appropriate research methodologies. In this paper, we draw on the experiences of a young female Pakistani Muslim researcher researching older Pakistani Muslim women and men, to explore the significance of gender, age and ethnicity to fieldwork processes and 'field' relationships. In particular, we highlight the significance of dress and specific presentations of the embodied self within the research process. We do so by focusing upon three key issues: (1) Insider/Outsider boundaries and how these boundaries are continuously and actively negotiated in the field through the use of dress and specific presentations of the embodied 'self'; (2) The links between gender, age and space, more specifically how the researcher's use of traditional Pakistani dress, and her differing research relationships, are influenced by the older Pakistani Muslim participants' gendered use of public and private space; and (3) The opportunities and vulnerabilities experienced by the researcher in the field, reinforced by her use (or otherwise) of the traditional and feminine Pakistani Muslim dress. Our research therefore highlights the role of different presentations of the embodied 'self' to fieldwork processes and relationships, and illustrates how age, gender and status intersect to produce fluctuating insider/outsider boundaries as well as different opportunities and experiences of power and vulnerability within research relationships. ESR

Crossref

Brunel University Research Archive