Search CORE

2,287 research outputs found

Lexical representation explains cortical entrainment during speech comprehension

Author: Frank Stefan
Yang Jinbiao
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2018
Field of study

Results from a recent neuroimaging study on spoken sentence comprehension have been interpreted as evidence for cortical entrainment to hierarchical syntactic structure. We present a simple computational model that predicts the power spectra from this study, even though the model's linguistic knowledge is restricted to the lexical level, and word-level representations are not combined into higher-level units (phrases or sentences). Hence, the cortical entrainment results can also be explained from the lexical properties of the stimuli, without recourse to hierarchical syntax.Comment: Submitted for publicatio

arXiv.org e-Print Archive

Directory of Open Access Journals

Radboud Repository

MPG.PuRe

FigShare

Linguistics and LIS: A Research Agenda

Author: Bolin Mary K
Publication venue: SJSU ScholarWorks
Publication date: 21/06/2017
Field of study

Linguistics and Library and Information Science (LIS) are both interdisciplinary fields that draws from areas such as languages, psychology, sociology, cognitive science, computer science, anthropology, education, and management. The theories and methods of linguistic research can have significant explanatory power for LIS. This article presents a research agenda for LIS that proposes the use of linguistic analysis methods, including discourse analysis, typology, and genre theory

DigitalCommons@University of Nebraska

SJSU ScholarWorks

A Wikipedia Literature Review

Author: Martin Owen S.
Publication venue
Publication date: 01/01/2010
Field of study

This paper was originally designed as a literature review for a doctoral dissertation focusing on Wikipedia. This exposition gives the structure of Wikipedia and the latest trends in Wikipedia research

arXiv.org e-Print Archive

CiteSeerX

Testing word embeddings for Polish

Author: Agnieszka Mykowiecka
Małgorzata Marciniak
Piotr Rychlik
Publication venue: 'Institute of Slavic Studies Polish Academy of Sciences'
Publication date: 01/01/2017
Field of study

Testing word embeddings for Polish Distributional Semantics postulates the representation of word meaning in the form of numeric vectors which represent words which occur in context in large text data. This paper addresses the problem of constructing such models for the Polish language. The paper compares the effectiveness of models based on lemmas and forms created with Continuous Bag of Words (CBOW) and skip-gram approaches based on different Polish corpora. For the purposes of this comparison, the results of two typical tasks solved with the help of distributional semantics, i.e. synonymy and analogy recognition, are compared. The results show that it is not possible to identify one universal approach to vector creation applicable to various tasks. The most important feature is the quality and size of the data, but different strategy choices can also lead to significantly different results. Testowanie wektorowych reprezentacji dystrybucyjnych słów języka polskiego Semantyka dystrybucyjna opiera się na założeniu, że znaczenie słów wyrażone jest za pomocą wektorów reprezentujących, w sposób bezpośredni bądź pośredni, konteksty, w jakich słowo to jest używane w dużym zbiorze tekstów. Niniejszy artykuł dotyczy ewaluacji wielu takich modeli skonstruowanych dla języka polskiego. W pracy porównano skuteczność modeli opartych na lematach i formach słów, utworzonych przy wykorzystaniu sieci neuronowych na danych z dwóch różnych korpusów języka polskiego. Ewaluacji dokonano na podstawie wyników dwóch typowych zadań rozwiązywanych za pomocą metod semantyki dystrybucyjnej, tzn. rozpoznania występowania synonimii i analogii między konkretnymi parami słów. Uzyskane wyniki dowodzą, że nie można wskazać jednego uniwersalnego podejścia do tworzenia modeli dystrybucyjnych, gdyż ich skuteczność jest różna w zależności od zastosowania. Najważniejszą cechą wpływającą na jakość modelu jest jakość oraz rozmiar danych, ale wybory różnych strategii uczenia sieci mogą również prowadzić do istotnie odmiennych wyników

Crossref

Biblioteka Nauki - repozytorium artykuÅÃ³w

Directory of Open Access Journals