1,323,412 research outputs found

    Language universals and interlinguistic variation

    Get PDF
    Actually, the title should include intralinguistic variation along with the interlinguistic one. For variation within one and the same language is the thing which directly presents itself to the observation while it still remains to be demonstrated that phenomena in different languages can be regarded as variants to be assigned to one and the same invariant principle. There are two senses in which the terms of variant, variation are used in the following remarks: one, which has just been mentioned, concerns the assignment of variants to some definite invariant. The other implies the possibility of gradient transitions and opposes the notions of discreteness and of yes-or-no. I shall not try here to reconcile these two senses and I trust that what I intend to show will become intelligible nevertheless. Henri Delacroix (1924:126f) has reformulated an old hypothesis which seems worth exploring in connection with the search for language universals: "Une langue est une variation historique sur le grand thème humain du langage." It remains to be seen what "le grand thème" or rather "les grands thèmes" are about and what particular language-specific properties could be shown to be variants of one and the same theme. One such major theme which we shall now investigate is the interrelation between, on one side, a word or a sequence of words, and, on the other, a sentence. As this for us is not only a syntactic but also a semantic problem, we might rephrase the anti thesis as that between a term or sequence of terms and a proposition. Two alternative views on the nature of this interrelation seem conceivable: A. The interrelation is yes-or-no, i. e. an element or a string of elements either constitutes a term (sequence of terms) or a proposition. B. The interrelation is of gradient nature, i. e. we find intermediary stages. Both alternatives are appropriate, but under different circumstances

    [v]at is going on? Local and global ideologies about Indian English

    Get PDF
    ABSTRACTThis article examines local and global language ideologies surrounding a particular phonetic feature in Indian English, the pronunciation of /v/ as [w]. By focusing on how local and global participants – both individuals and institutions – imagine language variation through disparate framings of “neutral” and “standard,” it highlights how processes of globalization and localization are interconnected, dialogic, and symbiotic. Compared are (i) sociolinguistic constructions of Indian cartoon characters, (ii) American “accent training” institutes, (iii) Indian call center and language improvement books, (iv) American speakers’ interpretations of merged IE speech, and, (v) IE speakers’ attitudes about IE, “neutral,” and ”standard” language. The relative social capital of these populations mediates both how each constructs its respective ideology about language variation, and how these ideologies dialogically interact with each other. (Language variation, language ideologies, dialogic, standard language)1</jats:p

    Dialectometric analysis of language variation in Twitter

    Full text link
    In the last few years, microblogging platforms such as Twitter have given rise to a deluge of textual data that can be used for the analysis of informal communication between millions of individuals. In this work, we propose an information-theoretic approach to geographic language variation using a corpus based on Twitter. We test our models with tens of concepts and their associated keywords detected in Spanish tweets geolocated in Spain. We employ dialectometric measures (cosine similarity and Jensen-Shannon divergence) to quantify the linguistic distance on the lexical level between cells created in a uniform grid over the map. This can be done for a single concept or in the general case taking into account an average of the considered variants. The latter permits an analysis of the dialects that naturally emerge from the data. Interestingly, our results reveal the existence of two dialect macrovarieties. The first group includes a region-specific speech spoken in small towns and rural areas whereas the second cluster encompasses cities that tend to use a more uniform variety. Since the results obtained with the two different metrics qualitatively agree, our work suggests that social media corpora can be efficiently used for dialectometric analyses.Comment: 10 pages, 7 figures, 1 table. Accepted to VarDial 201

    On formal universals in phonology

    Get PDF
    Understanding the universal aspects of human language structure requires comparison at multiple levels of analysis. While Evans & Levinson (E&L) focus mostly on substantive variation in language, equally revealing insights can come from studying formal universals. I first discuss how Artificial Grammar Experiments can test universal preferences for certain types of abstract phonological generalizations over others. I then discuss moraic onsets in the language Arrernte, and how its apparent substantive variation ultimately rests on a formal universal regarding syllable-weight sensitivity

    Selecting ELL Textbooks: A Content Analysis of Language-Teaching Models

    Get PDF
    Many middle school teachers lack adequate criteria to critically select materials that represent a variety of L2 teaching models. This study analyzes the illustrated and written content of 33 ELL textbooks to determine the range of L2 teaching models represented. The researchers asked to what extent do middle school ELL texts depict frequency and variation of language-teaching models in illustrations and written texts. Using content analysis, they measured the range of depiction of the 4 language-teaching models and concluded that 4 of the 33 textbooks had considerable to extensive frequency and variation of L2 teaching model

    Towards a continuous modeling of natural language domains

    Full text link
    Humans continuously adapt their style and language to a variety of domains. However, a reliable definition of `domain' has eluded researchers thus far. Additionally, the notion of discrete domains stands in contrast to the multiplicity of heterogeneous domains that humans navigate, many of which overlap. In order to better understand the change and variation of human language, we draw on research in domain adaptation and extend the notion of discrete domains to the continuous spectrum. We propose representation learning-based models that can adapt to continuous domains and detail how these can be used to investigate variation in language. To this end, we propose to use dialogue modeling as a test bed due to its proximity to language modeling and its social component.Comment: 5 pages, 3 figures, published in Uphill Battles in Language Processing workshop, EMNLP 201
    corecore