171 research outputs found

    POLLEX-Online: The Polynesian Lexicon Project Online

    Full text link

    Basic vocabulary and Bayesian phylolinguistics

    No full text
    Donohue et al.’s critique of our work on the origins and spread of the Austronesian language family is marred by misunderstandings of our approach. We respond to these by noting that our Bayesian phylogenetic approach: (1) distinguishes between retentions and innovations probabilistically, (2) focuses on basic vocabulary not ‘the lexicon’, (3) eliminates known loanwords, (4) produces results that are congruent with the results of the comparative method and conflict with the scenarios requiring unprecedented amounts of language shift postulated by Donohue et al

    Population structure and cultural geography of a folktale in Europe.

    No full text
    Despite a burgeoning science of cultural evolution, relatively little work has focused on the population structure of human cultural variation. By contrast, studies in human population genetics use a suite of tools to quantify and analyse spatial and temporal patterns of genetic variation within and between populations. Human genetic diversity can be explained largely as a result of migration and drift giving rise to gradual genetic clines, together with some discontinuities arising from geographical and cultural barriers to gene flow. Here, we adapt theory and methods from population genetics to quantify the influence of geography and ethnolinguistic boundaries on the distribution of 700 variants of a folktale in 31 European ethnolinguistic populations. We find that geographical distance and ethnolinguistic affiliation exert significant independent effects on folktale diversity and that variation between populations supports a clustering concordant with European geography. This pattern of geographical clines and clusters parallels the pattern of human genetic diversity in Europe, although the effects of geographical distance and ethnolinguistic boundaries are stronger for folktales than genes. Our findings highlight the importance of geography and population boundaries in models of human cultural variation and point to key similarities and differences between evolutionary processes operating on human genes and culture

    How Accurate and Robust Are the Phylogenetic Estimates of Austronesian Language Relationships?

    Get PDF
    We recently used computational phylogenetic methods on lexical data to test between two scenarios for the peopling of the Pacific. Our analyses of lexical data supported a pulse-pause scenario of Pacific settlement in which the Austronesian speakers originated in Taiwan around 5,200 years ago and rapidly spread through the Pacific in a series of expansion pulses and settlement pauses. We claimed that there was high congruence between traditional language subgroups and those observed in the language phylogenies, and that the estimated age of the Austronesian expansion at 5,200 years ago was consistent with the archaeological evidence. However, the congruence between the language phylogenies and the evidence from historical linguistics was not quantitatively assessed using tree comparison metrics. The robustness of the divergence time estimates to different calibration points was also not investigated exhaustively. Here we address these limitations by using a systematic tree comparison metric to calculate the similarity between the Bayesian phylogenetic trees and the subgroups proposed by historical linguistics, and by re-estimating the age of the Austronesian expansion using only the most robust calibrations. The results show that the Austronesian language phylogenies are highly congruent with the traditional subgroupings, and the date estimates are robust even when calculated using a restricted set of historical calibrations

    Scintillation in the Circinus Galaxy water megamasers

    Full text link
    We present observations of the 22 GHz water vapor megamasers in the Circinus galaxy made with the Tidbinbilla 70m telescope. These observations confirm the rapid variability seen earlier by Greenhill et al (1997). We show that this rapid variability can be explained by interstellar scintillation, based on what is now known of the interstellar scintillation seen in a significant number of flat spectrum AGN. The observed variability cannot be fully described by a simple model of either weak or diffractive scintillation.Comment: 10 pages, 5 figures. AJ accepte

    Untangling the Evolution of Body-Part Terminology in Pano: Conservative versus Innovative Traits in Body-Part Lexicalization

    Get PDF
    Although language-family specific traits which do not find direct counterparts outside a given language family are usually ignored in quantitative phylogenetic studies, scholars have made ample use of them in qualitative investigations, revealing their potential for identifying language relationships. An example of such a family specific trait are body-part expressions in Pano languages, which are often lexicalized forms, composed of bound roots (also called body-part prefixes in the literature) and non-productive derivative morphemes (called here body-part formatives). We use various statistical methods to demonstrate that whereas body-part roots are generally conservative, body-part formatives exhibit diverse chronologies and are often the result of recent and parallel innovations. In line with this, the phylogenetic structure of body-part roots projects the major branches of the family, while formatives are highly non-tree-like. Beyond its contribution to the phylogenetic analysis of Pano languages, this study provides significative insights into the role of grammatical innovations for language classification, the origin of morphological complexity in the Amazon and the phylogenetic signal of specific grammatical traits in language families

    Historical, archaeological and linguistic evidence test the phylogenetic inference of Viking-Age plant use

    Get PDF
    In this paper, past plant knowledge serves as a case study to highlight the promise and challenges of interdisciplinary data collection and interpretation in cultural evolution. Plants are central to human life and yet, apart from the role of major crops, people–plant relations have been marginal to the study of culture. Archaeological, linguistic, and historical evidence are often limited when it comes to studying the past role of plants. This is the case in the Nordic countries, where extensive collections of various plant use records are absent until the 1700s. Here, we test if relatively recent ethnobotanical data can be used to trace back ancient plant knowledge in the Nordic countries. Phylogenetic inferences of ancestral states are evaluated against historical, linguistic, and archaeobotanical evidence. The exercise allows us to discuss the opportunities and shortcomings of using phylogenetic comparative methods to study past botanical knowledge. We propose a ‘triangulation method’ that not only combines multiple lines of evidence, but also quantitative and qualitative approaches. This article is part of the theme issue ‘Foundations of cultural evolution’

    Sequence comparison in computational historical linguistics

    Get PDF
    With increasing amounts of digitally available data from all over the world, manual annotation of cognates in multi-lingual word lists becomes more and more time-consuming in historical linguistics. Using available software packages to pre-process the data prior to manual analysis can drastically speed-up the process of cognate detection. Furthermore, it allows us to get a quick overview on data which have not yet been intensively studied by experts. LingPy is a Python library which provides a large arsenal of routines for sequence comparison in historical linguistics. With LingPy, linguists can not only automatically search for cognates in lexical data, but they can also align the automatically identified words, and output them in various forms, which aim at facilitating manual inspection. In this tutorial, we will briefly introduce the basic concepts behind the algorithms employed by LingPy and then illustrate in concrete workflows how automatic sequence comparison can be applied to multi-lingual word lists. The goal is to provide the readers with all information they need to (1) carry out cognate detection and alignment analyses in LingPy, (2) select the appropriate algorithms for the appropriate task, (3) evaluate how well automatic cognate detection algorithms perform compared to experts, and (4) export their data into various formats useful for additional analyses or data sharing. While basic knowledge of the Python language is useful for all analyses, our tutorial is structured in such a way that scholars with basic knowledge of computing can follow through all steps as well

    CLICS2: An improved database of cross-linguistic colexifications assembling lexical data with the help of cross-linguistic data formats

    Get PDF
    The Database of Cross-Linguistic Colexifications (CLICS), has established a computer-assisted framework for the interactive representation of cross-linguistic colexification patterns. In its current form, it has proven to be a useful tool for various kinds of investigation into cross-linguistic semantic associations, ranging from studies on semantic change, patterns of conceptualization, and linguistic paleontology. But CLICS has also been criticized for obvious shortcomings, ranging from the underlying dataset, which still contains many errors, up to the limits of cross-linguistic colexification studies in general. Building on recent standardization efforts reflected in the Cross-Linguistic Data Formats initiative (CLDF) and novel approaches for fast, efficient, and reliable data aggregation, we have created a new database for cross-linguistic colexifications, which not only supersedes the original CLICS database in terms of coverage but also offers a much more principled procedure for the creation, curation and aggregation of datasets. The paper presents the new database and discusses its major features

    Managing historical linguistic data for computational phylogenetics and computer-assisted language comparison

    Get PDF
    THIS IS A PRE-PRINT, PLEASE CITE IT AS: Tresoldi, Tiago; Rzymski, Christoph; Forkel, Robert; Greenhill, Simon J.; List, Johann-Mattis; and Gray, Russell D. (2019) “Managing historical linguistic data for computational phylogenetics and computer-assisted language comparison (PRE-PRINT)”. Jena: Max-Planck-Institute for the Science of Human History. The popularisation of computer-based methods in comparative linguistics has led to a greater awareness of issues resulting from limited data sustainability and proper data management. In this use-case and its accompanying tutorial, we present principles of data management as applied to computational phylogenetics and computer-assisted language comparison, showcasing the solutions we recommend. Instead of enumerating the many possibilities to code and use linguistic data to conduct a phylogenetic analysis, we illustrate our suggestions for phylogenetic data management in a workflow based on a concrete analysis, showing how data should be managed with the help of a published dataset, exploring the information, file formats, processes, and software involved, explaining and showing how to collect and store cross-linguistic information, how to guarantee that datasets are cross-linguistically comparable, how to store intermediate and final results of the analyses, and how to share data in a reusable form by relying in the tools and principles of the CLDF initiative
    • …
    corecore