102,916 research outputs found

    Computational Sociolinguistics: A Survey

    Get PDF
    Language is a social phenomenon and variation is inherent to its social nature. Recently, there has been a surge of interest within the computational linguistics (CL) community in the social dimension of language. In this article we present a survey of the emerging field of "Computational Sociolinguistics" that reflects this increased interest. We aim to provide a comprehensive overview of CL research on sociolinguistic themes, featuring topics such as the relation between language and social identity, language use in social interaction and multilingual communication. Moreover, we demonstrate the potential for synergy between the research communities involved, by showing how the large-scale data-driven methods that are widely used in CL can complement existing sociolinguistic studies, and how sociolinguistics can inform and challenge the methods and assumptions employed in CL studies. We hope to convey the possible benefits of a closer collaboration between the two communities and conclude with a discussion of open challenges.Comment: To appear in Computational Linguistics. Accepted for publication: 18th February, 201

    Trends in LN-embedding practices at Waikato Institute of Technology (Wintec) in 2019

    Get PDF
    In this report, we describe the trends in literacy-embedding practices of level-2 and level-3 tutors who worked in vocational contexts at Waikato Institute of Technology (Wintec), and who completed the New Zealand Certificate in Adult Literacy and Numeracy Education (NZCALNE[Voc]) in 2019. We analysed 19 observations, following constructivist grounded theory methodology (Charmaz, 2014), to produce 1302 descriptive labels that highlight literacy and numeracy practices integrated into tutors’ teaching intentionally pursued in a collaborative and mentored training process. Of the initial 12 categories, we conflated the mapping of LN course demands and identifying learners’ LN needs to arrive at a final 11. We then used these categories in an axial analysis (Saldaňa, 2013), categorising the 1302 labels as binaries (i.e. if the label was related to the category, 1 was coded; if not 0 [zero]). The matrix of 14322 ratings of 1s and 0s was then analysed. We calculated the frequency of 1s by category. We argued that the axial analysis allowed us to develop a more holistic perspective which showed how the 1302 labels were configured in relation to the 11 categories of analysis. We concluded that the 11 categories represented key aspects of vocational teaching and training emphasising that LN-embedding practices have to be seamlessly integrated into general pedagogical approaches. A key construct for new tutors is to shape their understanding of seamlessly integrated versus bolted-on LN practices. Our recommendations remain within the whole-of-organisation perspective proposed in the 2017-2018 report (Greyling, 2019)

    How did the discussion go: Discourse act classification in social media conversations

    Full text link
    We propose a novel attention based hierarchical LSTM model to classify discourse act sequences in social media conversations, aimed at mining data from online discussion using textual meanings beyond sentence level. The very uniqueness of the task is the complete categorization of possible pragmatic roles in informal textual discussions, contrary to extraction of question-answers, stance detection or sarcasm identification which are very much role specific tasks. Early attempt was made on a Reddit discussion dataset. We train our model on the same data, and present test results on two different datasets, one from Reddit and one from Facebook. Our proposed model outperformed the previous one in terms of domain independence; without using platform-dependent structural features, our hierarchical LSTM with word relevance attention mechanism achieved F1-scores of 71\% and 66\% respectively to predict discourse roles of comments in Reddit and Facebook discussions. Efficiency of recurrent and convolutional architectures in order to learn discursive representation on the same task has been presented and analyzed, with different word and comment embedding schemes. Our attention mechanism enables us to inquire into relevance ordering of text segments according to their roles in discourse. We present a human annotator experiment to unveil important observations about modeling and data annotation. Equipped with our text-based discourse identification model, we inquire into how heterogeneous non-textual features like location, time, leaning of information etc. play their roles in charaterizing online discussions on Facebook

    Collaborative trails in e-learning environments

    Get PDF
    This deliverable focuses on collaboration within groups of learners, and hence collaborative trails. We begin by reviewing the theoretical background to collaborative learning and looking at the kinds of support that computers can give to groups of learners working collaboratively, and then look more deeply at some of the issues in designing environments to support collaborative learning trails and at tools and techniques, including collaborative filtering, that can be used for analysing collaborative trails. We then review the state-of-the-art in supporting collaborative learning in three different areas – experimental academic systems, systems using mobile technology (which are also generally academic), and commercially available systems. The final part of the deliverable presents three scenarios that show where technology that supports groups working collaboratively and producing collaborative trails may be heading in the near future
    • …
    corecore