Search CORE

102,916 research outputs found

Computational Sociolinguistics: A Survey

Author: de Jong Franciska
Doğruöz A. Seza
Nguyen Dong
Rosé Carolyn P.
Publication venue
Publication date: 01/01/2016
Field of study

Language is a social phenomenon and variation is inherent to its social nature. Recently, there has been a surge of interest within the computational linguistics (CL) community in the social dimension of language. In this article we present a survey of the emerging field of "Computational Sociolinguistics" that reflects this increased interest. We aim to provide a comprehensive overview of CL research on sociolinguistic themes, featuring topics such as the relation between language and social identity, language use in social interaction and multilingual communication. Moreover, we demonstrate the potential for synergy between the research communities involved, by showing how the large-scale data-driven methods that are widely used in CL can complement existing sociolinguistic studies, and how sociolinguistics can inform and challenge the methods and assumptions employed in CL studies. We hope to convey the possible benefits of a closer collaboration between the two communities and conclude with a discussion of open challenges.Comment: To appear in Computational Linguistics. Accepted for publication: 18th February, 201

arXiv.org e-Print Archive

Crossref

Ghent University Academic Bibliography

EUR Research Repository

University of Twente Research Information

Trends in LN-embedding practices at Waikato Institute of Technology (Wintec) in 2019

Author: Ahmad Farzana
Greyling Willfred
Wallace Nika
Publication venue: Wintec
Publication date: 06/04/2020
Field of study

In this report, we describe the trends in literacy-embedding practices of level-2 and level-3 tutors who worked in vocational contexts at Waikato Institute of Technology (Wintec), and who completed the New Zealand Certificate in Adult Literacy and Numeracy Education (NZCALNE[Voc]) in 2019. We analysed 19 observations, following constructivist grounded theory methodology (Charmaz, 2014), to produce 1302 descriptive labels that highlight literacy and numeracy practices integrated into tutors’ teaching intentionally pursued in a collaborative and mentored training process. Of the initial 12 categories, we conflated the mapping of LN course demands and identifying learners’ LN needs to arrive at a final 11. We then used these categories in an axial analysis (Saldaňa, 2013), categorising the 1302 labels as binaries (i.e. if the label was related to the category, 1 was coded; if not 0 [zero]). The matrix of 14322 ratings of 1s and 0s was then analysed. We calculated the frequency of 1s by category. We argued that the axial analysis allowed us to develop a more holistic perspective which showed how the 1302 labels were configured in relation to the 11 categories of analysis. We concluded that the 11 categories represented key aspects of vocational teaching and training emphasising that LN-embedding practices have to be seamlessly integrated into general pedagogical approaches. A key construct for new tutors is to shape their understanding of seamlessly integrated versus bolted-on LN practices. Our recommendations remain within the whole-of-organisation perspective proposed in the 2017-2018 report (Greyling, 2019)

Wintec Research Archive

How did the discussion go: Discourse act classification in social media conversations

Author: B O’Connor
J Bollen
K Scott
Mirko Lai
ML Larson
S Bhatia
S Hochreiter
S Hochreiter
Subhabrata Dutta
T Chakraborty
V Eisenlauer
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 07/08/2018
Field of study

We propose a novel attention based hierarchical LSTM model to classify discourse act sequences in social media conversations, aimed at mining data from online discussion using textual meanings beyond sentence level. The very uniqueness of the task is the complete categorization of possible pragmatic roles in informal textual discussions, contrary to extraction of question-answers, stance detection or sarcasm identification which are very much role specific tasks. Early attempt was made on a Reddit discussion dataset. We train our model on the same data, and present test results on two different datasets, one from Reddit and one from Facebook. Our proposed model outperformed the previous one in terms of domain independence; without using platform-dependent structural features, our hierarchical LSTM with word relevance attention mechanism achieved F1-scores of 71\% and 66\% respectively to predict discourse roles of comments in Reddit and Facebook discussions. Efficiency of recurrent and convolutional architectures in order to learn discursive representation on the same task has been presented and analyzed, with different word and comment embedding schemes. Our attention mechanism enables us to inquire into relevance ordering of text segments according to their roles in discourse. We present a human annotator experiment to unveil important observations about modeling and data annotation. Equipped with our text-based discourse identification model, we inquire into how heterogeneous non-textual features like location, time, leaning of information etc. play their roles in charaterizing online discussions on Facebook

arXiv.org e-Print Archive

Crossref

Recommended from our members

Applying Graph Theory to Examine the Dynamics of Student Discussions in Small-Group Learning.

Author: Chai Albert
Le Joshua P
Lee Andrew S
Lo Stanley M
Publication venue: eScholarship, University of California
Publication date: 01/06/2019
Field of study

Group work in science, technology, engineering, and mathematics courses is an effective means of improving student outcomes, and many different factors can influence the dynamics of student discussions and, ultimately, the success of collaboration. The substance and dynamics of group discussions are commonly examined using qualitative methods such as discourse analysis. To complement existing work in the literature, we developed a quantitative methodology that uses graph theory to map the progression of talk-turns of discussions within a group. We observed groups of students working with peer facilitators to solve problems in biological sciences, with three iterations of data collection and two major refinements of graph theory calculations. Results include general behaviors based on the turns in which different individuals talk and graph theory parameters to quantify group characteristics. To demonstrate the potential utility of the methodology, we present case studies with distinct patterns: a centralized group in which the peer facilitator behaves like an authority figure, a decentralized group in which most students talk their fair share of turns, and a larger group with subgroups that have implications for equity, diversity, and inclusion. Together, these results demonstrate that our adaptation of graph theory is a viable quantitative methodology to examine group discussions

eScholarship - University of California

Collaborative trails in e-learning environments

Author: Bodnar E.
Brasher A.
de Freitas Sara
Emans B.
Eyssautier C.
Jones A.
Keenoy K.
Levene Mark
Montandon L.
Pernin J.P.
Schoonenboom J.
Turcsanyi-Szabo M.
Waycott J.
Publication venue
Publication date: 23/12/2004
Field of study

This deliverable focuses on collaboration within groups of learners, and hence collaborative trails. We begin by reviewing the theoretical background to collaborative learning and looking at the kinds of support that computers can give to groups of learners working collaboratively, and then look more deeply at some of the issues in designing environments to support collaborative learning trails and at tools and techniques, including collaborative filtering, that can be used for analysing collaborative trails. We then review the state-of-the-art in supporting collaborative learning in three different areas – experimental academic systems, systems using mobile technology (which are also generally academic), and commercially available systems. The final part of the deliverable presents three scenarios that show where technology that supports groups working collaboratively and producing collaborative trails may be heading in the near future

Birkbeck Institutional Research Online

Recommended from our members

Discussion Analytics: Identifying Conversations and Social Learners in FutureLearn MOOCs

Author: Chua Shi-Min
Rienties Bart
Sharples Mike
Tagg Caroline
Publication venue: CEUR-WS.org
Publication date: 01/03/2017
Field of study

Discussion among learners in MOOCs has been hailed as beneficial for social constructive learning. To understand the pedagogical value of MOOC discussion forums, several researchers have utilized content analysis techniques to associate individual postings with differing levels of cognitive activity. However, this analysis typically ignores the turn taking among discussion postings, such as learners responding to others’ replies to their posts, learners receiving no reply for their posts, or learners just posting without conversing with others. This information is particularly important in understanding patterns of conversations that occur in MOOCs, and learners’ commenting behaviors. Therefore, in this paper we categorize comments in a FutureLearn MOOC based on their nature (post vs. reply to others’ post), classify learners based on their contributions for each type of post-ing, and identify conversations based on the types of comments composing them. This categorization quantifies the dynamics of conversations in the discussion activities, allowing monitoring of on-going discussion activities in FutureLearn and further analysis of identified conversations, social learners, and course steps with an unusually high number of a particular type of comment

Open Research Online (The Open University)