942 research outputs found

    A Spanish text corpus for the author profiling task

    Author Profiling is the task of predicting characteristics of the author of a text, such as age, gender, personality, native language, etc. This is a task of growing importance due to its potential applications in security, crime and marketing, among others. One of the main difficulties in this field is the lack of reliable text collections (corpora) to train and test automatically derived classifiers, in particular in specific languages such as Spanish. Although some recent data sets were generated for the PAN competitions, these documents have a lot of “noise” that prevents researchers from obtaining more general conclusions about this task when more formal documents are used. In this context, this work proposes and describes SpanText, a data collection of formal texts in Spanish which is, as far as we know, the first collection with these characteristics for the author profiling task. In addition, an experimental study is carried out in which the difference in performance obtained with formal and informal texts is clearly established. This opens interesting research lines for gaining a deeper understanding of the particularities that each type of document poses for the author profiling task. XI Workshop Bases de Datos y Minería de Datos. Red de Universidades con Carreras de Informática (RedUNCI).

    What demographic attributes do our digital footprints reveal? A systematic review

    To what extent does our online activity reveal who we are? Recent research has demonstrated that the digital traces left by individuals as they browse and interact with others online may reveal who they are and what their interests may be. In the present paper we report a systematic review that synthesises current evidence on predicting demographic attributes from online digital traces. Studies were included if they met the following criteria: (i) they reported findings where at least one demographic attribute was predicted/inferred from at least one form of digital footprint, (ii) the method of prediction was automated, and (iii) the traces were either visible (e.g. tweets) or non-visible (e.g. clickstreams). We identified 327 studies published up until October 2018. Across these articles, 14 demographic attributes were successfully inferred from digital traces; the most studied included gender, age, location, and political orientation. For each of the demographic attributes identified, we provide a database containing the platforms and digital traces examined, sample sizes, accuracy measures and the classification methods applied. Finally, we discuss the main research trends/findings, methodological approaches and recommend directions for future research.

    PROFILING LEARNING ACTIVITIES IN EXTENSIVE READING COURSE: A CASE OF INDONESIAN UNIVERSITY LEARNERS

    Extensive Reading (hereafter, ER) has been discussed and deployed for several decades as a prevalent approach to enhancing EFL/ESL learners’ reading skills in the language classroom. However, insufficient attention has been devoted to students’ learning activities in Extensive Reading courses, notably in Indonesia. For this reason, this study focused on profiling the learning activities in an Extensive Reading course in that country. The data were collected through semi-structured interviews and analysed with thematic analysis (Braun & Clarke, 2006). The findings indicated that the students performed two main kinds of learning activities in the Extensive Reading course, namely inside and outside classroom activities. Inside the classroom, they gave presentations to develop not only reading skills but also speaking skills, self-confidence and self-responsibility. Outside the classroom, they selected and read literary works based on their interests and abilities, completed reading logs, created PowerPoint slides, wrote reports, produced a poster for presentations and posted their work on their own blogs. Given these facts, ER learning activities enable EFL/ESL learners to foster and sustain their reading strategies and become more strategic readers.

    A systematic survey of online data mining technology intended for law enforcement

    As an increasing amount of crime takes on a digital aspect, law enforcement bodies must tackle an online environment generating huge volumes of data. With manual inspections becoming increasingly infeasible, law enforcement bodies are optimising online investigations through data-mining technologies. Such technologies must be well designed and rigorously grounded, yet no survey of the online data-mining literature exists which examines their techniques, applications and rigour. This article fills this gap through a systematic mapping study describing online data-mining literature which visibly targets law enforcement applications, using evidence-based practices in survey making to produce a replicable analysis which can be methodologically examined for deficiencies.

    The Impact of Viral Marketing on Emotion and Impulse Buying Behavior: A Case Study of Online Fashion

    Impulsive online shopping is becoming a habit for many young consumers, especially for fashion products. This study aims to analyze the influence of viral marketing on the emotions and impulsive online shopping behavior of young people buying fashion products in Vietnam. The results showed that the characteristics of viral marketing, namely entertainment, source credibility, visual appeal, informativeness, and irritation, all had a significant impact on emotions and impulsive online shopping behavior. Therefore, some suggestions are proposed for applying viral marketing to promote impulsive online shopping behavior for fashion products. Keywords: Viral marketing, Impulse buying behavior, Online shopping, Emotions, Fashion. DOI: 10.7176/EJBM/15-7-03 Publication date: April 30th 202

    The discursive construction of teachers and implications for continuing professional development

    The Malaysia Education Blueprint 2013-2025 is a document that spells out a plan of action for revamping the Malaysian education system. It is therefore no surprise that references are made to teachers and their role in ensuring the successful execution of the action plan. Although the blueprint does not set out a course of action for teachers of individual subjects, specific reference is made to English language teachers, and this is ideologically significant. In order to understand this significance and how the blueprint positions Malaysian English language teachers, the document needs to be located within the wider discourse community through an intertextual reading. In this paper, we first examine the discursive construction of English language teachers in the blueprint as well as in media texts to illustrate how these texts have collectively constructed the identity of Malaysian English language teachers. Next, we argue that this discursive construction of Malaysian English language teachers has had consequences for the way continuing professional development programmes were organised for them in the first of three waves of the Malaysian Education Blueprint action plan, from 2013 to 2015. The findings reveal that continuing professional development programmes during this period focused predominantly on training the discursively constructed inept Malaysian English language teacher, to ensure that such teachers possess the desired proficiency and are able to make changes to existing classroom practices that are aligned with the government agenda.

    The Obama/Pentagon War Narrative, the Real War and Where Afghan Civilian Deaths Do Matter

    This article investigates two related issues: (1) the on-the-ground experience of the fierce US war in Afghanistan, in contrast to the Pentagon story and the mainstream media; (2) the relentless efforts of Obama and the Pentagon to control the public account of this war. While the real war spread geographically and violence intensified, so did the efforts of the United States to build a positive reading. The examination of the corpses (of foreign occupation forces and innocent Afghan civilians) reveals a situation of exchange. The elites of the NATO countries have understood that they have entered a dead end and are beginning to back down.