398 research outputs found

    THE EFFECTIVENESS OF USING VIDEO IN TEACHING LISTENING OF ORAL NARRATIVE TEXT

    Get PDF
    The objective of the research is to identify the effectiveness of using video in teaching listening of oral narrative text. Related to the objective of the research, the writer uses experimental method. The research was conducted at SMP N 1 Sawit, Boyolali from 13 January to 28 February 2010, in the academic year 2009/2010. The population in this research is the eighth grade students of SMP N 1 Sawit. The total number of population is 280 students coming from seven classes. The sampling of the research is cluster random sampling. From the population, two classes were taken randomly as the sample. The samples are class VIII G as the experimental group which consists of 40 students, and class VIII F as the control group which consists of 40 students. The writer uses t-test, normality test, and homogeneity test in order to check whether the two groups have the same listening ability, homogeneity and normal distribution or not. After analyzing the pre-test data the writer finds that both groups are homogeneous and in normal distribution. Moreover, based on t-test of pre-test data analysis, the writer finds that t0=0.617 is less than tt(78,0.05)=1.980 or t0< tt. It means H0 is accepted and there is no significant difference in listening ability between the experimental group and the control group. The research design used is Quasi- Experimental Design with Pretest- Posttest and Control Group. While in collecting the data, the writer used a test in the form of multiple choice type. The data are then analyzed by using t-test formula. In this case, data which are analyzed are pre-test and post-test scores of the two groups, the experimental group and the control group. The result of t-test computation shows that t observation (t0) is 4.99 while the value of t table (tt) is 1.98. In other words, t0 is higher than tt (t observation > t table). Therefore, the Alternative Hypothesis (Ha) is accepted while Null Hypothesis (H0) is rejected. It can be concluded that there is a significant difference in listening achievement of oral narrative text between the experimental group and the control group. Besides, the writer finds that the mean of the scores of the experimental group is higher than the control group. The mean of the scores of the experimental group is 6.05, while the mean of the scores of control group is 5.48. The mean difference between them is 0.57. Thus, the result of the research study implies that video is effective to be applied in teaching listening of oral narrative text

    From Verbs to Tasks: An Integrated Account of Learning Tasks from Situated Interactive Instruction.

    Full text link
    Intelligent collaborative agents are becoming common in the human society. From virtual assistants such as Siri and Google Now to assistive robots, they contribute to human activities in a variety of ways. As they become more pervasive, the challenge of customizing them to a variety of environments and tasks becomes critical. It is infeasible for engineers to program them for each individual use. Our research aims at building interactive robots and agents that adapt to new environments autonomously by interacting with human users using natural modalities. This dissertation studies the problem of learning novel tasks from human-agent dialog. We propose a novel approach for interactive task learning, situated interactive instruction (SII), and investigate approaches to three computational challenges that arise in designing SII agents: situated comprehension, mixed-initiative interaction, and interactive task learning. We propose a novel mixed-modality grounded representation for task verbs which encompasses their lexical, semantic, and task-oriented aspects. This representation is useful in situated comprehension and can be learned through human-agent interactions. We introduce the Indexical Model of comprehension that can exploit extra-linguistic contexts for resolving semantic ambiguities in situated comprehension of task commands. The Indexical model is integrated with a mixed-initiative interaction model that facilitates a flexible task-oriented human-agent dialog. This dialog serves as the basis of interactive task learning. We propose an interactive variation of explanation-based learning that can acquire the proposed representation. We demonstrate that our learning paradigm is efficient, can transfer knowledge between structurally similar tasks, integrates agent-driven exploration with instructional learning, and can acquire several tasks. The methods proposed in this thesis are integrated in Rosie - a generally instructable agent developed in the Soar cognitive architecture and embodied on a table-top robot.PhDComputer Science and EngineeringUniversity of Michigan, Horace H. Rackham School of Graduate Studieshttp://deepblue.lib.umich.edu/bitstream/2027.42/111573/1/shiwali_1.pd

    Unsupervised extraction of semantic relations using discourse information

    Get PDF
    La compréhension du langage naturel repose souvent sur des raisonnements de sens commun, pour lesquels la connaissance de relations sémantiques, en particulier entre prédicats verbaux, peut être nécessaire. Cette thèse porte sur la problématique de l'utilisation d'une méthode distributionnelle pour extraire automatiquement les informations sémantiques nécessaires à ces inférences de sens commun. Des associations typiques entre des paires de prédicats et un ensemble de relations sémantiques (causales, temporelles, de similarité, d'opposition, partie/tout) sont extraites de grands corpus, par l'exploitation de la présence de connecteurs du discours signalant typiquement ces relations. Afin d'apprécier ces associations, nous proposons plusieurs mesures de signifiance inspirées de la littérature ainsi qu'une mesure novatrice conçue spécifiquement pour évaluer la force du lien entre les deux prédicats et la relation. La pertinence de ces mesures est évaluée par le calcul de leur corrélation avec des jugements humains, obtenus par l'annotation d'un échantillon de paires de verbes en contexte discursif. L'application de cette méthodologie sur des corpus de langue française et anglaise permet la construction d'une ressource disponible librement, Lecsie (Linked Events Collection for Semantic Information Extraction). Celle-ci est constituée de triplets: des paires de prédicats associés à une relation; à chaque triplet correspondent des scores de signifiance obtenus par nos mesures.Cette ressource permet de dériver des représentations vectorielles de paires de prédicats qui peuvent être utilisées comme traits lexico-sémantiques pour la construction de modèles pour des applications externes. Nous évaluons le potentiel de ces représentations pour plusieurs applications. Concernant l'analyse du discours, les tâches de la prédiction d'attachement entre unités du discours, ainsi que la prédiction des relations discursives spécifiques les reliant, sont explorées. En utilisant uniquement les traits provenant de notre ressource, nous obtenons des améliorations significatives pour les deux tâches, par rapport à plusieurs bases de référence, notamment des modèles utilisant d'autres types de représentations lexico-sémantiques. Nous proposons également de définir des ensembles optimaux de connecteurs mieux adaptés à des applications sur de grands corpus, en opérant une réduction de dimension dans l'espace des connecteurs, au lieu d'utiliser des groupes de connecteurs composés manuellement et correspondant à des relations prédéfinies. Une autre application prometteuse explorée dans cette thèse concerne les relations entre cadres sémantiques (semantic frames, e.g. FrameNet): la ressource peut être utilisée pour enrichir cette structure par des relations potentielles entre frames verbaux à partir des associations entre leurs verbes. Ces applications diverses démontrent les contributions prometteuses amenées par notre approche permettant l'extraction non supervisée de relations sémantiques.Natural language understanding often relies on common-sense reasoning, for which knowledge about semantic relations, especially between verbal predicates, may be required. This thesis addresses the challenge of using a distibutional method to automatically extract the necessary semantic information for common-sense inference. Typical associations between pairs of predicates and a targeted set of semantic relations (causal, temporal, similarity, opposition, part/whole) are extracted from large corpora, by exploiting the presence of discourse connectives which typically signal these semantic relations. In order to appraise these associations, we provide several significance measures inspired from the literature as well as a novel measure specifically designed to evaluate the strength of the link between the two predicates and the relation. The relevance of these measures is evaluated by computing their correlations with human judgments, based on a sample of verb pairs annotated in context. The application of this methodology to French and English corpora leads to the construction of a freely available resource, Lecsie (Linked Events Collection for Semantic Information Extraction), which consists of triples: pairs of event predicates associated with a relation; each triple is assigned significance scores based on our measures. From this resource, vector-based representations of pairs of predicates can be induced and used as lexical semantic features to build models for external applications. We assess the potential of these representations for several applications. Regarding discourse analysis, the tasks of predicting attachment of discourse units, as well as predicting the specific discourse relation linking them, are investigated. Using only features from our resource, we obtain significant improvements for both tasks in comparison to several baselines, including ones using other representations of the pairs of predicates. We also propose to define optimal sets of connectives better suited for large corpus applications by performing a dimension reduction in the space of the connectives, instead of using manually composed groups of connectives corresponding to predefined relations. Another promising application pursued in this thesis concerns relations between semantic frames (e.g. FrameNet): the resource can be used to enrich this sparse structure by providing candidate relations between verbal frames, based on associations between their verbs. These diverse applications aim to demonstrate the promising contributions provided by our approach, namely allowing the unsupervised extraction of typed semantic relations

    The evolution of language: Proceedings of the Joint Conference on Language Evolution (JCoLE)

    Get PDF

    Linguistic Representation of Problem Solving Processes in Unaided Object Assembly

    Get PDF
    This thesis investigates the linguistic representation of problem solving processes in data recorded during unaided object assembly. It combines traditional approaches of analyzing verbal protocols with the recent approach of Cognitive Discourse Analysis

    Epistemička modalnost u akademskom diskursu u hrvatskom i engleskom jeziku

    Get PDF
    The present thesis is the result of a cross-cultural, genre-based study whose main objective is to examine how writers of research articles in psychology in Croatian and English use epistemic modality devices in hedging their claims or in evaluating other scholars’ work. Based on the corpus of 60 research articles published in Croatian and English journals, the study aims to establish the patterns of similarities and differences in the use of the epistemic devices across the main rhetorical sections of a research article as well as to identify their major hedging functions. The overall results show that English writers use epistemic markers more frequently than their Croatian counterparts. This finding is generally in line with the previous cross-cultural studies, showing a more salient use of hedges and their more entrenched status in the Anglo- American writing as compared to academic writing in some other languages investigated. With respect to the individual categories of epistemic devices, the results show both similarities and differences in their uses across the two sub-corpora. In both the English and Croatian sub-corpus, epistemic modal verbs are employed most frequently, followed by epistemic verbs, while epistemic nouns are the least frequent category of epistemic devices. The major difference in the overall results concerns the distributional patterns in the use of epistemic devices. While epistemic modal verbs show a strikingly high frequency of occurrences as compared to other epistemic devices in the English corpus, the results of the frequency analysis of the Croatian corpus show that writers hedge their claims mostly by means of the modal verbs, epistemic verbs, and epistemic adverbs and particles, as attested by their overall similar frequencies. With respect to the distribution of epistemic devices across the research article sections, both English and Croatian writers hedge their claims mostly in the Discussion, followed by the Introduction section, while the use of epistemic devices in the remaining two sections is significantly lower by comparison. Generally, this complies with the major rhetorical functions of the research article sections. Thus, the highest density of hedges in the Discussion reflects its major rhetorical functions primarily concerned with writers’ interpretations and implications of the given research, which often requires a cautious and tentative use of language, shielding writers from the risks of negatibilty of the claims. By contrast, the use of hedges in the middle research article sections is less salient given their focus on the descriptive accounts of the methodological procedures and obtained findings. Drawing on Hyland’s (1998) polypragmatic model of scientific hedges, epistemic devices in both corpora are mostly concerned with the reliability type of hedges, concerned with indicating uncertainties towards the propositional content, signaling at the same time the extent to which the claims may be considered as accurate given the limited state of knowledge they are based on. In addition, epistemic markers may be used as writer-oriented hedges concerned with diminishing the writers’ presence in the text, allowing them to maintain distance from the proposed claims. Finally, the use of epistemic verbs co-occurring with the 1st person plural pronouns is interpreted in the present study as a writer’s strategic choice in foregrounding the epistemic stance. This use of epistemic devices is more frequent in the English as compared to the Croatian corpus, which is in line with some previous crosscultural research, indicating that self-mention is a more prominent feature of the Anglo- American writing as compared to that in other languages. In sum, the present findings provide an insight into the use of the epistemic language in the cross-cultural disciplinary writing and as such may be of particular use to the Croatian speaking disciplinary scholars, students and all those interested in writing research articles in English. On a more general note, it is expected that the study may incite further research on academic writing conventions in Croatian or their comparison with those in English as a lingua franca of science.Cilj je rada istražiti kako autori znanstvenih članaka iz područja psihologije na hrvatskom i engleskom jeziku koriste sredstva epistemičke modalnosti da bi izrazili različiti stupanj sigurnosti prema iznesenim tvrdnjama te iskazali stav prema tvrdnjama drugih autora. Analiza se temelji na korpusu 60 znanstvenih članaka objavljenim u znanstvenim časopisima na hrvatskom i engleskom jeziku. Cilj je analize utvrđivanje sličnosti i razlika u uporabi i učestalosti sredstava epistemičke modalnosti u glavnim retoričkim segmentima znanstvenog članka te istraživanje njihovih pragmatičkih funkcija kao sredstava ograđivanja u znanstvenom tekstu. Rezultati frekvencijske analize pokazuju veću zastupljenost sredstava epistemičke modalnosti u engleskom korpusu u odnosu na hrvatski, što je općenito u skladu s nalazima prethodnih međujezičnih istraživanja koja upućuju na učestaliju uporabu oznaka ograđivanja u akademskom stilu angloameričkog govornog područja u odnosu na akademske stilove pisanja u nekim drugim jezicima. Rezultati pokazuju da su modalni glagoli najčešća gramatička kategorija epistemičkih sredstava u oba korpusa, dok su epistemički glagoli sljedeća kategorija po čestotnosti. U oba korpusa najmanju zastupljenost pokazuje uporaba epistemičkih imenica. Unatoč navedenim sličnostima, rezultati analize pokazuju na istaknutu uporabu modalnih glagola u engleskom korpusu, dok učestalost ostalih sredstava epistemičke modalnosti ne pokazuje drastična odstupanja. Rezultati analize hrvatskog korpusa pokazuju da se najčešća sredstva grupiraju oko modalnih glagola, epistemičkih punoznačnih glagola te modalnih priloga i čestica, dok su ostala sredstva značajno manje zastupljena. Nalazi analize ukazuju da se u oba korpusa oznake ograđivanja najviše koriste u Raspravi, manje u Uvodu, dok je značajno manja učestalost zabilježena u Metodi i Rezultatima. Najveća zastupljenost oznaka ograđivanja u Raspravi ukazuje na autorovu potrebu iskazivanja opreza i odmaka u tumačenju nalaza istraživanja i pokušajima izvođenja zaključaka, što proizlazi iz svijesti o različitim ograničenjima istraživanja koja često ne dozvoljavaju iskazivanje visokog stupnja sigurnosti u iznošenju stavova. Manja zastupljenost oznaka ograđivanja u središnjim segmentima članka odražava njihovu primarnu usmjerenost na opise metodoloških postupaka i rezultata, što u pravilu ne zahtijeva izraženiju uporabu oznaka ograđivanja. U odnosu na Hylandov (1998) polipragmatički model ograđivanja u znanstvenom tekstu, rezultati pokazuju da se sredstva epistemičke modalnosti najčešće koriste za iskazivanje nižeg stupnja sigurnosti u odnosu na sadržaj tvrdnje, upućujući pritom da se iste mogu smatrati pouzdanim u okvirima postojećeg, često ograničenog, znanja na temelju kojeg se izvode. Osim na propozicijski sadržaj, pragmatičke funkcije epistemičkih sredstava mogu biti usmjerene i na autora, pri čemu se umanjuje njegova prisutnost u tekstu te omogućuje zadržavanje većeg odmaka od iznesenih tvrdnji. Naposlijetku, uporaba prvog lica i punoznačnih epistemičkih glagola u ovom se radu smatra autorovim izborom s ciljem isticanja osobnog epistemičkog stava. Rezultati pokazuju da je navedena uporaba epistemičkih sredstava učestalija u engleskom korpusu, što je općenito u skladu s nekim prethodnim međujezičnim istraživanjima koja ukazuju da je prisutnost autora istaknutija konvencija angloameričkog akademskog stila pisanja u odnosu na iste u nekim drugim istraživanim jezicima. Zaključno, pretpostavlja se da bi uočene specifičnosti u uporabi sredstava epistemičke modalnosti u psihologijskim člancima u engleskom i hrvatskom jeziku mogle koristiti predmetnim stručnjacima, studentima i svima onima koji počinju pisati ili već imaju iskustvo pisanja znanstvenih članaka kako na hrvatskom, tako i na engleskom jeziku. Očekuje se da bi postojeće istraživanje moglo potaknuti daljnja istraživanja konvencija akademskog pisanja, kako hrvatskog jezika, tako i njihove usporedbe s engleskim jezikom kao globalnim jezikom znanosti

    Argumentation Mining in User-Generated Web Discourse

    Full text link
    The goal of argumentation mining, an evolving research field in computational linguistics, is to design methods capable of analyzing people's argumentation. In this article, we go beyond the state of the art in several ways. (i) We deal with actual Web data and take up the challenges given by the variety of registers, multiple domains, and unrestricted noisy user-generated Web discourse. (ii) We bridge the gap between normative argumentation theories and argumentation phenomena encountered in actual data by adapting an argumentation model tested in an extensive annotation study. (iii) We create a new gold standard corpus (90k tokens in 340 documents) and experiment with several machine learning methods to identify argument components. We offer the data, source codes, and annotation guidelines to the community under free licenses. Our findings show that argumentation mining in user-generated Web discourse is a feasible but challenging task.Comment: Cite as: Habernal, I. & Gurevych, I. (2017). Argumentation Mining in User-Generated Web Discourse. Computational Linguistics 43(1), pp. 125-17

    Extracting Temporal and Causal Relations between Events

    Full text link
    Structured information resulting from temporal information processing is crucial for a variety of natural language processing tasks, for instance to generate timeline summarization of events from news documents, or to answer temporal/causal-related questions about some events. In this thesis we present a framework for an integrated temporal and causal relation extraction system. We first develop a robust extraction component for each type of relations, i.e. temporal order and causality. We then combine the two extraction components into an integrated relation extraction system, CATENA---CAusal and Temporal relation Extraction from NAtural language texts---, by utilizing the presumption about event precedence in causality, that causing events must happened BEFORE resulting events. Several resources and techniques to improve our relation extraction systems are also discussed, including word embeddings and training data expansion. Finally, we report our adaptation efforts of temporal information processing for languages other than English, namely Italian and Indonesian.Comment: PhD Thesi
    corecore