Search CORE

2,256 research outputs found

Classifying Cue Phrases in Text and Speech Using Machine Learning

Author: Litman Diane J.
Publication venue
Publication date: 01/01/1994
Field of study

Cue phrases may be used in a discourse sense to explicitly signal discourse structure, but also in a sentential sense to convey semantic rather than structural information. This paper explores the use of machine learning for classifying cue phrases as discourse or sentential. Two machine learning programs (Cgrendel and C4.5) are used to induce classification rules from sets of pre-classified cue phrases and their features. Machine learning is shown to be an effective technique for not only automating the generation of classification rules, but also for improving upon previous results.Comment: 8 pages, PostScript File, to appear in the Proceedings of AAAI-9

arXiv.org e-Print Archive

CiteSeerX

Cue Phrase Classification Using Machine Learning

Author: Litman Diane J.
Publication venue
Publication date: 01/01/1996
Field of study

Cue phrases may be used in a discourse sense to explicitly signal discourse structure, but also in a sentential sense to convey semantic rather than structural information. Correctly classifying cue phrases as discourse or sentential is critical in natural language processing systems that exploit discourse structure, e.g., for performing tasks such as anaphora resolution and plan recognition. This paper explores the use of machine learning for classifying cue phrases as discourse or sentential. Two machine learning programs (Cgrendel and C4.5) are used to induce classification models from sets of pre-classified cue phrases and their features in text and speech. Machine learning is shown to be an effective technique for not only automating the generation of classification models, but also for improving upon previous results. When compared to manually derived classification models already in the literature, the learned models often perform with higher accuracy and contain new linguistic insights into the data. In addition, the ability to automatically construct classification models makes it easier to comparatively analyze the utility of alternative feature representations of the data. Finally, the ease of retraining makes the learning approach more scalable and flexible than manual methods.Comment: 42 pages, uses jair.sty, theapa.bst, theapa.st

arXiv.org e-Print Archive

CiteSeerX

Guidelines for spaceborne microwave remote sensors

Author: Litman V.
Nicholas J.
Publication venue
Publication date
Field of study

A handbook was developed to provide information and support to the spaceborne remote sensing and frequency management communities: to guide sensor developers in the choice of frequencies; to advise regulators on sensor technology needs and sharing potential; to present sharing analysis models and, through example, methods for determining sensor sharing feasibility; to introduce developers to the regulatory process; to create awareness of proper assignment procedures; to present sensor allocations; and to provide guidelines on the use and limitations of allocated bands. Controlling physical factors and user requirements and the regulatory environment are discussed. Sensor frequency allocation achievable performance and usefulness are reviewed. Procedures for national and international registration, the use of non-allocated bands and steps for obtaining new frequency allocations, and procedures for reporting interference are also discussed

NASA Technical Reports Server

Plan recognition for space telerobotics

Author: Goodman Bradley A.
Litman Diane J.
Publication venue
Publication date
Field of study

Current research on space telerobots has largely focused on two problem areas: executing remotely controlled actions (the tele part of telerobotics) or planning to execute them (the robot part). This work has largely ignored one of the key aspects of telerobots: the interaction between the machine and its operator. For this interaction to be felicitous, the machine must successfully understand what the operator is trying to accomplish with particular remote-controlled actions. Only with the understanding of the operator's purpose for performing these actions can the robot intelligently assist the operator, perhaps by warning of possible errors or taking over part of the task. There is a need for such an understanding in the telerobotics domain and an intelligent interface being developed in the chemical process design domain addresses the same issues

NASA Technical Reports Server

PARADISE: A Framework for Evaluating Spoken Dialogue Agents

Author: Abella Alicia
Kamm Candace A.
Litman Diane J.
Walker Marilyn A.
Publication venue
Publication date: 01/01/1997
Field of study

This paper presents PARADISE (PARAdigm for DIalogue System Evaluation), a general framework for evaluating spoken dialogue agents. The framework decouples task requirements from an agent's dialogue behaviors, supports comparisons among dialogue strategies, enables the calculation of performance over subdialogues and whole dialogues, specifies the relative contribution of various factors to performance, and makes it possible to compare agents performing different tasks by normalizing for task complexity.Comment: 10 pages, uses aclap, psfig, lingmacros, time

arXiv.org e-Print Archive

CiteSeerX

Creating Full Individual-level Location Timelines from Sparse Social Media Data

Author: Chon Yohan
Jurgens David
Litman Todd
Sadilek Adam
Yu A. J.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 22/11/2019
Field of study

In many domain applications, a continuous timeline of human locations is critical; for example for understanding possible locations where a disease may spread, or the flow of traffic. While data sources such as GPS trackers or Call Data Records are temporally-rich, they are expensive, often not publicly available or garnered only in select locations, restricting their wide use. Conversely, geo-located social media data are publicly and freely available, but present challenges especially for full timeline inference due to their sparse nature. We propose a stochastic framework, Intermediate Location Computing (ILC) which uses prior knowledge about human mobility patterns to predict every missing location from an individual's social media timeline. We compare ILC with a state-of-the-art RNN baseline as well as methods that are optimized for next-location prediction only. For three major cities, ILC predicts the top 1 location for all missing locations in a timeline, at 1 and 2-hour resolution, with up to 77.2% accuracy (up to 6% better accuracy than all compared methods). Specifically, ILC also outperforms the RNN in settings of low data; both cases of very small number of users (under 50), as well as settings with more users, but with sparser timelines. In general, the RNN model needs a higher number of users to achieve the same performance as ILC. Overall, this work illustrates the tradeoff between prior knowledge of heuristics and more data, for an important societal problem of filling in entire timelines using freely available, but sparse social media data.Comment: 10 pages, 8 figures, 2 table

arXiv.org e-Print Archive

Crossref

Cholesterol Dependent Recruitment of di22:6-PC by a G Protein-Coupled Receptor into Lateral Domains

Author: Litman Burton J.
Polozova Alla
Publication venue: The Biophysical Society. Published by Elsevier Inc.
Publication date: 30/11/2000
Field of study

AbstractBovine rhodopsin was reconstituted into mixtures of didocosahexaenoylphosphatidylcholine (di22:6-PC), dipalmitoylphosphatidylcholine (di16:0-PC), sn-1-palmitoyl-sn-2-docosahexaenoylphosphatidylcholine (16:0, 22:6-PC) and cholesterol. Rhodopsin denaturation was examined by using high-sensitivity differential scanning calorimetry. The unfolding temperature was increased at lower levels of lipid unsaturation, but the highest temperature was detected for native disk membranes: di22:6-PC<16:0,22:6-PC<di16:0,18:1-PC<native disks. The incorporation of 30mol% of cholesterol resulted in 2–4°C increase of denaturation temperature in all reconstituted systems examined. From the analysis of van’t Hoff’s and calorimetric enthalpies, it was concluded that the presence of cholesterol in di22:6-PC-containing bilayers induces a level of cooperativity in rhodopsin unfolding. Fluorescence resonance energy transfer (FRET), using lipids labeled at the headgroup with pyrene (Py) as donors and rhodopsin retinal group as acceptor of fluorescence, was used to study rhodopsin association with lipids. Higher FRET efficiencies detected for di22:6-PE-Py, compared to di16:0-PE-Py, in mixed di22:6-PC–di16:0-PC–cholesterol bilayers, indicate preferential segregation of rhodopsin with polyunsaturated lipids. The effective range of the rhodopsin–lipid interactions facilitating cluster formation exceeds two adjacent lipid layers. In similar mixed bilayers containing no cholesterol, cluster formation was absent at temperatures above lipid phase transition, indicating a crucial role of cholesterol in microdomain formation

Elsevier - Publisher Connector

Recommended from our members

Empirically Evaluating an Adaptable Spoken Dialogue System Diane J. Litman

Author: Litman Diane J.
Pan Shimei
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/1999
Field of study

Recent technological advances have made it possible to build real-time, interactive spoken dialogue systems for a wide variety of applications. However, when users do not respect the limitations of such systems, performance typically degrades. Although users differ with respect to their knowledge of system limitations, and although different dialogue strategies make system limitations more apparent to users, most current systems do not try to improve performance by adapting dialogue behavior to individual users. This paper presents an empirical evaluation of TOOT, an adaptable spoken dialogue system for retrieving train schedules on the web. We conduct an experiment in which 20 users carry out 4 tasks with both adaptable and non-adaptable versions of TOOT, resulting in a corpus of 80 dialogues. The values for a wide range of evaluation measures are then extracted from this corpus. Our results show that adaptable TOOT generally outperforms non-adaptable TOOT, and that the utility of adaptation depends on TOOT's initial dialogue strategies

Columbia University Academic Commons

When Does Disengagement Correlate with Performance in Spoken Dialog Computer Tutoring?

Author: Forbes-Riley Kate
Litman Diane J
Publication venue: 'IOS Press'
Publication date: 01/01/2012
Field of study

In this paper we investigate how student disengagement relates to two performance metrics in a spoken dialog computer tutoring corpus, both when disengagement is measured through manual annotation by a trained human judge, and also when disengagement is measured through automatic annotation by the system based on a machine learning model. First, we investigate whether manually labeled overall disengagement and six different disengagement types are predictive of learning and user satisfaction in the corpus. Our results show that although students’ percentage of overall disengaged turns negatively correlates both with the amount they learn and their user satisfaction, the individual types of disengagement correlate differently: some negatively correlate with learning and user satisfaction, while others don’t correlate with eithermetric at all. Moreover, these relationships change somewhat depending on student prerequisite knowledge level. Furthermore, using multiple disengagement types to predict learning improves predictive power. Overall, these manual label-based results suggest that although adapting to disengagement should improve both student learning and user satisfaction in computer tutoring, maximizing performance requires the system to detect and respond differently based on disengagement type. Next, we present an approach to automatically detecting and responding to user disengagement types based on their differing correlations with correctness. Investigation of ourmachine learningmodel of user disengagement shows that its automatic labels negatively correlate with both performance metrics in the same way as the manual labels. The similarity of the correlations across the manual and automatic labels suggests that the automatic labels are a reasonable substitute for the manual labels. Moreover, the significant negative correlations themselves suggest that redesigning ITSPOKE to automatically detect and respond to disengagement has the potential to remediate disengagement and thereby improve performance, even in the presence of noise introduced by the automatic detection process

D-Scholarship@Pitt