396 research outputs found

    CLiFF Notes: Research In Natural Language Processing at the University of Pennsylvania

    Get PDF
    CLIFF is the Computational Linguists\u27 Feedback Forum. We are a group of students and faculty who gather once a week to hear a presentation and discuss work currently in progress. The \u27feedback\u27 in the group\u27s name is important: we are interested in sharing ideas, in discussing ongoing research, and in bringing together work done by the students and faculty in Computer Science and other departments. However, there are only so many presentations which we can have in a year. We felt that it would be beneficial to have a report which would have, in one place, short descriptions of the work in Natural Language Processing at the University of Pennsylvania. This report then, is a collection of abstracts from both faculty and graduate students, in Computer Science, Psychology and Linguistics. We want to stress the close ties between these groups, as one of the things that we pride ourselves on here at Penn is the communication among different departments and the inter-departmental work. Rather than try to summarize the varied work currently underway at Penn, we suggest reading the abstracts to see how the students and faculty themselves describe their work. The report illustrates the diversity of interests among the researchers here, as well as explaining the areas of common interest. In addition, since it was our intent to put together a document that would be useful both inside and outside of the university, we hope that this report will explain to everyone some of what we are about

    Sinhala and Tamil : a case of contact-induced restructuring

    Get PDF
    PhD ThesisThe dissertation presents a comparative synchronic study of the morphosyntactic features of modern spoken Sinhala and Tamil, the two main languages of Sri Lanka. The main motivation of the research is that Sinhala and Tamil, two languages of diverse origins—the New Indo-Aryan (NIA) and Dravidian families respectively—share a wide spectrum of morphosyntactic features. Sinhala has long been isolated from the other NIA languages and co-existed with Tamil in Sri Lanka ever since both reached Sri Lanka from India. This coexistence, it is believed, led to what is known as the contact-induced restructuring that Sinhala morphosyntax has undergone on the model of Tamil, while retaining its NIA lexicon. Moreover, as languages of South Asia, the two languages share the areal features of this region. The research seeks to address the following questions: (i) What features do the two languages share and what features do they not share?; (ii) Are the features that they share areal features of the region or those diffused into one another owing to contact?; (iii) If the features that they share are due to contact, has diffusion taken place unidirectionally or bidirectionally?; and (iv) Does contact have any role to play with respect to features that they do not share? The claim that this research intends to substantiate is that Sinhala has undergone morphosyntactic restructuring on the model of Tamil. The research, therefore, attempts to answer another question: (v) Can the morphosyntactic restructuring that Sinhala has undergone be explained in syntactic terms? The morphosyntactic features of the two languages are analyzed at macro- and micro-levels. At the macro-level, a wide range of morphosyntactic features of Tamil and Sinhala, and those of seven other languages of the region are compared with a view to determining the origins of these features and showing the large scale morphosyntactic convergence between Sinhala and Tamil and the divergence between Sinhala and other NIA languages. At the micro-level the dissertation analyzes in detail two morphosyntactic phenomena, namely null arguments and focus constructions. It examines whether subject/verb agreement, which is different across the two languages, plays a role in the licensing of null arguments in each language. It also examines the nature of the changes Sinhala morphosyntax has undergone because of the two kinds of Tamil focus constructions that Sinhala has replicated. It is hoped, that this dissertation will make a significant contribution to the knowledge and understanding of the morphosyntax of the two languages, the effects of language contact on morphosyntax, and more generally, the nature of linguistic variation.Scholarship Programme of the Higher Education for the Twenty First Century (HETC) Project, Ministry of Higher Education, Sri Lanka

    Reflexive constructions in the world's languages

    Get PDF
    Synopsis: This landmark publication brings together 28 papers on reflexive constructions in languages from all continents, representing very diverse language types. While reflexive constructions have been discussed in the past from a variety of angles, this is the first edited volume of its kind. All the chapters are based on original data, and they are broadly comparable through a common terminological framework. The volume opens with two introductory chapters by the editors that set the stage and lay out the main comparative concepts, and it concludes with a chapter presenting generalizations on the basis of the studies of individual languages

    CLiFF Notes: Research in the Language, Information and Computation Laboratory of the University of Pennsylvania

    Get PDF
    One concern of the Computer Graphics Research Lab is in simulating human task behavior and understanding why the visualization of the appearance, capabilities and performance of humans is so challenging. Our research has produced a system, called Jack, for the definition, manipulation, animation and human factors analysis of simulated human figures. Jack permits the envisionment of human motion by interactive specification and simultaneous execution of multiple constraints, and is sensitive to such issues as body shape and size, linkage, and plausible motions. Enhanced control is provided by natural behaviors such as looking, reaching, balancing, lifting, stepping, walking, grasping, and so on. Although intended for highly interactive applications, Jack is a foundation for other research. The very ubiquitousness of other people in our lives poses a tantalizing challenge to the computational modeler: people are at once the most common object around us, and yet the most structurally complex. Their everyday movements are amazingly fluid, yet demanding to reproduce, with actions driven not just mechanically by muscles and bones but also cognitively by beliefs and intentions. Our motor systems manage to learn how to make us move without leaving us the burden or pleasure of knowing how we did it. Likewise we learn how to describe the actions and behaviors of others without consciously struggling with the processes of perception, recognition, and language. Present technology lets us approach human appearance and motion through computer graphics modeling and three dimensional animation, but there is considerable distance to go before purely synthesized figures trick our senses. We seek to build computational models of human like figures which manifest animacy and convincing behavior. Towards this end, we: Create an interactive computer graphics human model; Endow it with reasonable biomechanical properties; Provide it with human like behaviors; Use this simulated figure as an agent to effect changes in its world; Describe and guide its tasks through natural language instructions. There are presently no perfect solutions to any of these problems; ultimately, however, we should be able to give our surrogate human directions that, in conjunction with suitable symbolic reasoning processes, make it appear to behave in a natural, appropriate, and intelligent fashion. Compromises will be essential, due to limits in computation, throughput of display hardware, and demands of real-time interaction, but our algorithms aim to balance the physical device constraints with carefully crafted models, general solutions, and thoughtful organization. The Jack software is built on Silicon Graphics Iris 4D workstations because those systems have 3-D graphics features that greatly aid the process of interacting with highly articulated figures such as the human body. Of course, graphics capabilities themselves do not make a usable system. Our research has therefore focused on software to make the manipulation of a simulated human figure easy for a rather specific user population: human factors design engineers or ergonomics analysts involved in visualizing and assessing human motor performance, fit, reach, view, and other physical tasks in a workplace environment. The software also happens to be quite usable by others, including graduate students and animators. The point, however, is that program design has tried to take into account a wide variety of physical problem oriented tasks, rather than just offer a computer graphics and animation tool for the already computer sophisticated or skilled animator. As an alternative to interactive specification, a simulation system allows a convenient temporal and spatial parallel programming language for behaviors. The Graphics Lab is working with the Natural Language Group to explore the possibility of using natural language instructions, such as those found in assembly or maintenance manuals, to drive the behavior of our animated human agents. (See the CLiFF note entry for the AnimNL group for details.) Even though Jack is under continual development, it has nonetheless already proved to be a substantial computational tool in analyzing human abilities in physical workplaces. It is being applied to actual problems involving space vehicle inhabitants, helicopter pilots, maintenance technicians, foot soldiers, and tractor drivers. This broad range of applications is precisely the target we intended to reach. The general capabilities embedded in Jack attempt to mirror certain aspects of human performance, rather than the specific requirements of the corresponding workplace. We view the Jack system as the basis of a virtual animated agent that can carry out tasks and instructions in a simulated 3D environment. While we have not yet fooled anyone into believing that the Jack figure is real , its behaviors are becoming more reasonable and its repertoire of actions more extensive. When interactive control becomes more labor intensive than natural language instructional control, we will have reached a significant milestone toward an intelligent agent

    Spatial Reference in Rongga (ISO 639-3: ror), Balinese (ISO 639-3: ban), and Indonesian (ISO 639-3: ind)

    Get PDF
    Many scholars have proposed concepts relevant to spatial reference. Herskovits (1982) proposed that the topological concepts support, contiguity and containment are basic in English, while Levinson et al.'s (2003) examination of nine unrelated languages revealed that the concept attachment is primary. Neither of these proposals is confirmed in Rongga, Balinese, and Indonesian. My empirical and experimental investigation of these languages showed that the concept expectedness governs the use of topological prepositions in the languages. Non-topologically, it has long been claimed that a relative frame of reference is universal. This claim is also not confirmed in this study. My non-topological relation study reveals that Rongga and Balinese use a landmark system, while Indonesian practices a relative system. The Balinese landmark system changes to an absolute system when speakers leave the island. In short, this study reveals that previous proposals on the concepts relevant to spatial reference are not universally supported

    CLiFF Notes: Research in the Language Information and Computation Laboratory of The University of Pennsylvania

    Get PDF
    This report takes its name from the Computational Linguistics Feedback Forum (CLIFF), an informal discussion group for students and faculty. However the scope of the research covered in this report is broader than the title might suggest; this is the yearly report of the LINC Lab, the Language, Information and Computation Laboratory of the University of Pennsylvania. It may at first be hard to see the threads that bind together the work presented here, work by faculty, graduate students and postdocs in the Computer Science, Psychology, and Linguistics Departments, and the Institute for Research in Cognitive Science. It includes prototypical Natural Language fields such as: Combinatorial Categorial Grammars, Tree Adjoining Grammars, syntactic parsing and the syntax-semantics interface; but it extends to statistical methods, plan inference, instruction understanding, intonation, causal reasoning, free word order languages, geometric reasoning, medical informatics, connectionism, and language acquisition. With 48 individual contributors and six projects represented, this is the largest LINC Lab collection to date, and the most diverse

    A grammar of the Pendau language of central Sulawesi, Indonesia

    Get PDF
    • …
    corecore