8 research outputs found
Exploiting E-mail structure to improve summarization
Thesis (M.Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2002.Includes bibliographical references (p. 77-81).This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.For this thesis, I designed and implemented a system to summarize e-mail messages. The system exploits two aspects of e-mail, thread reply chains and commonly-found features, to generate summaries. The system uses existing software designed to summarize single text documents. Such software typically performs best on well-authored, formal documents. E-mail messages, however, are typically neither well-authored, nor formal. As a result, existing summarization software typically gives a poor summary of e-mail messages. To remedy this poor performance, the system's approach preprocesses e-mail messages to synthesize new input to this software, so that it will output more useful summaries of e-mail. This pre-processing involves a lightweight, heuristics-based approach to filtering e-mail to remove e-mail signatures, header fields, and quoted parent messages. I also present a heuristics-based approach to identifying and reporting names, dates, and companies found in e-mail messages. Lastly, I discuss conclusions from a pilot user study of my summarization system, and conclude with areas for further investigation.by Derek Scott Lam.M.Eng
Research in the Language, Information and Computation Laboratory of the University of Pennsylvania
This report takes its name from the Computational Linguistics Feedback Forum (CLiFF), an informal discussion group for students and faculty. However the scope of the research covered in this report is broader than the title might suggest; this is the yearly report of the LINC Lab, the Language, Information and Computation Laboratory of the University of Pennsylvania.
It may at first be hard to see the threads that bind together the work presented here, work by faculty, graduate students and postdocs in the Computer Science and Linguistics Departments, and the Institute for Research in Cognitive Science. It includes prototypical Natural Language fields such as: Combinatorial Categorial Grammars, Tree Adjoining Grammars, syntactic parsing and the syntax-semantics interface; but it extends to statistical methods, plan inference, instruction understanding, intonation, causal reasoning, free word order languages, geometric reasoning, medical informatics, connectionism, and language acquisition.
Naturally, this introduction cannot spell out all the connections between these abstracts; we invite you to explore them on your own. In fact, with this issue itâs easier than ever to do so: this document is accessible on the âinformation superhighwayâ. Just call up http://www.cis.upenn.edu/~cliff-group/94/cliffnotes.html
In addition, you can find many of the papers referenced in the CLiFF Notes on the net. Most can be obtained by following links from the authorsâ abstracts in the web version of this report.
The abstracts describe the researchersâ many areas of investigation, explain their shared concerns, and present some interesting work in Cognitive Science. We hope its new online format makes the CLiFF Notes a more useful and interesting guide to Computational Linguistics activity at Penn
Recommended from our members
Cultural critique and canon formation, 1910-1937. A study in modernism and cultural memory
This thesis argues that one of the tasks of literary history is to identify and challenge the processes by which writers who were once highly valued come to be forgotten and excluded from the canon. I investigate the work and cultural milieu of three such writers: Douglas Goldring, John Rodker and Mary Butts. The first chapter sets the terms of the argument, and presents the grounds for a reconfiguration of the conventional historical view of modernism. The second examines the early work of Douglas Goldring: his achievements as editor of The Tramp are related to its cultural and historical situation; I then turn to the history of conscientious objection in the First World War in order to explore the politics of his 1917 novel The Fortune, and to provide historical material necessary for the later reading of Rodker's Memoirs of Other Fronts. This leads on to a discussion of some of his subsequent political novels and plays. In the third and fourth
chapters, I analyse the work of John Rodker, from his adolescence in the East End of London to his maturity, first in relation to modernist dance and theatrical experiments in London during the first war, and later to avant-garde writing in England and France in the 1920s, particularly as it draws upon psycho-analysis. The fifth chapter examines the novels of Mary Butts, particularly Ashe of Rings, which is read as a war-novel, but one which makes constructive use of her interest in the occult. What this category meant during that period is also investigated, which allows me to formulate a broader argument that situates her work within a tradition that takes fantasy seriously, while remaining critical of the conceptual framework of psycho-analysis. I follow this up by showing the later importance of unconscious anti-semitism to her capacity to elaborate an ecological nationalism. The final chapter examines anti-semitism and satire in the
relationship between Rodker and Wyndham Lewis, and offers a further explanatory justification for the argument of the thesis
Head-Driven Phrase Structure Grammar
Head-Driven Phrase Structure Grammar (HPSG) is a constraint-based or declarative approach to linguistic knowledge, which analyses all descriptive levels (phonology, morphology, syntax, semantics, pragmatics) with feature value pairs, structure sharing, and relational constraints. In syntax it assumes that expressions have a single relatively simple constituent structure. This volume provides a state-of-the-art introduction to the framework. Various chapters discuss basic assumptions and formal foundations, describe the evolution of the framework, and go into the details of the main syntactic phenomena. Further chapters are devoted to non-syntactic levels of description. The book also considers related fields and research areas (gesture, sign languages, computational linguistics) and includes chapters comparing HPSG with other frameworks (Lexical Functional Grammar, Categorial Grammar, Construction Grammar, Dependency Grammar, and Minimalism)
Head-Driven Phrase Structure Grammar
Head-Driven Phrase Structure Grammar (HPSG) is a constraint-based or declarative approach to linguistic knowledge, which analyses all descriptive levels (phonology, morphology, syntax, semantics, pragmatics) with feature value pairs, structure sharing, and relational constraints. In syntax it assumes that expressions have a single relatively simple constituent structure. This volume provides a state-of-the-art introduction to the framework. Various chapters discuss basic assumptions and formal foundations, describe the evolution of the framework, and go into the details of the main syntactic phenomena. Further chapters are devoted to non-syntactic levels of description. The book also considers related fields and research areas (gesture, sign languages, computational linguistics) and includes chapters comparing HPSG with other frameworks (Lexical Functional Grammar, Categorial Grammar, Construction Grammar, Dependency Grammar, and Minimalism)
The phrasal implicature theory of metaphors and slurs
This thesis develops a pragmatic theory of metaphors and slurs. In the pragmatic
literature, theorists mostly hold the view that the framework developed by Grice is
only applicable to the sentence-level pragmatic phenomena, whereas the subsentential
pragmatic phenomena require a different approach. In this thesis, I argue against
this view and claim that the Gricean framework, after some plausible revisions, can
explain subsentential pragmatic phenomena, such as metaphors and slurs.
In the first chapter, I introduce three basic theses I will defend and give an
outline of the argument I will develop. The second chapter discusses three claims
on metaphor that are widely discussed in the literature. There I state my aim to
present a theory of metaphor which can accommodate these three claims. Chapter 3
introduces the notion of "phrasal implicature", which will be used to explain phrase-level
pragmatic phenomena with a Gricean approach. In Chapter 4, I present my
theory of metaphor, which I call "phrasal implicature theory of metaphor" and
discuss certain aspects of the theory. The notion of phrasal implicature enables a
new conception of what-is-said and a different approach to the semantics-pragmatics
distinction. Chapter 5 looks into these issues. In Chapter 6, I compare my theory
of metaphor with three other theories. Finally, in Chapter 7, I develop a phrasal
implicature theory of slurs, which I argue outperforms its rivals in explaining various
uses of slurs