Search CORE

11 research outputs found

An office document retrieval system with the capability of processing incomplete and vague queries

Author: Liu Qianhong
Publication venue: Digital Commons @ NJIT
Publication date: 31/10/1994
Field of study

TEXPROS (TEXt PROcessing System) is an intelligent document processing system. The system is a combination of filing and retrieval systems, which supports storing, classifying, categorizing, retrieving and reproducing documents, as well as extracting, browsing, retrieving and synthesizing information from a variety of documents. This dissertation presents a retrieval system for TEXPROS, which is capable of processing incomplete or vague queries and providing semantically meaningful responses to the users. The design of the retrieval system is highly integrated with various mechanisms for achieving these goals. First, a system catalog including a thesaurus is used to store the knowledge about the database. Secondly, there is a query transformation mechanism which consists of context construction and algebraic query formulation modules. Given an incomplete query, the context construction module searches the system for the required terms and constructs a query that has a complete representation. The resulting query is then formulated into an algebraic query. Thirdly, in practice, the user may not have a precise notion of what he is looking for. A browsing mechanism is employed for such situations to assist the user in the retrieval process. With the browser, vague queries can be entered into the system until sufficient information is obtained to the extent that the user is able to construct a query for his request. Finally, when processing of queries responds with an empty answer to the user, a query generalization mechanism is used to give the user a cooperative explanation for the empty answer. The generalizations of any given failed queries (i.e., with an empty answer) are derived by applying both the folder and type substitutions and weakening the search criteria in the original query. An efficient way is investigated for determining whether the empty answer is genuine and whether the original query reflects erroneous presuppositions, and therefore answering any failed query with a meaningful and cooperative response. It incorporates with a methodical approach to reducing the search space of generalized subqueries by analyzing the results of executing the query generalization and by efficiently applying the possible substitutions in a query to generate a small subset of relevant subqueries which are to be evaluated

Digital Commons @ New Jersey Institute of Technology (NJIT)

Recommended from our members

Democratizing Web Automation: Programming for Social Scientists and Other Domain Experts

Author: Chasins Sarah Elizabeth
Publication venue: eScholarship, University of California
Publication date: 01/01/2019
Field of study

We have promised social scientists a data revolution, but it has not arrived. What stands between practitioners and the data-driven insights they want? Acquiring the data. In particular, acquiring the social media, online forum, and other web data that was supposed to help them produce big, rich, ecologically valid datasets. Web automation programming is resistant to high-level abstractions, so end-user programmers end up stymied by the need to reverse engineer website internals—DOM, JavaScript, AJAX. Programming by Demonstration (PBD) offered one promising avenue towards democratizing web automation. Unfortunately, as the web matured, the programs became too complex for PBD tools to synthesize, and web PBD progress stalled.This dissertation describes how I reformulated traditional web PBD around the insight that demonstrations are not always the easiest way for non-programmers to communicate their intent. By shifting from a purely Programming-By-Demonstration view to a Programming-By-X view that accepts a variety of user-friendly inputs, we can dramatically broaden the class of programs that come in reach for end-user programmers. Our Helena ecosystem combines (i) usable PBD-based program drafting tools, (ii) learnable programming languages, and (iii) novel programming environment interactions. The end result: non-coders write Helena programs in 10 minutes that can handle the complexity of modern webpages, while coders attempt the same task and time out in an hour. I conclude with a discussion of the abstraction-resistant domains that will fall next and how hybrid PL-HCI breakthroughs will vastly expand access to programming

eScholarship - University of California

On document filing based upon predicates

Author: Zhu Zhijian
Publication venue: Digital Commons @ NJIT
Publication date: 31/05/1997
Field of study

This dissertation presents a formal approach to modeling documents in a personal office environment, proposes a heterogeneous algebraic query language to manipulating objects (folders) in the document model, and investigates a predicate-driven document filing system for automatically filing documents. The document model was initially proposed in [38] which adopts a very natural view for describing the office documents using the relational and object-oriented paradigms. The model employs a dual approach to classifying and categorizing office documents by defining both a document type hierarchy and a folder organization. This dissertation extends and specifies formally the document model. Documents are partitioned into different classes, each document class being represented by frame template which describes the properties of the documents of the class. A particular office document, summarized from the view point of its frame template, yields a synopsis of the document which is called frame instances. Frame instances are grouped into a folder on the basis of user-defined criteria, specified as predicates, which determine whether a frame instance belongs to a folder. Folders, each of which is a heterogeneous set of frame instances, can be naturally organized into a folder organization. The folder organization specifying the document filing view is then defined using predicates and a directed graph. However, some operators in the algebraic query language [38] do not support the heterogeneous property. This dissertation proposes an algebra-based query language that gives full support to this heterogeneous property. We investigate the construction problem of a folder organization: does it allow a user to add a new folder with an arbitrary local predicate? Given a folder organization, creating a new folder with arbitrarily defined predicate may cause two abnormalities: inapplicable edges (filing paths) and redundant folders. To deal such abnormalities in the process of constructing a folder organization, the concept of predicate consistency is discussed and an algorithm is proposed for determining whether the predicate of a new folder is consistent with the existing folder organization. The global predicate of a folder governs the content of the folder. However, the predicates of folders (that is, global predicates) do not uniquely specify a folder organization. Then, we investigate the reconstruction problem: under what circumstance can we uniquely recover the folder organization from its global predicates? The problem is solved in terms of graph-theoretic concepts such as associated digraphs, transitive closure, and redundant/non-redundant filing paths. A transitive closure inversion algorithm is then presented which efficiently recovers a folder organization digraph from its associated digraph. After defining a folder organization, we can file a frame instance into the folder organization. A document filing algorithm describes the procedure of filing a frame instance. However, the critical issue of the algorithm is how to evaluate whether a frame instance satisfies the predicate of a folder in a folder organization. In order to solve this issue, a thesaurus, an association dictionary and a knowledge base are then introduced. The thesaurus specifies the association relationship among the key terms that are actually residing in the system and terms that are used by users. An association dictionary gives the association relationship between an attribute of a predicate and a frame template defined in a folder organization. A knowledge base represents background knowledge in a certain application domain

Digital Commons @ New Jersey Institute of Technology (NJIT)

Knowledge-based document filing for texpros

Author: Fan Xien
Publication venue: Digital Commons @ NJIT
Publication date: 31/05/1998
Field of study

This dissertation presents a knowledge-based document filing system for TEXPROS. The requirements of a. personal document processing system are investigated. In order for the system to be used in various application domains, a flexible, dynamic modeling approach is employed by getting the user involved in document modeling. The office documents are described using a dual-model which consists of a document type hierarchy and a folder organization. The document type hierarchy is used to capture the layout, logical and conceptual structures of documents. The folder organization, which is defined by the user, emulates the real world structure for organizing and storing documents in an office environment. The document filing and retrieval are predicate-driven. The user can specify filing criteria and queries in terms of predicates. The predicate specification and folder organization specification are described. It is shown that the new specifications can prevent false drops which happen in the previous approach. The dual models are incorporated by a three-level storage architecture. This storage architecture supports efficient document and information retrieval by limiting the searches to those frame instances of a document type within those folders which appear to be the most similar to the corresponding queries, Specifically, a. three-level retrieval strategy is used in document and information retrieval. Firstly, a knowledge-based query preprocess is applied for efficiently reducing the search space to a small set of frame instances, using the information in the query formula. Secondly, the knowledge and content-based retrieval on the small set of frame instances is applied. Finally, the third level storage provides a platform for adopting potential content-based multimedia document retrieval techniques. A knowledge-based predicate evaluation engine is described for automating document filing. The dissertation presents a knowledge representation model. The knowledge base is dynamicly created by a learning agent, which demonstrates that the notion of flexible and dynamic modeling is applicable. The folder organization is implemented using an agent-based architecture. Each folder is monitored by a filing agent. The basic operations for constructing and reorganizing a folder organization are defined. The dissertation also discusses the cooperation among the filing agents, which is needed for implementing the folder organization

Digital Commons @ New Jersey Institute of Technology (NJIT)

Planning Responses From High-Level Goals: Adopting the Respondent\u27s Perspective Cooperative Response Generation

Author: Cheikes Brant
Publication venue: ScholarlyCommons
Publication date: 27/01/1992
Field of study

Within the natural-language research community it has long been acknowledged that the conventions and pragmatics of natural-language communication often oblige dialogue systems to consider and address the underlying purposes of queries in their responses rather than answering them literally and without further comment or elaboration. Such systems cannot simply translate their users\u27 requests into transactions on database or expert systems, but must apply many more complex reasoning mechanisms to the task of selecting responses that are both appropriate and useful. This idea has given rise to a broadly-defined program of research in cooperative response generation (CRG). Research in CRG carried on over more than a decade has yielded a substantial body of literature. Analysis of that literature, however, shows that investigators have focused primarily on modeling manifestations of cooperative behavior without directly considering the nature and motivations of the behavior itself. But if we want to develop natural language dialogue systems that are truly to function as cooperative respondents instead of serving only as models of particular kinds of cooperative responses, a different approach is required. I identify two opposing perspectives on the process of cooperative response generation: the questioner-based and the respondent-based perspectives. I argue that past research efforts have largely been questioner-based, and that this view has led to the development of theories that are incompatible and cannot be integrated. I propose the respondent-based view as an alternative, and provide evidence that taking such a perspective might allow several interesting but otherwise poorly-understood aspects of cooperative response behavior to be modeled. The final portion of the dissertation explores the computational implications of a respondent-based perspective. I outline the architecture of a Cooperative Response Planning System, a dialogue system that raises, reasons about, and attempts to satisfy high-level cooperative goals in its responses. This architecture constitutes a first approximation to a theory of how a system might reason from the beliefs it derives from a questioner\u27s utterances to choose a cooperative response. The processing of two sample responses in this framework is described in detail to illustrate the architecture\u27s capabilities

ScholarlyCommons@Penn

When Psychologists Were Naturalists: Questionnaires and Collecting Practices in Early American Psychology, 1880 - 1932

Author: Young Jacy Lee
Publication venue
Publication date: 28/08/2015
Field of study

This dissertation reshapes our understanding of the earliest years of American psychology by documenting the discipline’s methodological plurality from its very inception. In tracing the use of questionnaires over the first half century of the discipline’s existence as a science, I argue that a natural historical orientation, wherein collection, analysis, and categorization are central to the scientific enterprise, has been a persistent facet of the field. Manifested in a recurrent interest in collecting information on mental life, this natural historical perspective facilitated a moral economy of data, wherein the discipline’s affect-laden norms and values sanctified the objects and practices of mass data collection. This in turn lent itself to the adoption of statistical analyses as a central component of psychological science. Although, at first glance, falling outside of the bounds of the mechanically objective practices that characterized the new psychology’s laboratory endeavours, with their use of standardized instrumentation, projects with this orientation adhered to this form of objectivity in their own way. Seeking precise accounts of mental life, including information on its physical correlates, these enterprises engaged the public in collection practices in the field. Taking up subjects with widespread interest outside of purely scientific spheres – including child study, psychical matters, and dreaming – questionnaire projects had broad appeal. Undertakings with less popular allure deliberately and necessarily confined themselves to more restricted university populations. Issues of social relevance remained mainstays of this kind of research, but by the 1920s the public’s relation to questionnaire research shifted so that they were no longer active participants in collecting activities. Instead, questionnaires were circulated in more restricted circumstances and their findings served as the basis for broad claims about the state of the public’s mind. To do so effectively, I argue, practices of collecting with questionnaires shifted from thick to thin description; no longer were rich descriptive accounts of mental life the aim of these endeavours. Rather, increasingly restricted ranges of information were accumulated, a process that culminated in the development of numerical Likert scales and the use of more sophisticated statistical analyses. Scales of this kind continue to dominate questionnaire research today

YorkSpace

Computing point-of-view : modeling and simulating judgments of taste

Author: Pattie Maes
Xinyu Hugo Liu
Xinyu Hugo Liu
Publication venue: Massachusetts Institute of Technology
Publication date: 01/01/2006
Field of study

Thesis (Ph. D.)--Massachusetts Institute of Technology, School of Architecture and Planning, Program in Media Arts and Sciences, 2006.Includes bibliographical references (p. 153-163).People have rich points-of-view that afford them the ability to judge the aesthetics of people, things, and everyday happenstance; yet viewpoint has an ineffable quality that is hard to articulate in words, let alone capture in computer models. Inspired by cultural theories of taste and identity, this thesis explores end-to-end computational modeling of people's tastes-from model acquisition, to generalization, to application- under various realms. Five aesthetical realms are considered-cultural taste, attitudes, ways of perceiving, taste for food, and sense-of-humor. A person's model is acquired by reading her personal texts, such as a weblog diary, a social network profile, or emails. To generalize a person model, methods such as spreading activation, analogy, and imprimer supplementation are applied to semantic resources and search spaces mined from cultural corpora. Once a generalized model is achieved, a person's tastes are brought to life through perspective-based applications, which afford the exploration of someone else's perspective through interactivity and play. The thesis describes model acquisition systems implemented for each of the five aesthetical realms.(cont.) The techniques of 'reading for affective themes' (RATE), and 'culture mining' are described, along with their enabling technologies, which are commonsense reasoning and textual affect analysis. Finally, six perspective-based applications were implemented to illuminate a range of real-world beneficiaries to person modeling-virtual mentoring, self-reflection, and deep customization.by Xinyu Hugo Liu.Ph.D

CiteSeerX

DSpace@MIT

The use of group psychotherapy in the professional training of ministers

Author: Boyd Richard White
Publication venue: Boston University
Publication date: 01/01/1952
Field of study

Thesis (Ph.D.)--Boston UniversityBackground and problem of the study. Group psychotherapy, first used as a treatment for the mentally-ill, has recently proved its value in the training of professional groups, notably clinical psychologists, nurses, and psychiatric social workers. The first attempt to use this technique to train theological students and ministers, so far as we know, is t he course, "Group Therapy," which is offered at the Boston Psychopathic Hospital. This six-weeks course offers 27 hours of group psychotherapy, supplemented by 27 eight-hour days in which the students are volunteers in the hospital [TRUNCATED

Boston University Institutional Repository (OpenBU)

A Csongrád megyei országgyűlési választókerületek geoinformatikai elemzése a 2014-es eredmények alapján

Author: Kovalcsik Tamás
Mucsi László
Vida György
Publication venue: Debreceni Egyetemi Kiadó
Publication date: 01/01/2016
Field of study

SZTE Publicatio Repozitórium - SZTE - Repository of Publications

Studies in the linguistic sciences. 17-18 (1987-1988)

Author
Publication venue: Urbana, Ill. : Dept. of Linguistics, University of Illinois,
Publication date
Field of study

Illinois Digital Environment for Access to Learning and Scholarship Repository