11,693 research outputs found

    United we fall, divided we stand: A study of query segmentation and PRF for patent prior art search

    Get PDF
    Previous research in patent search has shown that reducing queries by extracting a few key terms is ineffective primarily because of the vocabulary mismatch between patent applications used as queries and existing patent documents. This finding has led to the use of full patent applications as queries in patent prior art search. In addition, standard information retrieval (IR) techniques such as query expansion (QE) do not work effectively with patent queries, principally because of the presence of noise terms in the massive queries. In this study, we take a new approach to QE for patent search. Text segmentation is used to decompose a patent query into selfcoherent sub-topic blocks. Each of these much shorted sub-topic blocks which is representative of a specific aspect or facet of the invention, is then used as a query to retrieve documents. Documents retrieved using the different resulting sub-queries or query streams are interleaved to construct a final ranked list. This technique can exploit the potential benefit of QE since the segmented queries are generally more focused and less ambiguous than the full patent query. Experiments on the CLEF-2010 IP prior-art search task show that the proposed method outperforms the retrieval effectiveness achieved when using a single full patent application text as the query, and also demonstrates the potential benefits of QE to alleviate the vocabulary mismatch problem in patent search

    Initial specification of the evaluation tasks "Use cases to bridge validation and benchmarking" PROMISE Deliverable 2.1

    Get PDF
    Evaluation of multimedia and multilingual information access systems needs to be performed from a usage oriented perspective. This document outlines use cases from the three use case domains of the PROMISE project and gives some initial pointers to how their respective characteristics can be extrapolated to determine and guide evaluation activities, both with respect to benchmarking and to validation of the usage hypotheses. The use cases will be developed further during the course of the evaluation activities and workshops projected to occur in coming CLEF conferences

    Toward higher effectiveness for recall-oriented information retrieval: A patent retrieval case study

    Get PDF
    Research in information retrieval (IR) has largely been directed towards tasks requiring high precision. Recently, other IR applications which can be described as recall-oriented IR tasks have received increased attention in the IR research domain. Prominent among these IR applications are patent search and legal search, where users are typically ready to check hundreds or possibly thousands of documents in order to find any possible relevant document. The main concerns in this kind of application are very different from those in standard precision-oriented IR tasks, where users tend to be focused on finding an answer to their information need that can typically be addressed by one or two relevant documents. For precision-oriented tasks, mean average precision continues to be used as the primary evaluation metric for almost all IR applications. For recall-oriented IR applications the nature of the search task, including objectives, users, queries, and document collections, is different from that of standard precision-oriented search tasks. In this research study, two dimensions in IR are explored for the recall-oriented patent search task. The study includes IR system evaluation and multilingual IR for patent search. In each of these dimensions, current IR techniques are studied and novel techniques developed especially for this kind of recall-oriented IR application are proposed and investigated experimentally in the context of patent retrieval. The techniques developed in this thesis provide a significant contribution toward evaluating the effectiveness of recall-oriented IR in general and particularly patent search, and improving the efficiency of multilingual search for this kind of task

    Special Libraries, February 1978

    Get PDF
    Volume 69, Issue 2https://scholarworks.sjsu.edu/sla_sl_1978/1001/thumbnail.jp

    Intellectual property rights in a knowledge-based economy

    Get PDF
    Intellectual property rights (IPR) have been created as economic mechanisms to facilitate ongoing innovation by granting inventors a temporary monopoly in return for disclosure of technical know-how. Since the beginning of 1980s, IPR have come under scrutiny as new technological paradigms appeared with the emergence of knowledge-based industries. Knowledge-based products are intangible, non-excludable and non-rivalrous goods. Consequently, it is difficult for their creators to control their dissemination and use. In particular, many information goods are based on network externalities and on the creation of market standards. At the same time, information technologies are generic in the sense of being useful in many places in the economy. Hence, policy makers often define current IPR regimes in the context of new technologies as both over- and under-protective. They are over-protective in the sense that they prevent the dissemination of information which has a very high social value; they are under-protective in the sense that they do not provide strong control over the appropriation of rents from their invention and thus may not provide strong incentives to innovate. During the 1980s, attempts to assess the role of IPR in the process of technological learning have found that even though firms in high-tech sectors do use patents as part of their strategy for intellectual property protection, the reliance of these sectors on patents as an information source for innovation is lower than in traditional industries. Intellectual property rights are based mainly on patents for technical inventions and on copyrights for artistic works. Patents are granted only if inventions display minimal levels of utility, novelty and non-obviousness of technical know-how. By contrast, copyrights protect only final works and their derivatives, but guarantee protection for longer periods, according to the Berne Convention. Licensing is a legal aid that allows the use of patented technology by other firms, in return for royalty fees paid to the inventor. Licensing can be contracted on an exclusive or non-exclusive basis, but in most countries patented knowledge can be exclusively held by its inventors, as legal provisions for compulsory licensing of technologies do not exist. The fair use doctrine aims to prevent formation of perfect monopolies over technological fields and copyrighted artefacts as a result of IPR application. Hence, the use of patented and copyrighted works is permissible in academic research, education and the development of technologies that are complimentary to core technologies. Trade secrecy is meant to prevent inadvertent technology transfer to rival firms and is based on contracts between companies and employees. However, as trade secrets prohibit transfer of knowledge within industries, regulators have attempted to foster disclosure of technical know-how by institutional means of patents, copyrights and sui-generis laws. And indeed, following the provisions formed by IPR regulation, firms have shifted from methods of trade secrecy towards patenting strategies to achieve improved protection of intellectual property, as well as means to acquire competitive advantages in the market by monopolization of technological advances.economics of technology ;

    Encyclopedia of software components

    Get PDF
    Intelligent browsing through a collection of reusable software components is facilitated with a computer having a video monitor and a user input interface such as a keyboard or a mouse for transmitting user selections, by presenting a picture of encyclopedia volumes with respective visible labels referring to types of software, in accordance with a metaphor in which each volume includes a page having a list of general topics under the software type of the volume and pages having lists of software components for each one of the generic topics, altering the picture to open one of the volumes in response to an initial user selection specifying the one volume to display on the monitor a picture of the page thereof having the list of general topics and altering the picture to display the page thereof having a list of software components under one of the general topics in response to a next user selection specifying the one general topic, and then presenting a picture of a set of different informative plates depicting different types of information about one of the software components in response to a further user selection specifying the one component

    Methods and systems for searching databases and displaying search results

    Get PDF
    In one aspect, the invention comprises a computer system comprising databases storing information regarding publications, the information comprising author, title, date of publication, abstract, cited references, and citing references data; and Internet servers in communication with the databases; wherein at least one of the Internet servers is in communication with and operable to transmit data to a Web browser application resident on a user's computer, and wherein the data is sufficient to enable the browser to display: (a) a search page for enabling the user to input publication search parameters; and (b) a results page comprising a tabular display of results that enables the user to sort results according to date, relevance, author, source title, and number of citations to each publication, and further comprising a list of publication titles, with each title having one or more adjacent selectable links to an aspect of the publication corresponding to the title

    Query refinement for patent prior art search

    Get PDF
    A patent is a contract between the inventor and the state, granting a limited time period to the inventor to exploit his invention. In exchange, the inventor must put a detailed description of his invention in the public domain. Patents can encourage innovation and economic growth but at the time of economic crisis patents can hamper such growth. The long duration of the application process is a big obstacle that needs to be addressed to maximize the benefit of patents on innovation and economy. This time can be significantly improved by changing the way we search the patent and non-patent literature.Despite the recent advancement of general information retrieval and the revolution of Web Search engines, there is still a huge gap between the emerging technologies from the research labs and adapted by major Internet search engines, and the systems which are in use by the patent search communities.In this thesis we investigate the problem of patent prior art search in patent retrieval with the goal of finding documents which describe the idea of a query patent. A query patent is a full patent application composed of hundreds of terms which does not represent a single focused information need. Other relevance evidences (e.g. classification tags, and bibliographical data) provide additional details about the underlying information need of the query patent. The first goal of this thesis is to estimate a uni-gram query model from the textual fields of a query patent. We then improve the initial query representation using noun phrases extracted from the query patent. We show that expansion in a query-dependent manner is useful.The second contribution of this thesis is to address the term mismatch problem from a query formulation point of view by integrating multiple relevance evidences associated with the query patent. To do this, we enhance the initial representation of the query with the term distribution of the community of inventors related to the topic of the query patent. We then build a lexicon using classification tags and show that query expansion using this lexicon and considering proximity information (between query and expansion terms) can improve the retrieval performance. We perform an empirical evaluation of our proposed models on two patent datasets. The experimental results show that our proposed models can achieve significantly better results than the baseline and other enhanced models

    Special Libraries, July 1978

    Get PDF
    Volume 69, Issue 7https://scholarworks.sjsu.edu/sla_sl_1978/1005/thumbnail.jp
    corecore