1,259 research outputs found
Cross-domain citation recommendation based on hybrid topic model and co-citation selection citation selection
Cross-domain recommendations are of growing importance in the research community. An application of particular interest is to recommend a set of relevant research papers as citations for a given patent. This paper proposes an approach for cross-domain citation recommendation based on the hybrid topic model and co-citation selection. Using the topic model, relevant terms from documents could be clustered into the same topics. In addition, the co-citation selection technique will help select citations based on a set of highly similar patents. To evaluate the performance, we compared our proposed approach with the traditional baseline approaches using a corpus of patents collected for different technological fields of biotechnology, environmental technology, medical technology and nanotechnology. Experimental results show our cross domain citation recommendation yields a higher performance in predicting relevant publication citations than all baseline approaches
Reasonable Certainty & Corpus Linguistics: Judging Definiteness after Nautilus & Teva
In Nautilus (2014), the Supreme Court held “that a patent is invalid for indefiniteness if its claims...fail to inform, with reasonable certainty, those skilled in the art about the scope of the invention.” We don’t require perfect clarity because, as Festo (2002) highlights, patentees can’t achieve it. We don’t launch a post hoc judicial salvage operation to rescue slipshod text because, as the functional-claiming cases from the 1930s and 1940s highlight, others can’t adequately plan around it. Reasonably certain notice, then, is just right: § 112 “require[s] that a patent’s claims, viewed in light of the specification and prosecution history, inform those skilled in the art about the scope of the invention with reasonable certainty.” Nautilus. How then, should the Patent Office or the courts determine whether a given bit of disputed claim language is reasonably certain, for the protection of patentee and public alike? Reasonableness — a prudent point of balance between extremes — depends on context. And in Teva (2015), the Court confirmed that claim construction disputes, including claim definiteness, can turn on factual findings about a term’s meaning to artisans at a particular moment in time. This paper explores the contours of reasonably certain notice, using insights from negligence law’s reasonable care standard and procedural due process law’s reasonable notice standard. The upshot: To fairly judge the reasonableness of the notice that patent claim language provides, we must embed the claim language in more robust objective data about language usage than we currently consider — data about typical usage in the relevant art at the time the inventor applied for patent rights. Happily, we can adapt existing tools from corpus linguistics to analyze usage patterns, if we construct text corpora that are fit for purpose. The statutory demand for reasonable certainty, both at the Patent Office and in the courts, should prompt us to put these linguistics tools to work in patent law
Evaluating Information Retrieval and Access Tasks
This open access book summarizes the first two decades of the NII Testbeds and Community for Information access Research (NTCIR). NTCIR is a series of evaluation forums run by a global team of researchers and hosted by the National Institute of Informatics (NII), Japan. The book is unique in that it discusses not just what was done at NTCIR, but also how it was done and the impact it has achieved. For example, in some chapters the reader sees the early seeds of what eventually grew to be the search engines that provide access to content on the World Wide Web, today’s smartphones that can tailor what they show to the needs of their owners, and the smart speakers that enrich our lives at home and on the move. We also get glimpses into how new search engines can be built for mathematical formulae, or for the digital record of a lived human life. Key to the success of the NTCIR endeavor was early recognition that information access research is an empirical discipline and that evaluation therefore lay at the core of the enterprise. Evaluation is thus at the heart of each chapter in this book. They show, for example, how the recognition that some documents are more important than others has shaped thinking about evaluation design. The thirty-three contributors to this volume speak for the many hundreds of researchers from dozens of countries around the world who together shaped NTCIR as organizers and participants. This book is suitable for researchers, practitioners, and students—anyone who wants to learn about past and present evaluation efforts in information retrieval, information access, and natural language processing, as well as those who want to participate in an evaluation task or even to design and organize one
The analysis and presentation of patents to support engineering design
This paper explores the role of patents in engineering design, and how the extraction and presentation of patent data could be improved for designers. We propose the use of crowdsourcing as a means to post tasks online for a crowd of people to participate and complete. The is-sues of assessment, searching, clustering and knowledge transfer are evaluated with respect to the literature. Opportunities for potential crowd intervention are then discussed, before the presentation of two initial studies. These related to the categorization and interpretation of patents respectively using an online platform. The initial results establish basic crowd capabilities in understanding patent text and interpreting patent drawings. This has shown that reasonable results can be achieved if tasks of appropriate duration and complexity are set, and if test questions are incorporated to ensure a basic level of understanding exists in the workers
ExpFinder: An Ensemble Expert Finding Model Integrating -gram Vector Space Model and CO-HITS
Finding an expert plays a crucial role in driving successful collaborations
and speeding up high-quality research development and innovations. However, the
rapid growth of scientific publications and digital expertise data makes
identifying the right experts a challenging problem. Existing approaches for
finding experts given a topic can be categorised into information retrieval
techniques based on vector space models, document language models, and
graph-based models. In this paper, we propose , a new
ensemble model for expert finding, that integrates a novel -gram vector
space model, denoted as VSM, and a graph-based model, denoted as
\textit{\muCO-HITS}, that is a proposed variation of the CO-HITS algorithm.
The key of VSM is to exploit recent inverse document frequency weighting
method for -gram words and incorporates VSM into
\textit{\muCO-HITS} to achieve expert finding. We comprehensively evaluate
on four different datasets from the academic domains in
comparison with six different expert finding models. The evaluation results
show that is a highly effective model for expert finding,
substantially outperforming all the compared models in 19% to 160.2%.Comment: 15 pages, 18 figures, "for source code on Github, see
https://github.com/Yongbinkang/ExpFinder", "Submitted to IEEE Transactions on
Knowledge and Data Engineering
Query refinement for patent prior art search
A patent is a contract between the inventor and the state, granting a limited time period to the inventor to exploit his invention. In exchange, the inventor must put a detailed description of his invention in the public domain. Patents can encourage innovation and economic growth but at the time of economic crisis patents can hamper such growth. The long duration of the application process is a big obstacle that needs to be addressed to maximize the benefit of patents on innovation and economy. This time can be significantly improved by changing the way we search the patent and non-patent literature.Despite the recent advancement of general information retrieval and the revolution of Web Search engines, there is still a huge gap between the emerging technologies from the research labs and adapted by major Internet search engines, and the systems which are in use by the patent search communities.In this thesis we investigate the problem of patent prior art search in patent retrieval with the goal of finding documents which describe the idea of a query patent. A query patent is a full patent application composed of hundreds of terms which does not represent a single focused information need. Other relevance evidences (e.g. classification tags, and bibliographical data) provide additional details about the underlying information need of the query patent. The first goal of this thesis is to estimate a uni-gram query model from the textual fields of a query patent. We then improve the initial query representation using noun phrases extracted from the query patent. We show that expansion in a query-dependent manner is useful.The second contribution of this thesis is to address the term mismatch problem from a query formulation point of view by integrating multiple relevance evidences associated with the query patent. To do this, we enhance the initial representation of the query with the term distribution of the community of inventors related to the topic of the query patent. We then build a lexicon using classification tags and show that query expansion using this lexicon and considering proximity information (between query and expansion terms) can improve the retrieval performance. We perform an empirical evaluation of our proposed models on two patent datasets. The experimental results show that our proposed models can achieve significantly better results than the baseline and other enhanced models
Automating the search for a patent's prior art with a full text similarity search
More than ever, technical inventions are the symbol of our society's advance.
Patents guarantee their creators protection against infringement. For an
invention being patentable, its novelty and inventiveness have to be assessed.
Therefore, a search for published work that describes similar inventions to a
given patent application needs to be performed. Currently, this so-called
search for prior art is executed with semi-automatically composed keyword
queries, which is not only time consuming, but also prone to errors. In
particular, errors may systematically arise by the fact that different keywords
for the same technical concepts may exist across disciplines. In this paper, a
novel approach is proposed, where the full text of a given patent application
is compared to existing patents using machine learning and natural language
processing techniques to automatically detect inventions that are similar to
the one described in the submitted document. Various state-of-the-art
approaches for feature extraction and document comparison are evaluated. In
addition to that, the quality of the current search process is assessed based
on ratings of a domain expert. The evaluation results show that our automated
approach, besides accelerating the search process, also improves the search
results for prior art with respect to their quality
Feature-based validation reasoning for intent-driven engineering design
Feature based modelling represents the future of CAD systems. However,
operations such as modelling and editing can corrupt the validity of a feature-based
model representation. Feature interactions are a consequence of feature
operations and the existence of a number of features in the same model. Feature
interaction affects not only the solid representation of the part, but also the
functional intentions embedded within features. A technique is thus required to
assess the integrity of a feature-based model from various perspectives,
including the functional intentional one, and this technique must take into
account the problems brought about by feature interactions and operations. The
understanding, reasoning and resolution of invalid feature-based models
requires an understanding of the feature interaction phenomena, as well as the
characterisation of these functional intentions. A system capable of such
assessment is called a feature-based representation validation system.
This research studies feature interaction phenomena and feature-based
designer's intents as a medium to achieve a feature-based representation
validation system. [Continues.
Tailoring patent policy for developing economies
As intellectual property chapters are now regularly part of free trade agreements, countries need to have a clear view of what elements of a patent system will encourage domestic innovation and what elements will simply raise the cost of goods and services. Drawing on the range of empirical material available about patent systems, this paper presents an initial analysis of critical design elements to maximise economic welfare while implementing patent policy in developing and technology-importing economies. Key issues considered are: patent policy objectives; limitations to patentable subject matter; the height of the inventive step; the privileges provided by patents; incentives, penalties and strategic gaming; and transparency issues particularly oversight, evaluation and audit. Development of a set of policy principles which align with maximising national economic well-being goes some way to meeting the goals of the Development Agenda Group put forward in the context of WIPO's Committee on Development and Intellectual Property. Such a set of principles would also play a useful role is assessing the value of patents in trading for improved market access for goods and services thus assisting an evidence-based approach to trade negotiations
- …