4 research outputs found
University Patenting: Is Private Law Serving Public Values?
Article published in the Michigan State Law Review
University Patenting: Is Private Law Serving Public Values?
Article published in the Michigan State Law Review
Narrowing the Universe: A Machine Learning Approach to Patent Clearance
Companies cannot reliably predict which patents are likely to be asserted against them. If they could, they would be better able to quantify and mitigate their own patent infringement risk. We used machine learning methods, informed by legal scholarsā understanding of relevant patent traits, to improve on prior attempts to predict litigation. We built primarily on Colleen Chienās Predicting Patent Litigation. Chien used traits from a patentās legal history and developed a method of prediction based on the traits acquired before litigation, but not after. She demonstrated that the traits acquired before litigation are useful predictors. Evaluating Chienās approach, we determined that her logistic regression model was generalizableāthat is, not overfit to her training sampleāthough it does not perform as well on real datasets as her matched-pairs evaluation suggested. We found that year-over-year changes in patenting and litigation will hinder real-world prediction with this approach, but only modestly. Building a much larger dataset of newer patents, and selecting machine learning models tailored to the task, we improved on Chienās results. Our random forest model had a 7.8% greater area under the precision-recall curve, and it could allow a company to narrow its patent clearance search to a set of patents up to 34% smaller, compared to Chienās logistic regression approach. We report our results on a random sample of patents using standardized metrics, providing a baseline for future work predicting patent litigation
Narrowing the Universe: A Machine Learning Approach to Patent Clearance
Companies cannot reliably predict which patents are likely to be asserted against them. If they could, they would be better able to quantify and mitigate their own patent infringement risk. We used machine learning methods, informed by legal scholarsā understanding of relevant patent traits, to improve on prior attempts to predict litigation. We built primarily on Colleen Chienās Predicting Patent Litigation. Chien used traits from a patentās legal history and developed a method of prediction based on the traits acquired before litigation, but not after. She demonstrated that the traits acquired before litigation are useful predictors. Evaluating Chienās approach, we determined that her logistic regression model was generalizableāthat is, not overfit to her training sampleāthough it does not perform as well on real datasets as her matched-pairs evaluation suggested. We found that year-over-year changes in patenting and litigation will hinder real-world prediction with this approach, but only modestly. Building a much larger dataset of newer patents, and selecting machine learning models tailored to the task, we improved on Chienās results. Our random forest model had a 7.8% greater area under the precision-recall curve, and it could allow a company to narrow its patent clearance search to a set of patents up to 34% smaller, compared to Chienās logistic regression approach. We report our results on a random sample of patents using standardized metrics, providing a baseline for future work predicting patent litigation