5 research outputs found
Stop Illegal Comments: A Multi-Task Deep Learning Approach
Deep learning methods are often difficult to apply in the legal domain due to
the large amount of labeled data required by deep learning methods. A recent
new trend in the deep learning community is the application of multi-task
models that enable single deep neural networks to perform more than one task at
the same time, for example classification and translation tasks. These powerful
novel models are capable of transferring knowledge among different tasks or
training sets and therefore could open up the legal domain for many deep
learning applications. In this paper, we investigate the transfer learning
capabilities of such a multi-task model on a classification task on the
publicly available Kaggle toxic comment dataset for classifying illegal
comments and we can report promising results.Comment: 10 pages, 4 figures, 1 tabl
Predicting the Outcome of Appeal Decisions in Germany’s Tax Law
Part 3: Policy Modeling and Policy InformaticsInternational audiencePredicting the outcome or the probability of winning a legal case has always been highly attractive in legal sciences and practice. Hardly any attempt has been made to predict the outcome of German cases, although prior court decisions become more and more important in various legal domains of Germany’s jurisdiction, e.g., tax law.This paper summarizes our research on training a machine learning classifier to determine likelihood ratios and thus predict the outcome of a restricted set of cases from Germany’s jurisdiction. Based on a data set of German tax law cases (44 285 documents from 1945 to 2016) we selected those cases which belong to an appeal decision (5 990 documents). We used the provided meta-data and natural language processing to extract 11 relevant features and trained a Naive Bayes classifier to predict whether an appeal is going to be successful or not.The evaluation (10-fold cross validation) on the data set has shown a performance regarding -score between 0.53 and 0.58. This score indicates that there is room for improvement. We expect that the high relevancy for legal practice, the availability of data, and advance machine learning techniques will foster more research in this area