A Comparative Study of Classifying Legal Documents with Neural Networks
Citations Over TimeTop 1% of 2018 papers
Abstract
In recent years, deep learning has shown promising results when used in the field of natural language processing (NLP). Neural networks (NNs) such as convolutional neural networks (CNNs) and recurrent neural networks (RNNs) have been used for various NLP tasks including sentiment analysis, information retrieval, and document classification. In this paper, we the present the Supreme Court Classifier (SCC), a system that applies these methods to the problem of document classification of legal court opinions. We compare methods using traditional machine learning with recent NN-based methods. We also present a CNN used with pre-trained word vectors which shows improvements over the state-of-the-art applied to our dataset. We train and evaluate our system using the Washington University School of Law Supreme Court Database (SCDB). Our best system (word2vec + CNN) achieves 72.4% accuracy when classifying the court decisions into 15 broad SCDB categories and 31.9% accuracy when classifying among 279 finer-grained SCDB categories.
Related Papers
- → Real-Time Sentiment Analysis of 2019 Election Tweets using Word2vec and Random Forest Model(2019)48 cited
- → A Comparison Study on Legal Document Classification Using Deep Neural Networks(2019)2 cited
- → A study on document classification using multiple distributed representations(2017)1 cited
- → Fine-grained opinion mining : An application of online review analysis in the express industry(2017)1 cited
- → Tweets Sentiment Analysis via Word Embeddings and Machine Learning Techniques(2020)6 cited