
Legal Case Reports
Donated on 10/18/2012
A textual corpus of 4000 legal cases for automatic summarization and citation analysis. For each document we collect catchphrases, citations sentences, citation catchphrases and citation classes.
Dataset Characteristics
Text
Subject Area
Other
Associated Tasks
Classification
Feature Type
-
# Instances
-
# Features
-
Dataset Information
Additional Information
This dataset contains Australian legal cases from the Federal Court of Australia (FCA). The cases were downloaded from AustLII (http://www.austlii.edu.au). We included all cases from the year 2006,2007,2008 and 2009. We built it to experiment with automatic summarization and citation analysis. For each document we collected catchphrases, citations sentences, citation catchphrases, and citation classes. Catchphrases are found in the document, we used the catchphrases are gold standard for our summarization experiments. Citation sentences are found in later cases that cite the present case, we use citation sentences for summarization. Citation catchphrases are the catchphrases (where available) of both later cases that cite the present case, and older cases cited by the present case. Citation classes are indicated in the document, and indicate the type of treatment given to the cases cited by the present case.
Has Missing Values?
No
Variable Information
Provide information about each attribute in your data set.
Galgani,Filippo. (2012). Legal Case Reports. UCI Machine Learning Repository. https://doi.org/10.24432/C5ZS41.
@misc{misc_legal_case_reports_239, author = {Galgani,Filippo}, title = {{Legal Case Reports}}, year = {2012}, howpublished = {UCI Machine Learning Repository}, note = {{DOI}: https://doi.org/10.24432/C5ZS41} }
pip install ucimlrepo
from ucimlrepo import fetch_ucirepo # fetch dataset legal_case_reports = fetch_ucirepo(id=239) # data (as pandas dataframes) X = legal_case_reports.data.features y = legal_case_reports.data.targets # metadata print(legal_case_reports.metadata) # variable information print(legal_case_reports.variables)
Creators
Filippo Galgani
DOI
License
This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
This allows for the sharing and adaptation of the datasets for any purpose, provided that the appropriate credit is given.