Legal Case Reports

Donated on 10/18/2012

A textual corpus of 4000 legal cases for automatic summarization and citation analysis. For each document we collect catchphrases, citations sentences, citation catchphrases and citation classes.

Dataset Characteristics

Text

Subject Area

Other

Associated Tasks

Classification

Feature Type

-

# Instances

-

# Features

-

Dataset Information

Additional Information

This dataset contains Australian legal cases from the Federal Court of Australia (FCA). The cases were downloaded from AustLII (http://www.austlii.edu.au). We included all cases from the year 2006,2007,2008 and 2009. We built it to experiment with automatic summarization and citation analysis. For each document we collected catchphrases, citations sentences, citation catchphrases, and citation classes. Catchphrases are found in the document, we used the catchphrases are gold standard for our summarization experiments. Citation sentences are found in later cases that cite the present case, we use citation sentences for summarization. Citation catchphrases are the catchphrases (where available) of both later cases that cite the present case, and older cases cited by the present case. Citation classes are indicated in the document, and indicate the type of treatment given to the cases cited by the present case.

Has Missing Values?

No

Dataset Files

FileSize
corpus/citations_class/06_1234.xml6.5MB
corpus/fulltext/07_1062.xml3MB
corpus/citations_class/06_1112.xml2.3MB
corpus/fulltext/08_498.xml1.6MB
corpus/citations_class/06_1663.xml1.1MB

0 to 5 of 10536

Reviews

There are no reviews for this dataset yet.

Login to Write a Review
Download (85MB)
0 citations
11118 views

Creators

Filippo Galgani

License

By using the UCI Machine Learning Repository, you acknowledge and accept the cookies and privacy practices used by the UCI Machine Learning Repository.

Read Policy