gene expression cancer RNA-Seq

Donated on 6/8/2016

This collection of data is part of the RNA-Seq (HiSeq) PANCAN data set, it is a random extraction of gene expressions of patients having different types of tumor: BRCA, KIRC, COAD, LUAD and PRAD.

Dataset Characteristics

Multivariate

Subject Area

Biology

Associated Tasks

Classification, Clustering

Feature Type

Real

# Instances

801

# Features

20531

Dataset Information

Additional Information

Samples (instances) are stored row-wise. Variables (attributes) of each sample are RNA-Seq gene expression levels measured by illumina HiSeq platform.

Has Missing Values?

No

Variable Information

A dummy name (gene_XX) is given to each attribute. Check the original submission (https://www.synapse.org/#!Synapse:syn4301332), or the platform specs for the complete list of probes name. The attributes are ordered consitently with the original submission.

Dataset Files

FileSize
TCGA-PANCAN-HiSeq-801x20531.tar.gz69.5 MB

Reviews

There are no reviews for this dataset yet.

Login to Write a Review
Download (69.5 MB)
0 citations
14259 views

Creators

Samuele Fiorini

License

By using the UCI Machine Learning Repository, you acknowledge and accept the cookies and privacy practices used by the UCI Machine Learning Repository.

Read Policy