ResearchDataFile

Description

A Research Data File is a File that contains raw data originating from an Experiment.

Fields

format

description : The file format of the Research Data File, chosen from the permissible values listed for this field (e.g., FASTQ, UBAM, FASTA).
required : True
data type : Controlled Vocabulary

Permissible Values

FASTA : FASTA format including NCBI-style IDs.
FASTQ : FASTQ short read format ignoring quality scores.
UBAM : Unaligned BAM file.
FAST5 : FAST5 data format for Nanopore.
RAW : Raw file format for mass spectrometry proteomics data.
D : Raw .d files for mass spectrometry proteomics data.
MZML : mzML format for mass spectrometry proteomics data.
MZDATA : mzData for mass spectrometry proteomics data.
IDAT : IDAT file format for BeadArray data.
OTHER : A file format not captured by the controlled vocabulary.
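
The controlled vocabulary for format could be mirrored in client code as an enumeration, so that invalid values are caught before submission. The following is a minimal Python sketch, not part of the schema itself: the names `ResearchDataFileFormat` and `parse_format` are hypothetical, and case-sensitive matching is an assumption.

```python
from enum import Enum


class ResearchDataFileFormat(Enum):
    """Hypothetical mirror of the permissible values of the `format` field."""

    FASTA = "FASTA"
    FASTQ = "FASTQ"
    UBAM = "UBAM"
    FAST5 = "FAST5"
    RAW = "RAW"
    D = "D"
    MZML = "MZML"
    MZDATA = "MZDATA"
    IDAT = "IDAT"
    OTHER = "OTHER"


def parse_format(value: str) -> ResearchDataFileFormat:
    """Return the matching member, or raise ValueError for an unknown value.

    Assumption: values are matched case-sensitively, exactly as listed.
    """
    try:
        return ResearchDataFileFormat(value)
    except ValueError:
        allowed = ", ".join(member.value for member in ResearchDataFileFormat)
        raise ValueError(
            f"{value!r} is not a permissible format; expected one of: {allowed}"
        ) from None
```

With this sketch, `parse_format("FASTQ")` returns `ResearchDataFileFormat.FASTQ`, while a spelling outside the list, such as `"uBAM"`, raises `ValueError`.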

technical_replicate

description : An integer identifying which technical replicate this File represents.
required : True
data type : integer

sequencing_lane_id

description : The identifier of a sequencing lane.
required : False
data type : string

experiments

description : The aliases of the Experiments that produced this Research Data File.
required : True
data type : Experiment

name

description : The given filename.
required : True
data type : string

dataset

description : The Dataset alias associated with this File.
required : True
data type : Dataset

ega_accession

description : The EGA accession ID of an entity.
required : False
data type : string

included_in_submission

description : Whether a File is included in the Submission or not.
required : True
data type : boolean

alias

description : The alias for an entity at the time of submission.
required : True
data type : string
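
The required flags and data types listed for the fields of a Research Data File can be checked programmatically before submission. The following is a minimal Python sketch under stated assumptions: the record is a plain dictionary, `experiments` is given as a list of Experiment aliases, and `dataset` as a Dataset alias string. The function name `validate_research_data_file` and all values in the example record are hypothetical.

```python
from typing import List

# Field name -> (required, expected Python type), transcribed from the field list.
# Assumption: `experiments` is a list of Experiment aliases and `dataset` is a
# Dataset alias string; the actual schema may model these references differently.
FIELDS = {
    "format": (True, str),
    "technical_replicate": (True, int),
    "sequencing_lane_id": (False, str),
    "experiments": (True, list),
    "name": (True, str),
    "dataset": (True, str),
    "ega_accession": (False, str),
    "included_in_submission": (True, bool),
    "alias": (True, str),
}


def validate_research_data_file(record: dict) -> List[str]:
    """Return a list of human-readable problems; an empty list means no problems found."""
    errors = []
    for field, (required, expected_type) in FIELDS.items():
        value = record.get(field)
        if value is None:
            if required:
                errors.append(f"missing required field: {field}")
            continue
        # bool is a subclass of int in Python, so reject it explicitly for integer fields.
        bool_as_int = expected_type is int and isinstance(value, bool)
        if bool_as_int or not isinstance(value, expected_type):
            errors.append(
                f"{field}: expected {expected_type.__name__}, got {type(value).__name__}"
            )
    return errors


# Hypothetical example record; none of these values come from the schema.
example = {
    "format": "FASTQ",
    "technical_replicate": 1,
    "experiments": ["EXP_ALIAS_1"],
    "name": "sample_1_R1.fastq.gz",
    "dataset": "DATASET_ALIAS_1",
    "included_in_submission": True,
    "alias": "FILE_ALIAS_1",
}
print(validate_research_data_file(example))  # prints []
```

The sketch only checks presence and basic types. It does not verify that format is one of the permissible values or that the referenced Experiment and Dataset aliases exist.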