ResearchDataFile

Description

A Research Data File is a File that contains raw data originating from an Experiment.

Fields

format

description : The file format of the Research Data File (e.g., FASTQ, uBAM, FASTA).
required : True
data type : Controlled Vocabulary

Permissible Values
value : description
FASTA : FASTA format including NCBI-style IDs.
FASTQ : FASTQ short read format ignoring quality scores.
UBAM : Unaligned BAM file.
FAST5 : FAST5 data format for Nanopore.
RAW : Raw file format for mass spectrometry proteomics data.
D : Raw .d files for mass spectrometry proteomics data.
MZML : mzML format for mass spectrometry proteomics data.
MZDATA : mzData for mass spectrometry proteomics data.
OTHER : A file format not captured by the controlled vocabulary.
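
As a rough illustration of how a client might check a value against this controlled vocabulary, the sketch below mirrors the permissible values in a Python enum. The names `ResearchDataFileFormat` and `parse_format` are hypothetical and not part of the schema, and the case-sensitive match is an assumption (under it, the spelling "uBAM" would be rejected in favour of "UBAM").

```python
from enum import Enum


class ResearchDataFileFormat(Enum):
    """Hypothetical enum mirroring the permissible values of `format`."""

    FASTA = "FASTA"
    FASTQ = "FASTQ"
    UBAM = "UBAM"
    FAST5 = "FAST5"
    RAW = "RAW"
    D = "D"
    MZML = "MZML"
    MZDATA = "MZDATA"
    OTHER = "OTHER"


def parse_format(value: str) -> ResearchDataFileFormat:
    """Return the matching member; raise ValueError for anything else.

    Assumption: matching is case-sensitive and uses the exact
    spelling from the Permissible Values list.
    """
    try:
        return ResearchDataFileFormat(value)
    except ValueError:
        allowed = ", ".join(member.value for member in ResearchDataFileFormat)
        raise ValueError(
            f"{value!r} is not a permissible format; expected one of: {allowed}"
        ) from None
```

Calling `parse_format("FASTQ")` returns the `FASTQ` member, while a value outside the list, such as `"BAM"`, raises a `ValueError` naming the allowed values.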

technical_replicate

description : An integer identifying which technical replicate this File represents.
required : True
data type : integer

sequencing_lane_id

description : The identifier of a sequencing lane.
required : False
data type : string

experiments

description : The aliases of the Experiments that produced this Research Data File.
required : True
data type : Experiment

name

description : The given filename.
required : True
data type : string

dataset

description : The Dataset alias associated with this File.
required : True
data type : Dataset

ega_accession

description : The EGA accession ID of an entity.
required : False
data type : string

included_in_submission

description : Whether a File is included in the Submission or not.
required : True
data type : boolean

alias

description : The alias for an entity at the time of submission.
required : True
data type : string
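
The field list above can be summarised as a small validation sketch. This is not an official validator: the function name, the dictionary layout, and the example values are all invented for illustration. It also assumes that `experiments` is supplied as a list of alias strings and `dataset` as a single alias string, which the descriptions suggest but do not state outright. The controlled vocabulary for `format` is not checked here.

```python
from typing import Any, Dict, List

# Required fields and their assumed Python types, taken from the list above.
REQUIRED_FIELDS = {
    "format": str,                  # controlled vocabulary value
    "technical_replicate": int,
    "experiments": list,            # assumption: list of Experiment aliases
    "name": str,
    "dataset": str,                 # assumption: a single Dataset alias
    "included_in_submission": bool,
    "alias": str,
}

# Fields documented as "required : False".
OPTIONAL_FIELDS = {
    "sequencing_lane_id": str,
    "ega_accession": str,
}


def _has_type(value: Any, expected: type) -> bool:
    """Type check that does not accept booleans where an integer is expected."""
    if expected is int:
        return isinstance(value, int) and not isinstance(value, bool)
    return isinstance(value, expected)


def validate_research_data_file(record: Dict[str, Any]) -> List[str]:
    """Return a list of problems; an empty list means the record passed."""
    errors = []
    for name, expected in REQUIRED_FIELDS.items():
        if name not in record:
            errors.append(f"missing required field: {name}")
        elif not _has_type(record[name], expected):
            errors.append(f"field {name} should be of type {expected.__name__}")
    for name, expected in OPTIONAL_FIELDS.items():
        if name in record and not _has_type(record[name], expected):
            errors.append(f"field {name} should be of type {expected.__name__}")
    return errors


# Invented example record; none of these values come from a real submission.
example = {
    "format": "FASTQ",
    "technical_replicate": 1,
    "experiments": ["EXP_1"],
    "name": "sample_01_R1.fastq.gz",
    "dataset": "DS_1",
    "included_in_submission": True,
    "alias": "RDF_1",
}
```

Returning a list of messages instead of raising on the first problem lets a submitter see every missing or mistyped field in one pass.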