Check Gene Expression.
check_gene_expression.Rd
This function determines which samples have expression data available in the merge and drops redundant data, consistently prioritizing by protocol and nucleic acid source.
Value
A data frame with a row for each non-redundant RNA-seq result and the following columns:
- mrna_sample_id
The unique sample_id value that will match a single row from the GAMBL metadata where seq_type is mrna.
- biopsy_id
The unique identifier for the source of nucleic acids.
- sample_id
Identical to mrna_sample_id.
- capture_sample_id
When this biopsy has capture/exome data in the GAMBL metadata, the value will be the sample_id for that data. NA otherwise.
- genome_sample_id
When this biopsy has genome data in the GAMBL metadata, the value will be the sample_id for that data. NA otherwise.
- patient_id
The anonymized unique identifier for this patient. For BC samples, this will be Res ID.
- seq_type
The assay type used to produce this data (will always be "mrna" in this case).
- protocol
Specifies the RNA-seq library construction protocol.
- ffpe_or_frozen
Specifies the way the source of nucleic acids was preserved. Either FFPE or frozen.
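Examples
The sketch below illustrates the general idea of keeping one RNA-seq row per biopsy by ranking on protocol and preservation. It does not call check_gene_expression and is not its implementation. The toy metadata, the protocol labels, the priority order (ribodepletion before polyA, frozen before FFPE), and the choice to deduplicate on biopsy_id are all assumptions made for illustration.

```r
# Toy stand-in for GAMBL metadata rows where seq_type is "mrna".
# Every value here is invented for illustration.
meta <- data.frame(
  sample_id      = c("S1_polyA", "S1_ribo", "S2_ffpe", "S2_frozen", "S3_ribo"),
  biopsy_id      = c("B1", "B1", "B2", "B2", "B3"),
  patient_id     = c("P1", "P1", "P2", "P2", "P3"),
  seq_type       = "mrna",
  protocol       = c("PolyA", "Ribodepletion", "Ribodepletion",
                     "Ribodepletion", "Ribodepletion"),
  ffpe_or_frozen = c("frozen", "frozen", "FFPE", "frozen", "FFPE"),
  stringsAsFactors = FALSE
)

# Hypothetical priority order (lower rank wins); the real function may differ.
protocol_rank <- c(Ribodepletion = 1, PolyA = 2)
source_rank   <- c(frozen = 1, FFPE = 2)

meta$rank_protocol <- unname(protocol_rank[meta$protocol])
meta$rank_source   <- unname(source_rank[meta$ffpe_or_frozen])

# Sort so the preferred row comes first within each biopsy,
# then keep only the first row per biopsy_id.
ordered_meta <- meta[order(meta$biopsy_id, meta$rank_protocol, meta$rank_source), ]
kept <- ordered_meta[!duplicated(ordered_meta$biopsy_id), ]

# Mirror the documented column layout: mrna_sample_id duplicates sample_id.
kept$mrna_sample_id <- kept$sample_id
result <- kept[, c("mrna_sample_id", "biopsy_id", "sample_id", "patient_id",
                   "seq_type", "protocol", "ffpe_or_frozen")]
rownames(result) <- NULL

print(result)
```

With this toy input, one row survives per biopsy: the ribodepleted library for B1 and the frozen sample for B2. The capture_sample_id and genome_sample_id columns are left out because filling them requires the matching capture and genome rows from the metadata.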