All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
- V&V implemented akin to RSEM counts table
- Ensure counts correctly aggregated from {sample}_ReadsPerGene.out.tab
- Uses {Oorg}.grp for file prefix (e.g. Mmus.grp / Mmus_ERCC.grp)
- Unnormalized_Counts.csv renamed to RSEM_Unnormalized_Counts.csv to disambiguate compared to STAR_Unnormalized_Counts.csv
- raw fastq.gz files will now be named as they were originally named in the GLDS repo
- Files derived from those original raw fastq.gz will be renamed to a standard format (as opposed to renaming the original raw files in a standard format)
- Modified to use database files generated in-house.
- This addresses a bug in STRINGdb and may be reverted once the STRINGdb bug fix is released
- md5sum table is not synced with assay table
- Now correctly synced
- New published files
- Output directory: 02-STAR_Alignment
- STAR_NumNonZeroGenes.csv
- A dataset wide tabulation of the number of genes with non zero counts.
- STAR_Unnormalized_Counts.csv
- A dataset wide tabulation of gene counts output from STAR.
- STAR_NumNonZeroGenes.csv
- Output directory: 02-STAR_Alignment
- STAR_Unnormalized_Counts.csv file is published as originally intended
- Will now output quantification 'GeneCounts' (in addition to TranscriptomeSAM)
0.1.1-beta - 2022-03-09
- Python scripts set to executable