Refactoring Ligandomics analysis #2

CaroAMN · 2022-07-04T13:36:30Z

start of the refactoring of the Ligandomics analysis + extra script for functions that I use

Done:

loading the data
data preparation like filtering
Waterfall plots
basic Venn diagrams

To do:

saturation analysis
length distribution
all todos open in the code
netMHCpan output reader
peptide selection

RNAseq analysis:

small changes like linting
included reduced data set were the dan contaminated sample was excluded + all benign samples also (just for testing )

…analysis

marissaDubbelaar

Additional to the comments, take a look at the linting

cschwitalla/Ligandomics_Analysis/Ligandomics_Analysis.R

marissaDubbelaar · 2022-07-05T09:09:36Z

cschwitalla/Ligandomics_Analysis/Ligandomics_Analysis.R

-required_Libs <- c("tidyr","readxl", "ggVennDiagram", "dplyr", "stringr", "tibble", 
-                   "ggplot2", "org.Hs.eg.db")
+required_Libs <- c("tidyr","readxl", "ggVennDiagram", "dplyr", "stringr",
+                   "tibble", "ggplot2", "org.Hs.eg.db")

 suppressMessages(invisible(lapply(required_Libs, library, character.only = T)))


Include a commented line that enables the user to install the libraries in one go

cschwitalla/Ligandomics_Analysis/Ligandomics_Analysis.R

marissaDubbelaar · 2022-07-05T09:11:05Z

cschwitalla/Ligandomics_Analysis/Ligandomics_Analysis.R

+GB_HLA_types <- read_xlsx(paste0(input_dir, "HLA-Typisierung_GBM.xlsx"), col_names = TRUE)
+
+# get list of unique HLA types
+uniqe_HLA_types <- unique(c(as.matrix(GB_HLA_types[2:16, 2:7])))


For me it is unknown what the information in the columns and row is, can you use another approach?
If not specify this information clearly.

marissaDubbelaar · 2022-07-05T09:12:04Z

cschwitalla/Ligandomics_Analysis/Ligandomics_Analysis.R

+# Benign data Immunology -------------------------------------------------------
+# more specific
+# less hits
+benign_pep_I <- read.csv(paste0(input_dir, "newBenignmorespecific/Benign_class1.csv"),


Can you find a way to reduce these 7-8 lines even more?

marissaDubbelaar · 2022-07-05T09:32:02Z

cschwitalla/Ligandomics_Analysis/functions_ligandomics.R

+##
+## OUTPUT:
+##
+getProteinAcc_uniqemappers <- function(list) {


You don't need the for loop, you can manipulate the data as it is

marissaDubbelaar · 2022-07-05T10:42:59Z

cschwitalla/RNAseq_Analysis/DE_Analysis.R



 ################################################################################
 ###                            Load Data                                     ###
 ################################################################################
 # Load meta data --> Metadata_GB.tsv in workdir
 metadata <- read.table(file = metadata_file, sep = "\t", header = TRUE)
+metadata2 <- metadata[-grep(("QATLV129AQ|QATLV139AX|QATLV162AW|QATLV171AV|QATLV188AQ"),metadata$QBiC.Code),]


QATLV(129AQ|139AX|162AW|171AV|188AQ) might be a better alternative

marissaDubbelaar · 2022-07-05T10:43:20Z

cschwitalla/RNAseq_Analysis/DE_Analysis.R

 # get filenames of inputDir
 file_names <- list.files(path = input_dir)
+# files without ben + outlier sample
+filnames_excl <- grep(("NEC|INF|T1"), file_names, value = TRUE)
+filnames_excl <- filnames_excl[c(1:7,9:45)]


Define which columns you collect from the filenames_excl

marissaDubbelaar · 2022-07-05T10:46:35Z

cschwitalla/RNAseq_Analysis/functions_RNA.R

@@ -154,13 +158,87 @@ make_heatmap <- function(gene_selection, vsd, batch, annotation_color) {
    "Sex" = vsd@colData@listData$Sex,
    "MGMT_methylation" = vsd@colData@listData$MGMT
  )
+  if (!is.null(k)) {


Make the if-else shorter

marissaDubbelaar · 2022-07-05T10:47:00Z

cschwitalla/RNAseq_Analysis/functions_RNA.R

+##             - batch: vsd column of the batch [vsd column]
+##
+## OUTPUT: PCA plot
+plot_pca <- function(dds_default, batch) {


I miss comments

…ults in one script now. Still some TODOs open

…nd other comments

… for oncoplots and venn diagram

CaroAMN added 2 commits June 30, 2022 23:46

start refactoring of ligandomics analysis + small changes for rnaseq …

ceb8a6c

…analysis

refactoring ligandomics analysis

1518c14

CaroAMN requested review from FriederikeHanssen and marissaDubbelaar July 4, 2022 13:37

marissaDubbelaar reviewed Jul 5, 2022

View reviewed changes

CaroAMN added 4 commits July 13, 2022 19:39

Biggest part of refactoring done. peptide selection and netMHCpan res…

f610c73

…ults in one script now. Still some TODOs open

comments added for all functions. Some corrections on miss spelling a…

08fe686

…nd other comments

added comments for functions and analysis

7210e10

included tcga compare, oncogenic pathways, gene selection with clivar…

26c8291

… for oncoplots and venn diagram

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactoring Ligandomics analysis #2

Refactoring Ligandomics analysis #2

CaroAMN commented Jul 4, 2022

marissaDubbelaar left a comment

marissaDubbelaar Jul 5, 2022

marissaDubbelaar Jul 5, 2022

marissaDubbelaar Jul 5, 2022

marissaDubbelaar Jul 5, 2022

marissaDubbelaar Jul 5, 2022

marissaDubbelaar Jul 5, 2022

marissaDubbelaar Jul 5, 2022

marissaDubbelaar Jul 5, 2022

Refactoring Ligandomics analysis #2

Are you sure you want to change the base?

Refactoring Ligandomics analysis #2

Conversation

CaroAMN commented Jul 4, 2022

marissaDubbelaar left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment