-
Notifications
You must be signed in to change notification settings - Fork 255
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comparecluster analysis is not working for me with same data #754
Comments
Because of its length, lack of formatting and reproducibility I did not bother to read your full post.... Yet, did you see the error in the last few lines?
This error is thrown by Also note the warning on the ties in the list(s) of preranked stats! BTW: what is |
Hallo, Sorry it was GSEA analysis on the basis of signal to noise (SNR) ratio , I have checked my data fur duplications, and there is no duplicate in gene name ( Entrez ID) however in SNR values there are duplicates. I have performed GSEA analysis on the same date in September 2024 and now when I updated my R version to 4.4.2 and have re-runthe same code I got this error
Fehler in .stopf("Duplicate values in %s not allowed", universeArg) : |
Again, the error is due to duplicates in your input.
If you would like to get some further assistance, please provide some reproducible code, or share/attach (part of ) your input (i.e. object |
Also, did you check the input ( Thus, what is the output of |
yes I checked . It is a gene list of 4 clusters. I have run that in R and paste it below ) > set.seed(1) > > ck_TP_VL_HM<-compareCluster(geneClusters = geneList_TP_VL, fun="GSEA",TERM2GENE = hallmark_gs, + pAdjustMethod="fdr", + seed=TRUE, by="fgsea") Fehler in .stopf("Duplicate values in %s not allowed", universeArg) : Duplicate values in names(stats) not allowed Zusätzlich: Warnmeldung: In preparePathwaysAndStats(pathways, stats, minSize, maxSize, gseaParam, : There are ties in the preranked stats (10.73% of the list). The order of those tied genes will be arbitrary, which may produce unexpected results.
Should I need to send my data. I am very surprised, on the same data the code worked few months before but now it provided thiis error. kindly help me in this regard. best |
From your latest post I noticed the length of the input lists is not the same! Yet, this is not causing a problem (see code below in which some example data is used). I also noted in your first post that you convert your lists of genes used as input to From your code in first post
Considering your use case, since your data set is based on ensemblids, my suggestion would be to download the MSigDB hallmark gene sets as Code chunk to show you can download the MSigDB hallmark gene sets as
Code chunk to show that size of ranked lists do not need to be equal.
|
Hallo, Thank you very musch. yes it worked for hallmark but for GSEA-Kegg, we need Entrez ID, from ChatGPT I got the following code ..geneList_fgsea <- geneList_fgsea[!duplicated(names(geneList_fgsea))] to remove duplicates and it resolved the issue. Thank you very much for your quick response and support. Regards |
Hallo,
I want to re-run the GSEA on my same data. I have previously performed the compare cluster analysis for GSEGSEA and it worked very well but now I have updated the R version to 4.4.2. here is the pathway and code with error. Kindly let me know what is the issue now
[1] "Row order is the same."
[1] "Row order is the same."
A tibble: 20 × 2
species_name species_common_name
1 Anolis carolinensis Carolina anole, green anole
2 Bos taurus bovine, cattle, cow, dairy cow, domestic cattle, domestic cow, ox, oxen
3 Caenorhabditis elegans NA
4 Canis lupus familiaris dog, dogs
5 Danio rerio leopard danio, zebra danio, zebra fish, zebrafish
6 Drosophila melanogaster fruit fly
7 Equus caballus domestic horse, equine, horse
8 Felis catus cat, cats, domestic cat
9 Gallus gallus bantam, chicken, chickens, Gallus domesticus
10 Homo sapiens human
11 Macaca mulatta rhesus macaque, rhesus macaques, Rhesus monkey, rhesus monkeys
12 Monodelphis domestica gray short-tailed opossum
13 Mus musculus house mouse, mouse
14 Ornithorhynchus anatinus duck-billed platypus, duckbill platypus, platypus
15 Pan troglodytes chimpanzee
16 Rattus norvegicus brown rat, Norway rat, rat, rats
17 Saccharomyces cerevisiae baker's yeast, brewer's yeast, S. cerevisiae
18 Schizosaccharomyces pombe 972h- NA
19 Sus scrofa pig, pigs, swine, wild boar
20 Xenopus tropicalis tropical clawed frog, western clawed frog
A tibble: 23 × 3
gs_cat gs_subcat num_genesets
1 C1 "" 299
2 C2 "CGP" 3384
3 C2 "CP" 29
4 C2 "CP:BIOCARTA" 292
5 C2 "CP:KEGG" 186
6 C2 "CP:PID" 196
7 C2 "CP:REACTOME" 1615
8 C2 "CP:WIKIPATHWAYS" 664
9 C3 "MIR:MIRDB" 2377
10 C3 "MIR:MIR_Legacy" 221
ℹ 13 more rows
ℹ Use
print(n = ...)
to see more rowsA tibble: 20 × 2
species_name species_common_name
1 Anolis carolinensis Carolina anole, green anole
2 Bos taurus bovine, cattle, cow, dairy cow, domestic cattle, domestic cow, ox, oxen
3 Caenorhabditis elegans NA
4 Canis lupus familiaris dog, dogs
5 Danio rerio leopard danio, zebra danio, zebra fish, zebrafish
6 Drosophila melanogaster fruit fly
7 Equus caballus domestic horse, equine, horse
8 Felis catus cat, cats, domestic cat
9 Gallus gallus bantam, chicken, chickens, Gallus domesticus
10 Homo sapiens human
11 Macaca mulatta rhesus macaque, rhesus macaques, Rhesus monkey, rhesus monkeys
12 Monodelphis domestica gray short-tailed opossum
13 Mus musculus house mouse, mouse
14 Ornithorhynchus anatinus duck-billed platypus, duckbill platypus, platypus
15 Pan troglodytes chimpanzee
16 Rattus norvegicus brown rat, Norway rat, rat, rats
17 Saccharomyces cerevisiae baker's yeast, brewer's yeast, S. cerevisiae
18 Schizosaccharomyces pombe 972h- NA
19 Sus scrofa pig, pigs, swine, wild boar
20 Xenopus tropicalis tropical clawed frog, western clawed frog
A tibble: 6 × 2
gs_name entrez_gene
1 HALLMARK_ADIPOGENESIS 11303
2 HALLMARK_ADIPOGENESIS 74610
3 HALLMARK_ADIPOGENESIS 52538
4 HALLMARK_ADIPOGENESIS 11363
5 HALLMARK_ADIPOGENESIS 11364
6 HALLMARK_ADIPOGENESIS 11409
Fehler in .stopf("Duplicate values in %s not allowed", universeArg) :
Duplicate values in names(stats) not allowed
Zusätzlich: Warnmeldung:
In preparePathwaysAndStats(pathways, stats, minSize, maxSize, gseaParam, :
There are ties in the preranked stats (10.73% of the list).
The order of those tied genes will be arbitrary, which may produce unexpected results.
The text was updated successfully, but these errors were encountered: