fix massive bottleneck #296

RaphaelS1 · 2023-11-01T13:27:02Z

Fixes a huge bottleneck in the C++ functions for vectorised WeightedDiscrete (includes Matdist and Arrdist)

codecov · 2023-11-01T14:56:53Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (255a666) 88.71% compared to head (dcd1d72) 88.71%.

❗ Current head dcd1d72 differs from pull request most recent head ddef09a. Consider uploading reports for the commit ddef09a to get more accurate results

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #296      +/-   ##
==========================================
- Coverage   88.71%   88.71%   -0.01%     
==========================================
  Files         112      112              
  Lines        9122     9115       -7     
==========================================
- Hits         8093     8086       -7     
  Misses       1029     1029

Files	Coverage Δ
R/SDistribution_Arrdist.R	`84.50% <100.00%> (-0.11%)`	⬇️
R/SDistribution_Matdist.R	`90.19% <100.00%> (-0.29%)`	⬇️
R/SDistribution_WeightedDiscrete.R	`90.00% <100.00%> (-0.17%)`	⬇️
src/Distributions.cpp	`92.28% <100.00%> (ø)`


jemus42 · 2023-11-03T16:52:54Z

Any idea why R CMD Check (macOS-latest (release)) is stuck?
Is it possible to restart this?

jemus42 · 2023-11-06T15:28:07Z

The R CMD Check (macOS-latest (release)) job is still "expected", but... hanging?

bblodfon · 2023-11-10T10:18:16Z

src/Distributions.cpp

@@ -415,7 +415,7 @@ NumericMatrix C_Vec_WeightedDiscretePdf(NumericVector x, NumericMatrix data,
  for (int i = 0; i < nc; i++) {
    for (int k = 0; k < n; k++) {
      for (int j = 0; j < nr; j++) {
-        if (data(j, i) == x[k]) {
+        if (data(j) == x[k]) {


C++ is not my strong point, but since data is a NumericVector as well, shouldn't it be data[j]?

Checking with https://teuder.github.io/rcpp4everyone_en/080_vector.html#accessing-vector-elements that should be correct. I wonder what actually happens when using data(j, i) in this case 🤔

still it would be better to have it consistent, since both x and data are of the same type

Verification question: if data(j) == x[k] is never TRUE, then mat(k, i) (the pdf at a new time point not included in the original x) will be zero, and that's what we want, right?
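To make the verification question concrete, here is a minimal sketch of the lookup in plain C++. It is not the package code: `lookup_pdf_scan` is a hypothetical name, `std::vector` stands in for `NumericVector`/`NumericMatrix`, matrices are stored column-major as in R, and the output is assumed to start zero-filled. Under those assumptions, a query point with no match in `data` keeps its zero.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the reviewed routine (not the package API).
// x:    n query points
// data: nr support points shared by all distributions
// pdf:  nr x nc matrix, column-major, one column per distribution
// Returns an n x nc column-major matrix.
std::vector<double> lookup_pdf_scan(const std::vector<double>& x,
                                    const std::vector<double>& data,
                                    const std::vector<double>& pdf,
                                    std::size_t nc) {
  const std::size_t n = x.size();
  const std::size_t nr = data.size();
  assert(pdf.size() == nr * nc);

  // Assumption: the output starts at zero, so unmatched points stay zero.
  std::vector<double> mat(n * nc, 0.0);

  for (std::size_t i = 0; i < nc; ++i) {
    for (std::size_t k = 0; k < n; ++k) {
      for (std::size_t j = 0; j < nr; ++j) {
        if (data[j] == x[k]) {
          mat[k + i * n] = pdf[j + i * nr];
          break;  // stop at the first matching support point
        }
      }
    }
  }
  return mat;
}
```

With `data = {1, 2, 3}` and `x = {2, 2.5}`, the row for 2 copies the pdf values at index 1 and the row for 2.5 stays at zero in every column.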

Would it make sense to speed this up by removing the third loop? You essentially search for the index where the time/data points match every time you scan the vector data (again, C++ is not my strongest point; maybe with the break and scanning with for you are already doing pretty well)

There are other places in the code where we can substitute () with [] to make it more consistent - pushed a commit for this

How would you remove the third loop? Can you demonstrate with R code?

I was writing the previous response late and didn't think it through, so what I initially thought is not going to work. But I asked ChatGPT and got something similar: an optimization that reduces the time complexity from O(nr * nc * n) to O(nr + nc * n) by precomputing the indices of elements in the 'data' array using a hashmap (!):

#include <unordered_map>

// Assuming mat, pdf, data, and x are appropriately defined

// Create a hashmap to store the indices of elements in the 'data' array
std::unordered_map<int, int> dataIndices;
for (int j = 0; j < nr; ++j) {
  dataIndices[data[j]] = j;
}

// Use the hashmap to populate the 'mat' matrix efficiently
for (int i = 0; i < nc; ++i) {
  for (int k = 0; k < n; ++k) {
    auto dataIndexIterator = dataIndices.find(x[k]);
    if (dataIndexIterator != dataIndices.end()) {
      int j = dataIndexIterator->second;
      mat(k, i) = pdf(j, i);
    }
  }
}
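A caveat on the snippet above: with `int` keys, a non-integer point such as 2.5 is truncated to 2 on both insertion and lookup, so it can match the wrong entry. Assigning with `operator[]` also keeps the last index for duplicated points, whereas a scan that breaks on the first hit keeps the first. The following is a self-contained sketch of the same idea with `double` keys, assuming plain `std::vector` inputs and column-major matrices; `lookup_pdf_hashed` is a hypothetical name, not part of the package.

```cpp
#include <cassert>
#include <cstddef>
#include <unordered_map>
#include <vector>

// Hypothetical hash-based variant (not the package API).
// pdf is nr x nc, column-major; the result is n x nc, column-major.
std::vector<double> lookup_pdf_hashed(const std::vector<double>& x,
                                      const std::vector<double>& data,
                                      const std::vector<double>& pdf,
                                      std::size_t nc) {
  const std::size_t n = x.size();
  const std::size_t nr = data.size();
  assert(pdf.size() == nr * nc);

  // Keys are double so that non-integer points are not truncated.
  std::unordered_map<double, std::size_t> index;
  index.reserve(nr);
  for (std::size_t j = 0; j < nr; ++j) {
    // emplace does nothing if the key exists, so the first index is kept.
    index.emplace(data[j], j);
  }

  std::vector<double> mat(n * nc, 0.0);  // unmatched points stay zero
  for (std::size_t k = 0; k < n; ++k) {
    // One lookup per query point, reused for every column.
    const auto it = index.find(x[k]);
    if (it == index.end()) continue;
    for (std::size_t i = 0; i < nc; ++i) {
      mat[k + i * n] = pdf[it->second + i * nr];
    }
  }
  return mat;
}
```

Building the map costs O(nr) on average and each lookup is expected O(1), which gives roughly the O(nr + nc * n) quoted above. Whether that beats the early-exit scan in practice would need benchmarking on realistic sizes.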

Heh, looks like your ChatGPT-foo is better than mine, I didn't get too far 😬
I thought that using an ordered set should be the way to go based on how indices are, well, ordered, so surely using an ordered data structure has some sort of benefit?
But I also didn't think this through entirely because I think I still haven't wrapped my head around this fully.
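On the ordered-structure idea: if the support points in `data` are sorted in ascending order (an assumption here, not something this thread confirms), no extra container is needed, because a binary search over the vector itself finds the index. A minimal sketch with the hypothetical name `lookup_pdf_sorted`, using plain `std::vector` inputs and column-major matrices:

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical binary-search variant (not the package API).
// Precondition (assumed): data is sorted in ascending order.
// pdf is nr x nc, column-major; the result is n x nc, column-major.
std::vector<double> lookup_pdf_sorted(const std::vector<double>& x,
                                      const std::vector<double>& data,
                                      const std::vector<double>& pdf,
                                      std::size_t nc) {
  const std::size_t n = x.size();
  const std::size_t nr = data.size();
  assert(pdf.size() == nr * nc);
  assert(std::is_sorted(data.begin(), data.end()));

  std::vector<double> mat(n * nc, 0.0);  // unmatched points stay zero
  for (std::size_t k = 0; k < n; ++k) {
    // First element that is not less than x[k]; O(log nr) per query.
    const auto it = std::lower_bound(data.begin(), data.end(), x[k]);
    if (it == data.end() || *it != x[k]) continue;  // no exact match
    const std::size_t j = static_cast<std::size_t>(it - data.begin());
    for (std::size_t i = 0; i < nc; ++i) {
      mat[k + i * n] = pdf[j + i * nr];
    }
  }
  return mat;
}
```

This costs O(n log nr + n * nc) without building a map, and like the early-exit scan it resolves duplicated points to their first occurrence.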

bblodfon · 2023-11-10T17:26:32Z

R/SDistribution_Arrdist.R

@@ -310,30 +310,28 @@ Arrdist <- R6Class("Arrdist",
    .pdf = function(x, log = FALSE) {
      "pdf, data, wc" %=% gprm(self, c("pdf", "x", "which.curve"))
      mat <- .extCurve(pdf, wc)
-      out <- t(C_Vec_WeightedDiscretePdf(
-        x, matrix(data, ncol(mat), private$.ndists), t(mat)))
+      out <- t(C_Vec_WeightedDiscretePdf(x, data, t(mat)))


@RaphaelS1 so I think I understand what C_Vec_WeightedDiscretePdf does: it takes a pdf matrix and subsets it to the x times (points) instead of the data points it has. It's just that the names x and data are not very informative in this context. Some points for discussion:

(1) Should we consider some name changing? I think data is like an xnew?

(2) These matrix transposes may slow things down? Wouldn't it make more sense if the code worked as C_Vec_WeightedDiscretePdf(x, data, mat)?

Yes maybe but not in this PR

I think (1) would make sense to do now, (2) might be trickier

for (1) => #297


RaphaelS1 · 2023-11-10T22:57:03Z

> The R CMD Check (macOS-latest (release)) job is still "expected", but... hanging?

Because I have outdated branch protection settings


fix massive bottleneck

ca7677e

RaphaelS1 requested a review from bblodfon November 1, 2023 13:27

RaphaelS1 mentioned this pull request Nov 1, 2023

fix measure bottlenecks mlr-org/mlr3proba#337

Merged

bblodfon reviewed Nov 10, 2023


RaphaelS1 and others added 3 commits November 10, 2023 22:58

Update src/Distributions.cpp

2337982

consistent operator for vector indexing

6095caa

more informative comment about indexes used

ddef09a

RaphaelS1 merged commit a656b7b into main Nov 11, 2023
4 checks passed

jemus42 mentioned this pull request Dec 4, 2023

Checklist before final benchmark can run slds-lmu/paper_2023_survival_benchmark#3

Closed

16 tasks
