The init status in initialize_latent #1078

kubotty · 2024-07-05T05:59:12Z

def initialize_latent(init, input_dim, Y):
    Xr = np.asfortranarray(np.random.normal(0, 1, (Y.shape[0], input_dim)))
    if 'PCA' in init:
        ...
    elif init in 'empirical_samples':
        ...
    else:
        ...

If the init has any strings that include three letters of "PCA", it takes the first branch, and if the init includes any length of strings in "empirical_samples" (even an empty string: ""!), it takes the second branch. There is a high possibility that an unexpected branch may be called.

I propose changing it to:

def initialize_latent(init, input_dim, Y):
    Xr = np.asfortranarray(np.random.normal(0, 1, (Y.shape[0], input_dim)))
    if init == 'PCA':
        ...
    elif init == 'empirical_samples':
        ...
    else:
        ...

In addition, I can't find any situations where 'empirical_samples' is used in any of the calling modules. The init possibly has two values: "PCA" and "random" (I apologize if I missed others), so it seems better like this:

def initialize_latent(init, input_dim, Y):
    Xr = np.asfortranarray(np.random.normal(0, 1, (Y.shape[0], input_dim)))
    if init == 'PCA':
        ...
    elif init == 'random':
        ...
    else:
        raise ValueError(...)

The text was updated successfully, but these errors were encountered:

MartinBubel · 2024-07-10T21:34:39Z

Hi @kubotty
thank you for the issue.

I agree with you that this is potentially unsafe and that your suggestion is better/safer.
Personally I would prefer an enum but that would introduce a major change which is not what I would like to have at the moment - so your suggestion remains great.

I also agree with your comments on the "empirical_samples" branch. However, I would like to keep this as is. I don't have the overview if this is something that may is in use by some power users or whether there is any pull request in the loop that is using it.
Therefore I recommend to keep the empirical samples even if it does not seem necessary at the moment.
Also, I would like to keep "random" in the else case. Even if I cannot think of a dangerous scenario at the moment, I don't feel comfortable with such a change at the moment.

I joined GPy just a while ago and came more or less for maintenance. I don't have the full overview on the backend and thus I am a bit hesitating when it comes to deeper changes.

With that said, feel free to start a pull request if you want :)

Best
Martin

lawrennd · 2024-07-11T07:23:35Z

Hi Both,

@MartinBubel you're doing a great job and feel free to have confidence around deeper changes! I wouldn't be 100% sure of the protocol here, but perhaps a deprecation warning initially for empirical samples? Then that would trigger anyone who's using it to (hopefully) raise an issue if it's a problem to remove it?

Neil

MartinBubel · 2024-07-12T19:08:24Z

thanks Neil for your input. Deprecation warning like "empirical samples is on a deprecation path will be removed in a future version, raise an issue if you need it" would definitely be a good solution - probably the best.

MartinBubel · 2024-07-12T19:09:49Z

@kubotty are you interested in doing the PR? If so, feel free to choose me for the review.

MartinBubel added the good first issue label Jul 10, 2024

MartinBubel mentioned this issue Jul 10, 2024

Release patch v1.13.2 #1071

Closed

MartinBubel self-assigned this Jul 21, 2024

MartinBubel linked a pull request Jul 21, 2024 that will close this issue

1078 the init status in initialize latent #1082

Merged

MartinBubel added a commit that referenced this issue Jul 21, 2024

fix changes made to initializatino in #1078

8274d02

MartinBubel closed this as completed in #1082 Jul 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The init status in initialize_latent #1078

The init status in initialize_latent #1078

kubotty commented Jul 5, 2024

MartinBubel commented Jul 10, 2024 •

edited

Loading

lawrennd commented Jul 11, 2024

MartinBubel commented Jul 12, 2024

MartinBubel commented Jul 12, 2024

The init status in initialize_latent #1078

The init status in initialize_latent #1078

Comments

kubotty commented Jul 5, 2024

MartinBubel commented Jul 10, 2024 • edited Loading

lawrennd commented Jul 11, 2024

MartinBubel commented Jul 12, 2024

MartinBubel commented Jul 12, 2024

MartinBubel commented Jul 10, 2024 •

edited

Loading