-
Notifications
You must be signed in to change notification settings - Fork 412
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Optimizing over discrete parameter domains #177
Comments
Hi @avimit, BoTorch itself at this point doesn't directly deal with discrete parameters, but If your goal is simply to run BayesOpt (rather than developing new BayesOpt primitives), you can easily do this using the higher-level Ax interface. Ax will perform appropriate transformations (e.g. rounding for ordinal parameters, or one-hot encoding for categorical ones) when talking to BoTorch under the hood. We are also working on improving support for discrete parameters directly in BoTorch (e.g. by using a concrete relaxation to still enable gradient-based optimization). Finally, if you're willing to write your own Kernels or even models that comply with our basic API, you can do anything you like directly with discrete parameters and still use BoTorch's acquisition functions. Note, however, that optimizing over mixed domains will then also require using a different optimization algorithm, i.e., you can't use the functions in Hope this helps! |
Hi @Balandat, Thank you for the answer! I will have a look at Ax, but I feel that we are in need of the more "low level" character of BoTorch. Explanation: we have have our own small platform for distributing & managing jobs (mostly DL training) on the cloud, and on top of it we sometimes run HPO. Our HPO framework treats the training of models as a "black box" function, launching train jobs as resources are freed, and collecting evaluations when ready. Our use case for BoTorch is as a supplier of next points for evaluation; I managed to successfully integrate it (and enjoyed the process!) as long as I don't use discrete parameters. Thanks, |
Glad you were able to use BoTorch directly for this. That said, Ax is designed for exactly your use case, and provides a number of different APIs (including lower-level ask/tell style ones) that should make that integration quite straightforward. You can also stick your own BoTorch components into these APIs if you want to retain the flexibility of customizing your own models/acquisition functions. See the tutorial on this. As I mentioned, we are working on improving support for discrete parameters directly in Botorch. For now, what you could do is perform the parameter transformations that Ax does in this module yourself directly in torch. |
I will sure give it a try! |
@avimit , based on your description, I think that Ax is the appropriate tool to plug into. For this ordered, categorical parameter (param3), you'll want to use a Your setup sounds kind of like what one might do with Slurm or Ray Tune. We have forthcoming interfaces between Ax and these platforms, and they were quite straightforward to implement. |
You might want to look at the source code for the Ray Tune interface as an example. |
Can you please direct me to a function in Ax which returns next candidates for evaluation, similar to joint_optimize() in BoTorch? (I have been doing something like this:
) Thanks! |
If you use Ax's service api see API examples here, You can also use the developer api (next tab on the above page) - you can access the generated candidates ("arms") by accessing th |
Thanks! I was there, but never noticed the tabs... Aren't you supposed to be resting on 4th of July? ;) |
Hi, I must admit that I find implementing Ax for my task much harder compared to BoTorch. The five schematic BoTorch lines I quote above do what I want: supply X new candidates for evaluation, given the known+pending evaluations. Haven't yet found a way to do it with ax.... Do I have to state an evaluation function when defining an "experiment"? Our "evaluation" is a training & testing of an ML model done on a cloud sever. I just want to feed the results to the model. Also, I couldn't find how to load the known+pending evaluations to the model (and are the objective_weights, that the gen() fonction of BotorchModel requires, factors for low/high-fidelity evals?) Thanks, |
If you want to manage that manually you could just use the service API - calling The benefit of using Ax for this is that it keeps track of all the metadata, so you can for instance store a complete history of your experiment in a sql database (if you hook Ax up to it with the appropriate tables). The developer API gives you even more flexibility. For instance you can write a I hope this helps. If you have specific questions about Ax, we should probably move the discussion to an issue in the Ax repo. There you'll also have folks like @lena-kashtelyan, @amurthy1, @sdsingh who understand the Ax APIs much better than I do. |
I write Like this, When I'd like discrete inputs(>=2 this case x2, x3) candidate, how should I use it? |
So doing If you want to optimize the continuous features and fix the discrete ones, you should be doing sth like |
Is this feature available now in BoTorch? I found the optimize_acqf() function only accepts continuous bounds in the bounds parameter. |
Yes it is, check out We also have an incoming PR (#1533) that implements the method from [1], but it'll take a bit to get that fully cleaned up and merged. [1] S. Daulton et al. Bayesian Optimization over Discrete and Mixed Spaces via Probabilistic Reparameterization. NeurIPS 2022 |
So if my understanding is correct, in optimize_acqf_mixed the bounds parameter handles continuous features and fixed_features_list handles discrete ones. So if we supply both arguments, such as the learning rate for bounds and the number of nodes in a layer for fixed_features_list, the function will condition on each unique discrete value of fixed_features_list and search for the optimal continuous value? In other words, it's the same as if we were to have a outer for loop over the discrete values and return the best one? |
That is correct, it's a convenience function for essentially doing that. While this is the gold standard for optimizing, this obviously doesn't scale with the number of discrete features. Which is why we worked on [1], so this is incoming. If you have discrete parameters with many ordered values (e.g. integers over a large range) it's usually fine to do a continuous relaxation for those (optimize over continuous range and then round). |
Sounds pretty cool! Looking forward to it! |
Can BoTorch be used over discrete parameter domains?
(if so, than this is a feature inquiry, and not a feature "request")
We have a use case of domains which are partly continuous, partly discrete, like:
[{"name": "param1", "type": "continuous", "domain": [-5, 10]}, {"name": "param2", "type": "continuous", "domain": [1, 15]}, {"name": "param3", "type": "discrete", "domain": [1, 1.5, 2, 2.5, 3, 3.5, 4]}]
The functions under "botorch/optim/optimize.py" accept an argument called "bounds", which you define as : "bounds: A
2 x d
tensor of lower and upper bounds for each column ofX
".These are obviously bounds for a continuous search space. Can BoTorch be used for searching over discrete spaces?
Thank you so much for the package!
Avi
The text was updated successfully, but these errors were encountered: