Hi Team,
I am trying to implement a scoring method in my inference calls using SGLang's choices methods: https://sgl-project.github.io/frontend/choices_methods.html. I have a single prompt that combines A and B, and I want to determine whether B is a good pick for A. The model should return Yes or No.
In my Python code, I initialize the Engine:
sgl.Engine(model_path=self.model_path, tokenizer_path=self.tokenizer_path)
At inference time, I call prompt_function.run(), but I get a "Please specify a backend" error. How do I use the Engine (or an equivalent SRT engine that doesn't require me to spin up an HTTP server layer)? How do I define the backend for these choices function calls? Or is there another way to define the choices via the Engine?
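To make the intended Yes/No scoring concrete, here is a minimal sketch in plain Python. It is not SGLang's implementation and does not call SGLang at all. It assumes per-token log probabilities for each candidate answer are already available, and it picks the answer with the highest mean log probability, which is my understanding of what a token-length-normalized choices method does. The function name pick_choice and the numbers are hypothetical.

```python
from typing import Dict, List


def pick_choice(token_logprobs: Dict[str, List[float]]) -> str:
    """Return the choice whose tokens have the highest mean log probability.

    token_logprobs maps each candidate answer (e.g. "Yes", "No") to the
    log probabilities the model assigned to that answer's tokens.
    Hypothetical helper for illustration, not part of SGLang.
    """
    if not token_logprobs:
        raise ValueError("at least one choice is required")

    def mean_logprob(choice: str) -> float:
        values = token_logprobs[choice]
        if not values:
            raise ValueError(f"no token log probabilities for {choice!r}")
        # Dividing by the token count keeps longer answers from being
        # penalized just for having more tokens.
        return sum(values) / len(values)

    return max(token_logprobs, key=mean_logprob)


# Made-up log probabilities for the combined A + B prompt.
example_scores = {"Yes": [-0.2], "No": [-1.7]}
decision = pick_choice(example_scores)
print(decision)
```

In the real setup the log probabilities would come from the model through whichever backend ends up serving the choices call, which is the part this question is about.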