[Question] Resolving the context length issue in tgi-service when running ChatQNA #1394
Open
2 of 6 tasks
Labels
bug
Something isn't working
Priority
Undecided
OS type
Ubuntu
Hardware type
Xeon-other (Please let us know in description)
Installation method
Deploy method
Running nodes
Single Node
What's the version?
Description
Environment
Attempted Solutions
Increase context length option through argument, ultimately max of 8192 is not sufficient
Questions
Impact
Unable to get comprehensive responses from the model due to context length limitations.
Reproduce steps
Start Services
Steps to Reproduce
Error Log
Raw log
The text was updated successfully, but these errors were encountered: