Change the repository type filter
All
Repositories list
45 repositories
truss
PublicThe simplest way to serve AI/ML models in production.github
Publicautoscaler
Publicaxolotl
PublicHackMIT-2024
PublicWorkshop-TRT-LLM
Publicgpu-operator
PublicTensorRT-LLM
PublicTensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.tensorrtllm_backend
Publicpython_backend
Publiclangchain
Publicdiffusers
Publicchainlit-cookbook
Publicpygmalion-6b-truss
Public archivempt-7b-base-truss
Public archivestablelm-truss
Public archivewizardlm-truss
Public archiveinfrastructure-take-home
Public templatebackend-take-home
Public templatevicunlocked-alpaca-30b
Public archivewizardlm-truss-1
Public archivestarcoder-truss
Public archivefalcon-7b-truss
Public archivekaniko
Publicdemos
Publicquestion_answering
Public