-
Notifications
You must be signed in to change notification settings - Fork 70
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
TRT support for MAISI #701
Conversation
Signed-off-by: Boris Fomitchev <[email protected]>
Signed-off-by: Boris Fomitchev <[email protected]>
Signed-off-by: Boris Fomitchev <[email protected]>
Signed-off-by: Boris Fomitchev <[email protected]>
Signed-off-by: Boris Fomitchev <[email protected]>
Signed-off-by: Boris Fomitchev <[email protected]>
Signed-off-by: Boris Fomitchev <[email protected]>
Signed-off-by: Boris Fomitchev <[email protected]>
Hi @yiheng-wang-nv , Is the CI pipeline broken? Thanks. |
Signed-off-by: Yiheng Wang <[email protected]>
for more information, see https://pre-commit.ci
Signed-off-by: Yiheng Wang <[email protected]>
Hi @KumoLiu , just FYI, the MAISI tensorrt enhancement PR contains the content of this PR: Project-MONAI/MONAI#8153 We may need to merge this one first before merging the MAISI one |
Signed-off-by: Boris Fomitchev <[email protected]>
Hi @borisfom ,
Thanks, |
Should be fine for MAISI as I tested.
Thanks, |
@binliunls : how come image_decoding is much slower with TRT? How do I run a test for that ? |
I was running the command line like |
Signed-off-by: Boris Fomitchev <[email protected]>
for more information, see https://pre-commit.ci
Signed-off-by: Boris Fomitchev <[email protected]>
for more information, see https://pre-commit.ci
Project-MONAI/MONAI#8153 has been merged. |
Hi @borisfom ,
Thanks, |
Well, stage by stage measurements are tricky as processing is asynchronous and the sections may spill into next.
Unless you synchronize between stages, it may be misleading..
Is TRT even used in image decoding ?
The only 100% right way to measure if, say, converting controlnet to TRT has positive or negative impact is to compare end-to-end runs with original controlnet and TRT controlnet etc.
From: binliunls ***@***.***>
Date: Saturday, November 16, 2024 at 1:28 AM
To: Project-MONAI/model-zoo ***@***.***>
Cc: Boris Fomitchev ***@***.***>, Mention ***@***.***>
Subject: Re: [Project-MONAI/model-zoo] TRT support for MAISI (PR #701)
Hi @borisfom<https://github.com/borisfom> ,
Here is the benchmark details about 100 times running on MAISI with 256x256x256 input shape on A100 80GB. I am not sure why the Image Decoding suffers a slowdown. Can be some overhead issues. Will try to figure it out later.
Latency Type
TRT Mean Latency (s)
Bundle Mean Latency (s)
Mask Preparation
2.897087729
2.793987193
Feature Generation
35.12124193
76.54545327
Image Decoding
1.483238726
1.194563277
Latency Type
TRT Median Latency (s)
Bundle Median Latency (s)
Mask Preparation
2.90212667
2.80731046
Feature Generation
35.12641037
76.54435086
Image Decoding
1.490729215
1.17928219
Thanks,
Bin
—
Reply to this email directly, view it on GitHub<#701 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/ADMIIL5M4WLMO3QO434WIMT2A4F5FAVCNFSM6AAAAABQAK6QZGVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDIOBQGQ4TENZSGU>.
You are receiving this because you were mentioned.Message ID: ***@***.***>
|
Hi @borisfom , thanks for the contribution. I tried with the latest monai image with monai version 1.4.1rc1 built by @KumoLiu , but got error:
Could you please also help to update the README file and add some information about the requirement environment? (Like what we do for vista3d: https://github.com/Project-MONAI/model-zoo/tree/dev/models/vista3d#execute-inference-with-the-tensorrt-model) |
cc @binliunls also need the benchmark |
Hi @yiheng-wang-nv, the error is due to NVIDIA Volta support (GPUs with compute capability 7.0) has been removed starting with TensorRT 10.5. Here are some description in the |
@KumoLiu @binliunls : Correct, this is the issue with Volta support dropped in TensorRT 10.5. Probably, both Vista3D and MAISI READMEs should be updated with mention of that limitation - clients running Volta may still enjoy TRT acceleration with TensorRT 10.3-10.4.
|
Hi @yiheng-wang-nv , Thanks, |
/build |
Description
TRT optimization support for MAISI.
Depends on Project-MONAI/MONAI#8153
To successfully export MAISI, either 24.11 PyTorch container (or latest public Torch nightly) is needed, or this patch needs to be applied to 24.10-based container: