I see that WideDeep has parallel-batch support, where the number of input samples can be broken into N chunks and run in parallel.
E.g., if we give parallel_batches=28, with inter=28 and intra and num_threads=1, for a CSL single socket,
the benchmark will launch 28 graph executions in parallel, each working on the provided batch size. Please correct me if I am wrong.
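If my understanding is right, the idea can be sketched with plain Python threads (this is only an illustration of the concept, not the actual benchmark code; `run_graph` is a hypothetical stand-in for one TensorFlow graph execution):

```python
from concurrent.futures import ThreadPoolExecutor

def run_graph(batch):
    # Hypothetical stand-in for one session.run() call
    # (intra-op threads = 1, so each execution is single-threaded).
    return [x * 2 for x in batch]  # dummy "inference"

parallel_batches = 28  # maps to inter-op parallelism = 28
# Each worker gets the full provided batch size (4 here, for illustration).
batches = [[i] * 4 for i in range(parallel_batches)]

# 28 graph executions run in parallel, each on its own batch.
with ThreadPoolExecutor(max_workers=parallel_batches) as pool:
    results = list(pool.map(run_graph, batches))
```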
Now I want to do the same for a CNN model (e.g. InceptionV3).
For example, given a batch size of 280 and parallel_batches=28, 28 graph executions would be created, each working on 10 images.
Is this supported? If yes, how can I use it?
If not, is it planned?
I want to do the same for BERT as well.
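What I am asking for in the CNN case could be sketched like this (again a hypothetical illustration, not existing benchmark code; `infer_chunk` stands in for one graph execution on its slice of the batch):

```python
from concurrent.futures import ThreadPoolExecutor

def infer_chunk(images):
    # Hypothetical stand-in for one InceptionV3 graph execution.
    return len(images)  # dummy: just count the images processed

batch_size = 280
parallel_batches = 28
chunk = batch_size // parallel_batches  # 10 images per graph execution

images = list(range(batch_size))  # placeholder for the input images
chunks = [images[i:i + chunk] for i in range(0, batch_size, chunk)]

# 28 executions run in parallel, each handling 10 images.
with ThreadPoolExecutor(max_workers=parallel_batches) as pool:
    processed = list(pool.map(infer_chunk, chunks))
```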
This is the reference I followed:
https://github.com/IntelAI/models/blob/master/benchmarks/recommendation/tensorflow/wide_deep_large_ds/inference/fp32/Advanced.md