This document has instructions for running SSD-ResNet34 inference using Intel-optimized TensorFlow.
The SSD-ResNet34 accuracy script accuracy.sh uses the COCO validation dataset in the TF records format.
See the COCO dataset document for instructions on downloading and preprocessing the COCO validation dataset.
The inference scripts use synthetic data, so no dataset is required.
After the script to convert the raw images to the TF records file completes, rename the tf_records file:
mv ${OUTPUT_DIR}/coco_val.record ${OUTPUT_DIR}/validation-00000-of-00001
When running the accuracy test, set DATASET_DIR to the folder that contains the validation-00000-of-00001 file.
Note that the inference performance tests use a synthetic dataset.
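Before starting an accuracy run, it can help to confirm that the renamed record file is where the accuracy script will look for it. The following is a minimal sketch and not part of the quickstart scripts: the function name check_coco_record is hypothetical, and the only thing it assumes is the file name given above.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the quickstart scripts): checks that
# the renamed COCO TF records file is present in the given directory.
check_coco_record() {
  local dir="$1"
  local record="${dir}/validation-00000-of-00001"
  if [ -z "${dir}" ]; then
    echo "DATASET_DIR is not set" >&2
    return 1
  fi
  if [ ! -f "${record}" ]; then
    echo "Missing ${record}" >&2
    return 1
  fi
  echo "Found ${record}"
}
```

A typical call would be check_coco_record "${DATASET_DIR}" || exit 1, placed just before the accuracy script is launched.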
Script name | Description |
---|---|
accuracy_1200.sh | Measures the inference accuracy (providing a DATASET_DIR environment variable is required) for the specified precision (fp32, int8 or bfloat16) with an input size of 1200x1200. |
accuracy.sh | Measures the inference accuracy (providing a DATASET_DIR environment variable is required) for the specified precision (fp32, int8 or bfloat16) with an input size of 300x300. |
inference_1200.sh | Runs inference with a batch size of 1 using synthetic data for the specified precision (fp32, int8 or bfloat16) with an input size of 1200x1200. Prints out the time spent per batch and total samples/second. |
inference.sh | Runs inference with a batch size of 1 using synthetic data for the specified precision (fp32, int8 or bfloat16) with an input size of 300x300. Prints out the time spent per batch and total samples/second. |
multi_instance_online_inference_1200.sh | Runs multi instance realtime inference (batch-size=1) using 4 cores per instance for the specified precision (fp32, int8 or bfloat16). Uses synthetic data with an input size of 1200x1200. Waits for all instances to complete, then prints a summarized throughput value. |
multi_instance_batch_inference_1200.sh | Runs multi instance batch inference (batch-size=16) using 1 instance per socket for the specified precision (fp32, int8 or bfloat16). Uses synthetic data with an input size of 1200x1200. Waits for all instances to complete, then prints a summarized throughput value. |
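
The naming pattern of the single-instance scripts in the table (300x300 scripts have no suffix, 1200x1200 scripts end in _1200) can be captured in a small lookup. This is only an illustrative sketch: the function quickstart_script_name is hypothetical, and it deliberately leaves out the two multi-instance scripts, which exist only for the 1200x1200 input size.

```shell
#!/usr/bin/env bash
# Hypothetical helper: maps a test type and input size to one of the
# single-instance quickstart script names listed in the table.
quickstart_script_name() {
  local mode="$1"   # "accuracy" or "inference"
  local size="$2"   # "300" or "1200"
  case "${mode}" in
    accuracy|inference) ;;
    *) echo "Unknown mode: ${mode}" >&2; return 1 ;;
  esac
  case "${size}" in
    300)  echo "${mode}.sh" ;;
    1200) echo "${mode}_1200.sh" ;;
    *) echo "Unknown input size: ${size}" >&2; return 1 ;;
  esac
}
```

For example, quickstart_script_name inference 1200 prints inference_1200.sh.
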
Set up your environment using the instructions below, depending on whether you are using AI Tools:
Setup using AI Tools on Linux | Setup without AI Tools on Linux | Setup without AI Tools on Windows |
---|---|---|
To run using AI Tools on Linux you will need: | To run without AI Tools on Linux you will need: | To run without AI Tools on Windows you will need: |
The TensorFlow models and benchmarks repos are used by SSD-ResNet34 inference.
Clone those at the git SHAs specified below and set the TF_MODELS_DIR environment variable to point to the directory where the models repo was cloned.
git clone --single-branch https://github.com/tensorflow/models.git tf_models
git clone --single-branch https://github.com/tensorflow/benchmarks.git ssd-resnet-benchmarks
cd tf_models
export TF_MODELS_DIR=$(pwd)
git checkout f505cecde2d8ebf6fe15f40fb8bc350b2b1ed5dc
cd ../ssd-resnet-benchmarks
git checkout 509b9d288937216ca7069f31cfb22aaa7db6a4a7
cd ..
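Because the scripts depend on specific commits, it may be worth checking that each clone really is at the expected SHA. The sketch below is an assumption-laden illustration, not part of the repository: verify_checkout is a hypothetical name, and it relies only on the standard git -C and git rev-parse HEAD commands.

```shell
#!/usr/bin/env bash
# Hypothetical helper: succeeds only if the repo in the given directory
# has the expected full commit SHA checked out.
verify_checkout() {
  local repo_dir="$1"
  local expected_sha="$2"
  local actual_sha
  actual_sha="$(git -C "${repo_dir}" rev-parse HEAD 2>/dev/null)" || {
    echo "Could not read HEAD in ${repo_dir}" >&2
    return 1
  }
  if [ "${actual_sha}" != "${expected_sha}" ]; then
    echo "${repo_dir} is at ${actual_sha}, expected ${expected_sha}" >&2
    return 1
  fi
}
```

With the clones above, the calls would be verify_checkout "${TF_MODELS_DIR}" f505cecde2d8ebf6fe15f40fb8bc350b2b1ed5dc and verify_checkout ssd-resnet-benchmarks 509b9d288937216ca7069f31cfb22aaa7db6a4a7.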
Download the SSD-ResNet34 pretrained model for either the 300x300 or the 1200x1200 input size, depending on which quickstart script you are going to run.
Set the PRETRAINED_MODEL environment variable to the path of the pretrained model that you'll be using.
If you run on Windows, please use a browser to download the pretrained model using the link below.
For Linux, run:
# SSD-ResNet34 FP32 and BFloat16 300x300 Pretrained model
wget https://storage.googleapis.com/intel-optimized-tensorflow/models/v1_8/ssd_resnet34_fp32_bs1_pretrained_model.pb
export PRETRAINED_MODEL=$(pwd)/ssd_resnet34_fp32_bs1_pretrained_model.pb
# SSD-ResNet34 FP32 and BFloat16 1200x1200 Pretrained model
wget https://storage.googleapis.com/intel-optimized-tensorflow/models/v1_8/ssd_resnet34_fp32_1200x1200_pretrained_model.pb
export PRETRAINED_MODEL=$(pwd)/ssd_resnet34_fp32_1200x1200_pretrained_model.pb
# SSD-ResNet34 Int8 300x300 Pretrained model
wget https://storage.googleapis.com/intel-optimized-tensorflow/models/v1_8/ssd_resnet34_int8_bs1_pretrained_model.pb
export PRETRAINED_MODEL=$(pwd)/ssd_resnet34_int8_bs1_pretrained_model.pb
# SSD-ResNet34 Int8 1200x1200 Pretrained model
wget https://storage.googleapis.com/intel-optimized-tensorflow/models/v1_8/ssd_resnet34_int8_1200x1200_pretrained_model.pb
export PRETRAINED_MODEL=$(pwd)/ssd_resnet34_int8_1200x1200_pretrained_model.pb
# SSD-ResNet34 Int8 1200x1200 Pretrained model for oneDNN Graph (only used when the Intel Extension for TensorFlow plugin is installed, as oneDNN Graph optimization is enabled by default at this point)
wget https://storage.googleapis.com/intel-optimized-tensorflow/models/2_12_0/ssd_rn34_itex_int8.pb
export PRETRAINED_MODEL=$(pwd)/ssd_rn34_itex_int8.pb
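The choice among the first four downloads follows from the precision and the input size. As a hedged illustration, the hypothetical function pretrained_model_file below encodes that mapping using only the file names from the commands above. It does not cover the ssd_rn34_itex_int8.pb model for oneDNN Graph, which has to be selected by hand.

```shell
#!/usr/bin/env bash
# Hypothetical helper: prints the pretrained model file name from the
# download commands above for a given precision and input size.
pretrained_model_file() {
  local precision="$1"   # "fp32", "bfloat16" or "int8"
  local size="$2"        # "300" or "1200"
  local prefix suffix
  case "${precision}" in
    fp32|bfloat16) prefix="ssd_resnet34_fp32" ;;  # FP32 and BFloat16 share one file
    int8)          prefix="ssd_resnet34_int8" ;;
    *) echo "Unknown precision: ${precision}" >&2; return 1 ;;
  esac
  case "${size}" in
    300)  suffix="bs1" ;;
    1200) suffix="1200x1200" ;;
    *) echo "Unknown input size: ${size}" >&2; return 1 ;;
  esac
  echo "${prefix}_${suffix}_pretrained_model.pb"
}
```

After downloading, the result could be used as export PRETRAINED_MODEL=$(pwd)/$(pretrained_model_file int8 1200).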
Set the environment variables and run a quickstart script on either Linux or Windows systems.
If you are running the accuracy test, set DATASET_DIR to point to the folder where the COCO dataset validation-00000-of-00001 file is located.
See the list of quickstart scripts for details on the different options.
# cd to your AI Reference Models directory
cd models
# set environment variables
export DATASET_DIR=<directory with the validation-*-of-* files (for accuracy testing only)>
export TF_MODELS_DIR=<path to the TensorFlow Models repo>
export PRECISION=<set the precision to "int8" or "fp32" or "bfloat16">
export PRETRAINED_MODEL=<path to the 300x300 or 1200x1200 pretrained model pb file>
export OUTPUT_DIR=<path to the directory where log files will be written>
# For a custom batch size, set env var `BATCH_SIZE` or it will run with a default value.
export BATCH_SIZE=<customized batch size value>
./quickstart/object_detection/tensorflow/ssd-resnet34/inference/cpu/<script name>.sh
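A missing or mistyped variable is easier to diagnose before the script starts than from its logs. The following pre-flight check is a sketch only: check_quickstart_env is a hypothetical name, and it assumes that PRECISION, PRETRAINED_MODEL, TF_MODELS_DIR and OUTPUT_DIR are needed for every script, while DATASET_DIR and BATCH_SIZE are optional and therefore not checked.

```shell
#!/usr/bin/env bash
# Hypothetical pre-flight check (not part of the quickstart scripts).
# Reports every problem it finds and returns non-zero if there is any.
check_quickstart_env() {
  local status=0
  case "${PRECISION:-}" in
    fp32|int8|bfloat16) ;;
    *) echo "PRECISION must be fp32, int8 or bfloat16" >&2; status=1 ;;
  esac
  if [ ! -f "${PRETRAINED_MODEL:-}" ]; then
    echo "PRETRAINED_MODEL does not point to a file" >&2; status=1
  fi
  if [ ! -d "${TF_MODELS_DIR:-}" ]; then
    echo "TF_MODELS_DIR does not point to a directory" >&2; status=1
  fi
  if [ -z "${OUTPUT_DIR:-}" ]; then
    echo "OUTPUT_DIR is not set" >&2; status=1
  fi
  return "${status}"
}
```

Running check_quickstart_env || exit 1 after the export lines stops early with a list of everything that still needs to be set.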
Using cmd.exe, run:
rem cd to your AI Reference Models directory
cd models
set PRETRAINED_MODEL=<path to the 300x300 or 1200x1200 pretrained model pb file>
set DATASET_DIR=<directory with the validation-*-of-* files (for accuracy testing only)>
set PRECISION=<set the precision to "int8" or "fp32">
set OUTPUT_DIR=<directory where log files will be written>
set TF_MODELS_DIR=<path to the TensorFlow Models repo>
rem For a custom batch size, set env var BATCH_SIZE or it will run with a default value.
set BATCH_SIZE=<customized batch size value>
bash quickstart\object_detection\tensorflow\ssd-resnet34\inference\cpu\<script name>.sh
Note: You may use cygpath to convert Windows paths to Unix paths before setting the environment variables. For example, if the dataset location on Windows is D:\user\coco_dataset, convert the Windows path to a Unix path as shown:
cygpath D:\user\coco_dataset
/d/user/coco_dataset
Then, set the DATASET_DIR environment variable:
set DATASET_DIR=/d/user/coco_dataset
- To run more advanced use cases, see the instructions for the available precisions (FP32, Int8, BFloat16) for calling the launch_benchmark.py script directly.
- To run the model using Docker, see the Intel® Developer Catalog workload container: https://www.intel.com/content/www/us/en/developer/articles/containers/ssd-resnet34-fp32-inference-tensorflow-container.html