# Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval

A joint video and image encoder for end-to-end retrieval, based on the original Frozen in Time project and optimised for Graphcore's IPU.
| Framework | Domain | Model | Datasets | Tasks | Training | Inference | Reference |
|---|---|---|---|---|---|---|---|
| PyTorch | Vision | Frozen in Time | WebVid, MSR-VTT | Retrieval | ✅ | ❌ | Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval |
WebVid data can be found here.
- Install and enable the Poplar SDK (see Poplar SDK setup)
- Install the system and Python requirements (see Environment setup)
- Download the WebVid and MSR-VTT datasets (see Dataset setup)
## Poplar SDK setup

To check whether your Poplar SDK has already been enabled, run:
```bash
echo $POPLAR_SDK_ENABLED
```
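If you prefer a readable status message, the same check can be wrapped in a small conditional (a sketch, relying only on the fact that `enable.sh` sets `POPLAR_SDK_ENABLED`):

```bash
# Report whether the Poplar SDK has been enabled in the current shell.
# POPLAR_SDK_ENABLED is set by enable.sh; an empty or unset value means
# enable.sh has not been sourced yet.
if [ -n "${POPLAR_SDK_ENABLED:-}" ]; then
    echo "Poplar SDK enabled at: ${POPLAR_SDK_ENABLED}"
else
    echo "Poplar SDK not enabled"
fi
```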
If no path is provided, then follow these steps:
- Navigate to your Poplar SDK root directory
- Enable the Poplar SDK with:

```bash
cd poplar-<OS version>-<SDK version>-<hash>
. enable.sh
```

- Additionally, enable PopART with:

```bash
cd popart-<OS version>-<SDK version>-<hash>
. enable.sh
```
More detailed instructions on setting up your Poplar environment are available in the Poplar quick start guide.
## Environment setup

To prepare your environment, follow these steps:
- Create and activate a Python3 virtual environment:

```bash
python3 -m venv <venv name>
source <venv path>/bin/activate
```

- Navigate to the Poplar SDK root directory
- Install the PopTorch (PyTorch) wheel:

```bash
cd <poplar sdk root dir>
pip3 install poptorch...x86_64.whl
```

- Navigate to this example's root directory
- Install the apt requirements:

```bash
sudo apt install $(< required_apt_packages.txt)
```

- Install the Python requirements:

```bash
pip3 install -r requirements.txt
```
More detailed instructions on setting up your PyTorch environment are available in the PyTorch quick start guide.
## Dataset setup

As of 23 February 2024, these datasets are no longer available. The project's GitHub repository provides some guidance.
Download the videos:

```bash
python3 datasets/download.py --csv_path <path_to_train_csv> --part 0
python3 datasets/download.py --csv_path <path_to_val_csv> --part 0
```
Clean the videos:

```bash
mkdir data/WebVid/metadata
python3 datasets/clean_videos.py --csv_path <path_to_train_csv> --video_path data/WebVid/videos/ --clean_csv_path <path_to_clean_train_csv>
python3 datasets/clean_videos.py --csv_path <path_to_val_csv> --video_path data/WebVid/videos/ --clean_csv_path <path_to_clean_val_csv>
```
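To illustrate the kind of filtering such a cleaning step performs, here is a sketch (this is not the logic of `clean_videos.py`; the two-column `videoid,caption` CSV layout and the `<videoid>.mp4` naming are assumptions) that keeps only CSV rows whose video file exists and is non-empty:

```bash
# Sketch only - not the repository's clean_videos.py.
# Assumes a hypothetical CSV layout "videoid,caption" and videos stored
# as <video dir>/<videoid>.mp4.
clean_csv() {
    csv="$1"
    video_dir="$2"
    while IFS=, read -r videoid caption; do
        # Keep the row only if the video file exists and is non-empty (-s).
        if [ -s "${video_dir}/${videoid}.mp4" ]; then
            echo "${videoid},${caption}"
        fi
    done < "${csv}"
}
```

Example usage: `clean_csv data/WebVid/train.csv data/WebVid/videos > train_clean.csv`.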
Disk space required:
Download the dataset from the source
Disk space required:
## Running and benchmarking

To run a tested and optimised configuration and to reproduce the performance shown on our performance results page, use the examples_utils module (installed automatically as part of the environment setup) to run one or more benchmarks. The benchmarks are defined in the benchmarks.yml file in this example's root directory.

For example:

```bash
python3 -m examples_utils benchmark --spec <path to benchmarks.yml file>
```

Or to run a specific benchmark in the benchmarks.yml file provided:

```bash
python3 -m examples_utils benchmark --spec <path to benchmarks.yml file> --benchmark <name of benchmark>
```
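To see which names you can pass to --benchmark, you can list the top-level keys of the YAML file. This assumes the benchmark names are the top-level, unindented keys of benchmarks.yml, which may not hold for every entry (e.g. shared option blocks):

```bash
# List candidate benchmark names: top-level (unindented) YAML keys.
# Assumption: each benchmark is a top-level key in benchmarks.yml.
list_benchmarks() {
    grep -E '^[A-Za-z0-9_-]+:' "$1" | sed 's/:.*$//'
}
```

Example usage: `list_benchmarks benchmarks.yml`.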
For more information on using the examples-utils benchmarking module, please refer to the examples-utils README.
## License

This application is licensed under the MIT license. Please see the LICENSE file at the top level of this repository.
The following files are created by Graphcore and are licensed under MIT License:
- README.md
- clean_videos.py
- modeling/model_patch.py
- test/*
The following files include code derived from the Frozen-in-time repository, which uses the MIT license:
- arch.jpg
- run.py
- modeling/__init__.py
- modeling/loss.py
- modeling/metric.py
- modeling/model.py
- modeling/video_transformer.py
- modeling/trainer.py
- configs/*
- datasets/*
External packages:
- torchvision is licensed under the BSD 3-Clause License
- transformers is licensed under the Apache-2.0 license