Bethge Lab

foolbox Public

A Python toolbox to create adversarial examples that fool neural networks in PyTorch, TensorFlow, and JAX

CiteME Public

CiteME is a benchmark designed to test the abilities of language models in finding papers that are cited in scientific texts.

Python 39 4

model-vs-human Public

Benchmark your model on out-of-distribution datasets with carefully collected human comparison data (NeurIPS 2021 Oral)

Python 339 51

robust-detection-benchmark Public

Code, data and benchmark from the paper "Benchmarking Robustness in Object Detection: Autonomous Driving when Winter is Coming" (NeurIPS 2019 ML4AD)

Jupyter Notebook 181 24

imagecorruptions Public

Python package to corrupt arbitrary images.

Python 420 69

stylize-datasets Public

A script that applies the AdaIN style transfer method to arbitrary datasets

Python 157 37

Provide feedback