Repositories
- sort-and-search Public
Code for the paper: "Efficient Lifelong Model Evaluation in an Era of Rapid Progress" [NeurIPS'24]
- model-vs-human Public
Benchmark your model on out-of-distribution datasets with carefully collected human comparison data (NeurIPS 2021 Oral)
- frequency_determines_performance Public
Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance" [NeurIPS'24]
- DataTypeIdentification Public
Code for the ICLR'24 paper: "Visual Data-Type Understanding does not emerge from Scaling Vision-Language Models"
- robustness Public
Robustness and adaptation of ImageNet-scale models. Pre-release; stay tuned for updates.
- game-of-noise Public
Trained model weights, plus training and evaluation code, from the paper "A simple way to make neural networks robust against diverse image corruptions"