The lack of automatic pose evaluation metrics is a major obstacle in the development of sign language generation models.
This repository houses a suite of automatic evaluation metrics tailored to sign language poses, including the metrics proposed by Ham2Pose [1] as well as custom metrics of our own design. Evaluating isolated signs and evaluating continuous signing present distinct challenges, and our methods reflect this distinction.
## Qualitative Evaluation
To qualitatively demonstrate the efficacy of these evaluation metrics, we implement a nearest-neighbor search for selected signs from the TODO corpus. The rationale is straightforward: under an effective metric, a sign's nearest neighbors in the corpus should be perceptually similar signs, so inspecting them reveals how well the metric captures the nuances of sign language transcription and translation.
Using a sample of the corpus, we compute the any-to-any scores for each metric. Intuitively, given two random signs, a good metric should produce a poor score, since most sign pairs are unrelated. This should be reflected in the distribution of scores, which should be skewed toward low scores.
INSERT TABLE HERE
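As a rough sketch of this procedure (the `poses` list and the `metric` callable below are placeholders, not this toolkit's actual API; any function that scores a pair of poses would do), the any-to-any score matrix and nearest neighbors could be computed as follows:

```python
# Sketch: any-to-any scoring and nearest-neighbor lookup over a corpus sample.
# `poses` and `metric` are hypothetical stand-ins: a list of loaded poses and
# any callable of the form metric(hypothesis, reference) -> float.
import numpy as np

def any_to_any_scores(poses, metric):
    """Compute the full pairwise score matrix for a list of poses."""
    n = len(poses)
    scores = np.zeros((n, n))
    for i in range(n):
        for j in range(n):
            scores[i, j] = metric(poses[i], poses[j])
    return scores

def nearest_neighbors(scores, higher_is_better=False):
    """Return each sign's nearest neighbor, excluding the sign itself."""
    masked = scores.copy()
    np.fill_diagonal(masked, -np.inf if higher_is_better else np.inf)
    return masked.argmax(axis=1) if higher_is_better else masked.argmin(axis=1)
```

A histogram of the off-diagonal entries of the score matrix then shows whether the distribution is skewed toward low scores, as expected.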
## Quantitative Evaluation

Given an isolated-sign corpus such as AUTSL [2], we repeat the evaluation of Ham2Pose [1] using our metrics.
We also repeat the experiments of Atwell et al. [3] to evaluate the bias of our metrics with respect to different protected attributes.
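Purely as an illustration of the disaggregation involved (the `scores` and `groups` inputs are hypothetical and not part of this toolkit), comparing score statistics across attribute groups might look like this:

```python
# Sketch: summarize metric scores per protected-attribute group.
# `scores` is one metric score per evaluated sign; `groups` holds the
# corresponding attribute value for each sign (both hypothetical inputs).
from collections import defaultdict
from statistics import mean, stdev

def scores_by_group(scores, groups):
    """Mean and standard deviation of metric scores for each attribute group."""
    buckets = defaultdict(list)
    for score, group in zip(scores, groups):
        buckets[group].append(score)
    return {group: (mean(vals), stdev(vals) if len(vals) > 1 else 0.0)
            for group, vals in buckets.items()}
```

Large, systematic gaps between groups would suggest the metric is biased along that attribute.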
Finally, we evaluate each metric in the context of continuous signing, using both our continuous metrics and our segmented metrics, and correlate the resulting scores with human judgments.
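A minimal sketch of that correlation step, assuming parallel lists of metric scores and human ratings for the same outputs (`scipy` is used here for convenience and is not necessarily a dependency of this toolkit):

```python
# Sketch: correlate automatic metric scores with human judgments.
from scipy.stats import pearsonr, spearmanr

metric_scores = [0.12, 0.47, 0.33, 0.90]  # hypothetical metric outputs
human_ratings = [1.0, 3.5, 2.0, 4.5]      # hypothetical human judgments

rho, rho_p = spearmanr(metric_scores, human_ratings)
r, r_p = pearsonr(metric_scores, human_ratings)
print(f"Spearman rho={rho:.3f} (p={rho_p:.3f}); Pearson r={r:.3f} (p={r_p:.3f})")
```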
TODO: list the evaluation metrics here.
## Citation

If you use our toolkit in your research or projects, please consider citing the work:
```bibtex
@misc{pose-evaluation2024,
  title={Pose Evaluation: Metrics for Evaluating Sign Language Generation Models},
  author={Zifan Jiang and Colin Leong and Amit Moryossef},
  howpublished={\url{https://github.com/sign-language-processing/pose-evaluation}},
  year={2024}
}
```
## Contributions

- Zifan, Colin, and Amit developed the evaluation metrics and tools.
- Zifan, Anne, and Lisa conducted the qualitative and quantitative evaluations.
## References

1. Rotem Shalev-Arkushin, Amit Moryossef, and Ohad Fried. 2023. Ham2Pose: Animating Sign Language Notation into Pose Sequences. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).
2. Ozge Mercanoglu Sincan and Hacer Yalim Keles. 2020. AUTSL: A Large Scale Multi-Modal Turkish Sign Language Dataset and Baseline Methods. IEEE Access.
3. Atwell et al. 2024. Studying and Mitigating Biases in Sign Language Understanding Models. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pages 268–283, Miami, Florida, USA. Association for Computational Linguistics.