A curated list of works on the various stages of Sign Language Production (SLP), the automatic translation of spoken-language sentences into sign language sequences. It covers the literature on deep learning-based sign language generation, sign language synthesis, sign language avatar recovery, pose estimation, and related publications.
I am gathering these papers as literature for my PhD, and thought others may be interested. If you have any updates, please feel free to contribute or email me at [email protected].
- T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text. [ACL-ccfa][Paper]
- MS2SL: Multimodal Spoken Data-Driven Continuous Sign Language Production. [ACL-ccba][Paper]
- Unsupervised Sign Language Translation and Generation [ACL-ccba][Paper][Code]
- Pose Guided Fine-Grained Sign Language Video Generation. [ECCV-ccfb][Paper]
- SignGen: End-to-End Sign Language Video Generation with Latent Diffusion. [ECCV-ccfb][Paper]
- Semantic-driven diffusion for sign language production with gloss-pose latent spaces alignment. [CVIU-ccfb][Paper]
- Multi-Channel Spatio-Temporal Transformer for Sign Language Production. [LREC-ccfb][Paper]
- Attentional bias for hands: Cascade dual-decoder transformer for sign language production [IETCV-ccfc][Paper]
- Select and Reorder: A Novel Approach for Neural Sign Language Production. [LREC-ccfb][Paper]
- Sign Language Production With Latent Motion Transformer. [WACV 2024][Paper]
- SignNet: Single Channel Sign Generation using Metric Embedded Learning. [ICAFGR-ccbc][Paper]
- Signing at Scale: Learning to Co-Articulate Signs for Large-Scale Photo-Realistic Sign Language Production. [CVPR-ccfa][Paper]
- G2P-DDM: Generating Sign Pose Sequence from Gloss Sequence with Discrete Diffusion Model. [AAAI-ccfa][Paper]
- Modeling Intensification for Sign Language Generation: A Computational Approach. [ACL-ccfa][Paper][Code]
- DualSign: Semi-Supervised Sign Language Production with Balanced Multi-Modal Multi-Task Dual Transformation. [ACM Multimedia-ccfa][Paper]
- Gloss Semantic-Enhanced Network with Online Back-Translation for Sign Language Production. [ACM Multimedia-ccfa][Paper]
- Mixed SIGNals: Sign Language Production via a Mixture of Motion Primitives. [ICCV-ccfa][Paper]
- Continuous 3D Multi-Channel Sign Language Production via Progressive Transformers and Mixture Density Networks. [IJCV-ccba][Paper]
- Towards Fast and High-Quality Sign Language Production. [ACM Multimedia-ccfa][Paper]
- Towards Automatic Speech to Sign Language Generation. [Interspeech-ccfa][Paper]
- Non-Autoregressive Sign Language Production with Gaussian Space. [BMVC-ccfc][Paper]
- Text2Sign: Towards Sign Language Production Using Neural Machine Translation and Generative Adversarial Networks. [IJCV-ccba][Paper]
- Progressive Transformers for End-to-End Sign Language Production. [ECCV-ccfb][Paper][Code]
- SignSynth: Data-Driven Sign Language Video Generation. [ECCV-ccfb][Paper]
- Skeleton-based Chinese sign language recognition and generation for bidirectional communication between deaf and hearing people. [NN-ccfb][Paper]
- Adversarial Training for Multi-Channel Sign Language Production. [BMVC-ccfc][Paper]
- Cross-modal Neural Sign Language Translation [ACM Multimedia-ccfa][Paper]
- Deep Gesture Video Generation With Learning on Regions of Interest. [TMM-ccfb][Paper]
- Neural Sign Actors: A diffusion model for 3D sign language production from text. [CVPR-ccba][Paper]
- SignAvatars: A Large-scale 3D Sign Language Holistic Motion Dataset and Benchmark. [ECCV-ccfb][Paper]
- A Simple Baseline for Spoken Language to Sign Language Translation with 3D Avatars. [ECCV-ccfb][Paper]
- SignAvatar: Sign Language 3D Motion Reconstruction and Generation. [ICAFGR-ccbc][Paper]
- A Comparative Study of Video-Based Human Representations for American Sign Language Alphabet Generation. [ICAFGR-ccbc][Paper]
- SynthSL: Expressive Humans for Sign Language Image Synthesis [ICAFGR-ccbc][Paper]
- Reconstructing Signing Avatars from Video Using Linguistic Priors. [CVPR-ccfa][Paper]
- There and Back Again: 3D Sign Language Generation from Text Using Back-Translation. [3DV-ccfc][Paper]
- Uncertainty-aware Sign Language Video Retrieval with Probability Distribution Modeling. [ECCV-ccfb][Paper]
- SignCLIP: Connecting Text and Sign Language by Contrastive Learning. [EMNLP-ccfb][Paper]
- CiCo: Domain-Aware Sign Language Retrieval via Cross-Lingual Contrastive Learning [CVPR-ccfa][Paper][Code]
- Phoenix-2014T: Please follow https://www-i6.informatik.rwth-aachen.de/~koller/RWTH-PHOENIX-2014-T/.
- CSL-Daily: Please follow http://home.ustc.edu.cn/~zhouh156/dataset/csl-daily/.
- WLASL: Please follow https://dxli94.github.io/WLASL/.
- MSASL: Please follow https://www.microsoft.com/en-us/research/project/ms-asl/.
- Jointly Harnessing Prior Structures and Temporal Consistency for Sign Language Video Generation. [TOMM-ccfb][Paper]
- Ham2Pose: Animating Sign Language Notation into Pose Sequences. [CVPR-ccfa][Paper]
- ANONYSIGN: Novel Human Appearance Synthesis for Sign Language Video Anonymisation [ICAFGR-ccbc][Paper]
- Enhancing Sign Language Teaching: A Mixed Reality Approach for Immersive Learning and Multi-Dimensional Feedback.
- Improving Gloss-free Sign Language Translation by Reducing Representation Density
- Learning to Score Sign Language with Two-stage Method