Khassanov Y 1

1. Speakingfaces: A large-scale multimodal dataset of voice commands with visual and thermal video streams
1