assel-j 1

1. End-to-End Multi-Modal Speaker Change Detection with Pre-Trained Models
1