Maxime Burchi et al.: Multilingual Audio-Visual Speech Recognition with Hybrid CTC/RNN-T Fast Conformer. (2024)journals/corr/abs-2405-1298310.48550/ARXIV.2405.12983Multilingual Audio-Visual Speech Recognition with Hybrid CTC/RNN-T Fast Conformer.5Maxime Burchi1Krishna C. Puvvada2Jagadeesh Balam3Boris Ginsburg4Radu Timofte5CoRRCoRRabs/2405.129832024provenance information for RDF data of dblp record 'journals/corr/abs-2405-12983'2024-06-24T20:41:43+0200