26th TSD 2023: Pilsen, Czech Republic
- Kamil Ekstein, Frantisek Pártl, Miloslav Konopík:
Text, Speech, and Dialogue - 26th International Conference, TSD 2023, Pilsen, Czech Republic, September 4-6, 2023, Proceedings. Lecture Notes in Computer Science 14102, Springer 2023, ISBN 978-3-031-40497-9
Text
- Xiaotian Wang, Tingxuan Li, Takuya Tamura, Shunsuke Nishida, Fuzhu Zhu, Takehito Utsuro:
Japanese How-to Tip Machine Reading Comprehension by Multi-task Learning Based on Generative Model. 3-14
- Ales Zagar, Marko Robnik-Sikonja:
One Model to Rule Them All: Ranking Slovene Summarizers. 15-24
- Frantisek Trebuna, Kristína Szabová, Ondrej Bojar:
Searching for Reasons of Transformers' Success: Memorization vs Generalization. 25-32
- Hynek Kydlícek, Jindrich Libovický:
A Dataset and Strong Baselines for Classification of Czech News Texts. 33-44
- Noémi Vadász:
Resolving Hungarian Anaphora with ChatGPT. 45-57
- György Orosz, Gergo Szabó, Péter Berkecz, Zsolt Szántó, Richárd Farkas:
Advancing Hungarian Text Processing with HuSpaCy: Efficient and Accurate NLP Pipelines. 58-69
- Gregor Donaj, Spela Antloga:
ParaDiom - A Parallel Corpus of Idiomatic Texts. 70-81
- Kai Hartung, Aaricia Herygers, Shubham Vijay Kurlekar, Khabbab Zakaria, Taylan Volkan, Sören Gröttrup, Munir Georges:
Measuring Sentiment Bias in Machine Translation. 82-93
- Zijian Gyozo Yang, László János Laki, Tamás Váradi, Gábor Prószéky:
Mono- and Multilingual GPT-3 Models for Hungarian. 94-104
- Vojtech John, Zdenek Zabokrtský:
The Unbearable Lightness of Morph Classification. 105-115
- Denis Memmesheimer, Karin Harbusch:
A German Parallel Clausal Coordinate Ellipsis Corpus that Aligns Sentences from the TüBa-D/Z Treebank with Reconstructed Canonical Forms. 116-128
Speech
- José Vicente Egas López, Gábor Gosztolya:
Identifying Subjects Wearing a Mask from the Speech by Means of Encoded Speech Representations. 131-140
- Tobias Weise, Andreas K. Maier, Kubilay Can Demir, Paula Andrea Pérez-Toro, Tomás Arias-Vergara, Björn Heismann, Elmar Nöth, Maria Schuster, Seung Hee Yang:
Impact of Including Pathological Speech in Pre-training on Pathology Detection. 141-153
- Tomás Jelínek:
Morphological Tagging and Lemmatization of Spoken Corpora of Czech. 154-163
- Thibault Bañeras Roux, Jane Wottawa, Mickael Rouvier, Téva Merlin, Richard Dufour:
HATS: An Open Data Set Integrating Human Perception Applied to the Evaluation of Automatic Speech Recognition Metrics. 164-175
- Frantisek Kynych, Jindrich Zdánský, Petr Cerva, Lukás Mateju:
Online Speaker Diarization Using Optimized SE-ResNet Architecture. 176-187
- Frederico S. Oliveira, Edresson Casanova, Arnaldo Cândido Júnior, Anderson da Silva Soares, Arlindo R. Galvão Filho:
CML-TTS: A Multilingual Dataset for Speech Synthesis in Low-Resource Languages. 188-199
- Jan Nouza, Lukás Mateju, Petr Cerva, Jindrich Zdánský:
Developing State-of-the-Art End-to-End ASR for Norwegian. 200-213
- Jindrich Matousek, Daniel Tihelka:
VITS: Quality Vs. Speed Analysis. 214-225
- Juan Camilo Vásquez-Correa, Haritz Arzelus, Juan M. Martín-Doñas, Joaquín Arellano, Ander González-Docasal, Aitor Álvarez:
When Whisper Meets TTS: Domain Adaptation Using only Synthetic Speech Data. 226-238
- Lars Formoe, Dan Bruun Mygind, Espen Løkke, Hasan Ogul:
Unsupervised Learning for Automatic Speech Recognition in Air Traffic Control Environment. 239-248
- Natalia Kalashnikova, Mathilde Hutin, Ioana Vasilescu, Laurence Devillers:
The Effect of Human-Likeliness in French Robot-Directed Speech: A Study of Speech Rate and Fluency. 249-257
- Juan M. Martín-Doñas, Haritz Arzelus, Aitor Álvarez, Joaquín Arellano:
An Online Diarization Approach for Streaming Applications Based on Tree-Clustering and Bayesian Resegmentation. 258-269
- Frederico S. Oliveira, Edresson Casanova, Arnaldo Cândido Júnior, Lucas R. S. Gris, Anderson da Silva Soares, Arlindo R. Galvão Filho:
Evaluation of Speech Representations for MOS Prediction. 270-282
- Soumyajit Mitra, Swayambhu Nath Ray, Bharat Padi, Raghavendra Bilgi, Harish Arsikere, Shalini Ghosh, Ajay Srinivasamurthy, Sri Garimella:
Unified Modeling of Multi-Domain Multi-Device ASR Systems. 283-292
- Lily Wadoux, Nelly Barbot, Jonathan Chevelu, Damien Lolive:
Voice Cloning for Voice Disorders: Impact of Phonetic Content. 293-303
- Raul Monteiro, Diogo Pernes:
Towards End-to-End Speech-to-Text Summarization. 304-316
- Georgios Karakasidis, Nathaniel R. Robinson, Yaroslav Getman, Atieno Ogayo, Ragheb Al-Ghezi, Ananya Ayasi, Shinji Watanabe, David R. Mortensen, Mikko Kurimo:
Multilingual TTS Accent Impressions for Accented ASR. 317-327
- Jan Lehecka, Josef V. Psutka, Josef Psutka:
Transfer Learning of Transformer-Based Speech Recognition Models from Czech to Slovak. 328-338
- Cristian D. Ríos-Urrego, Daniel Escobar-Grisales, Santiago Andres Moreno-Acevedo, Paula Andrea Pérez-Toro, Elmar Nöth, Juan Rafael Orozco-Arroyave:
Automatic Pronunciation Assessment of Non-native English Based on Phonological Analysis. 339-348
- Santiago Andres Moreno-Acevedo, Cristian D. Ríos-Urrego, Juan Camilo Vásquez-Correa, Jan Rusz, Elmar Nöth, Juan Rafael Orozco-Arroyave:
Language Generalization Using Active Learning in the Context of Parkinson's Disease Classification. 349-359