default search action
Jan Trmal
Person information
SPARQL queries
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c44]Ruizhe Huang, Mahsa Yarmohammadi, Jan Trmal, Jing Liu, Desh Raj, Leibny Paola García, Alexei V. Ivanov, Patrick Ehlen, Mingzhi Yu, Dan Povey, Sanjeev Khudanpur:
ConEC: Earnings Call Dataset with Real-world Contexts for Benchmarking Contextual Speech Recognition. LREC/COLING 2024: 3700-3706 - 2023
- [c43]Ruizhe Huang, Matthew Wiesner, Leibny Paola García-Perera, Daniel Povey, Jan Trmal, Sanjeev Khudanpur:
Building Keyword Search System from End-To-End Asr Systems. ICASSP 2023: 1-5 - 2021
- [c42]Matthew Wiesner, Mousmita Sarma, Ashish Arora, Desh Raj, Dongji Gao, Ruizhe Huang, Supreet Preet, Moris Johnson, Zikra Iqbal, Nagendra Goel, Jan Trmal, Leibny Paola García-Perera, Sanjeev Khudanpur:
Training Hybrid Models on Noisy Transliterated Transcripts for Code-Switched Speech Recognition. Interspeech 2021: 2906-2910 - [c41]Guoguo Chen, Shuzhou Chai, Guan-Bo Wang, Jiayu Du, Wei-Qiang Zhang, Chao Weng, Dan Su, Daniel Povey, Jan Trmal, Junbo Zhang, Mingjie Jin, Sanjeev Khudanpur, Shinji Watanabe, Shuaijiang Zhao, Wei Zou, Xiangang Li, Xuchen Yao, Yongqing Wang, Zhao You, Zhiyong Yan:
GigaSpeech: An Evolving, Multi-Domain ASR Corpus with 10, 000 Hours of Transcribed Audio. Interspeech 2021: 3670-3674 - [i11]Piotr Zelasko, Sonal Joshi, Yiwen Shao, Jesús Villalba, Jan Trmal, Najim Dehak, Sanjeev Khudanpur:
Adversarial Attacks and Defenses for Speech Recognition Systems. CoRR abs/2103.17122 (2021) - [i10]Guoguo Chen, Shuzhou Chai, Guanbo Wang, Jiayu Du, Wei-Qiang Zhang, Chao Weng, Dan Su, Daniel Povey, Jan Trmal, Junbo Zhang, Mingjie Jin, Sanjeev Khudanpur, Shinji Watanabe, Shuaijiang Zhao, Wei Zou, Xiangang Li, Xuchen Yao, Yongqing Wang, Yujun Wang, Zhao You, Zhiyong Yan:
GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10, 000 Hours of Transcribed Audio. CoRR abs/2106.06909 (2021) - [i9]Piotr Zelasko, Daniel Povey, Jan "Yenda" Trmal, Sanjeev Khudanpur:
Lhotse: a speech data representation library for the modern deep learning ecosystem. CoRR abs/2110.12561 (2021) - 2020
- [c40]Mirco Ravanelli, Jianyuan Zhong, Santiago Pascual, Pawel Swietojanski, João Monteiro, Jan Trmal, Yoshua Bengio:
Multi-Task Self-Supervised Learning for Robust Speech Recognition. ICASSP 2020: 6989-6993 - [c39]Maarten Van Segbroeck, Ahmed Zaid, Ksenia Kutsenko, Cirenia Huerta, Tinh Nguyen, Xuewen Luo, Björn Hoffmeister, Jan Trmal, Maurizio Omologo, Roland Maas:
DiPCo - Dinner Party Corpus. INTERSPEECH 2020: 434-436 - [c38]Oliver Adams, Matthew Wiesner, Jan Trmal, Garrett Nicolai, David Yarowsky:
Induced Inflection-Set Keyword Search in Speech. SIGMORPHON 2020: 210-216 - [i8]Mirco Ravanelli, Jianyuan Zhong, Santiago Pascual, Pawel Swietojanski, João Monteiro, Jan Trmal, Yoshua Bengio:
Multi-task self-supervised learning for Robust Speech Recognition. CoRR abs/2001.09239 (2020)
2010 – 2019
- 2019
- [c37]Matthew Wiesner, Oliver Adams, David Yarowsky, Jan Trmal, Sanjeev Khudanpur:
Zero-Shot Pronunciation Lexicons for Cross-Language Acoustic Model Transfer. ASRU 2019: 1048-1054 - [c36]Saurabhchand Bhati, Chunxi Liu, Jesús Villalba, Jan Trmal, Sanjeev Khudanpur, Najim Dehak:
Bottom-Up Unsupervised Word Discovery via Acoustic Units. GlobalSIP 2019: 1-5 - [c35]Ashish Arora, Paola García, Shinji Watanabe, Vimal Manohar, Yiwen Shao, Sanjeev Khudanpur, Chun-Chieh Chang, Babak Rekabdar, Bagher BabaAli, Daniel Povey, David Etter, Desh Raj, Hossein Hadian, Jan Trmal:
Using ASR Methods for OCR. ICDAR 2019: 663-668 - [i7]Maarten Van Segbroeck, Ahmed Zaid, Ksenia Kutsenko, Cirenia Huerta, Tinh Nguyen, Xuewen Luo, Björn Hoffmeister, Jan Trmal, Maurizio Omologo, Roland Maas:
DiPCo - Dinner Party Corpus. CoRR abs/1909.13447 (2019) - [i6]Oliver Adams, Matthew Wiesner, Jan Trmal, Garrett Nicolai, David Yarowsky:
Induced Inflection-Set Keyword Search in Speech. CoRR abs/1910.12299 (2019) - 2018
- [c34]Jan Svec, Josef V. Psutka, Jan Trmal, Lubas Smfdl, Pavel Ircing, Jan Sedmidubský:
On the Use of Grapheme Models for Searching in Large Spoken Archives. ICASSP 2018: 6259-6263 - [c33]Jon Barker, Shinji Watanabe, Emmanuel Vincent, Jan Trmal:
The Fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, Task and Baselines. INTERSPEECH 2018: 1561-1565 - [c32]Matthew Wiesner, Chunxi Liu, Lucas Ondel, Craig Harman, Vimal Manohar, Jan Trmal, Zhongqiang Huang, Najim Dehak, Sanjeev Khudanpur:
Automatic Speech Recognition and Topic Identification from Speech for Almost-Zero-Resource Languages. INTERSPEECH 2018: 2052-2056 - [c31]Fred Richardson, Pedro A. Torres-Carrasquillo, Jonas Borgstrom, Douglas E. Sturim, Youngjune Gwon, Jesús Villalba, Jan Trmal, Nanxin Chen, Réda Dehak, Najim Dehak:
The MIT Lincoln Laboratory / JHU / EPITA-LSE LRE17 System. Odyssey 2018: 54-59 - [c30]Hossein Hadian, Daniel Povey, Hossein Sameti, Jan Trmal, Sanjeev Khudanpur:
Improving LF-MMI Using Unconstrained Supervisions for ASR. SLT 2018: 43-47 - [c29]Chunxi Liu, Matthew Wiesner, Shinji Watanabe, Craig Harman, Jan Trmal, Najim Dehak, Sanjeev Khudanpur:
Low-Resource Contextual Topic Identification on Speech. SLT 2018: 656-663 - [c28]Lubos Smídl, Jan Svec, Ales Prazák, Jan Trmal:
Semi-Supervised Training of DNN-Based Acoustic Model for ATC Speech Recognition. SPECOM 2018: 646-655 - [i5]Matthew Wiesner, Chunxi Liu, Lucas Ondel, Craig Harman, Vimal Manohar, Jan Trmal, Zhongqiang Huang, Sanjeev Khudanpur, Najim Dehak:
The JHU Speech LOREHLT 2017 System: Cross-Language Transfer for Situation-Frame Detection. CoRR abs/1802.08731 (2018) - [i4]Jon Barker, Shinji Watanabe, Emmanuel Vincent, Jan Trmal:
The fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines. CoRR abs/1803.10609 (2018) - [i3]Chunxi Liu, Matthew Wiesner, Shinji Watanabe, Craig Harman, Jan Trmal, Najim Dehak, Sanjeev Khudanpur:
Low-Resource Contextual Topic Identification on Speech. CoRR abs/1807.06204 (2018) - 2017
- [c27]Mirko Hannemann, Jan Trmal, Lucas Ondel, Santosh Kesiraju, Lukás Burget:
Bayesian joint-sequence models for grapheme-to-phoneme conversion. ICASSP 2017: 2836-2840 - [c26]Chunxi Liu, Jan Trmal, Matthew Wiesner, Craig Harman, Sanjeev Khudanpur:
Topic Identification for Speech Without ASR. INTERSPEECH 2017: 2501-2505 - [c25]Jan Svec, Josef V. Psutka, Lubos Smídl, Jan Trmal:
A Relevance Score Estimation for Spoken Term Detection Based on RNN-Generated Pronunciation Embeddings. INTERSPEECH 2017: 2934-2938 - [c24]Jan Trmal, Matthew Wiesner, Vijayaditya Peddinti, Xiaohui Zhang, Pegah Ghahremani, Yiming Wang, Vimal Manohar, Hainan Xu, Daniel Povey, Sanjeev Khudanpur:
The Kaldi OpenKWS System: Improving Low Resource Keyword Search. INTERSPEECH 2017: 3597-3601 - [i2]Chunxi Liu, Jan Trmal, Matthew Wiesner, Craig Harman, Sanjeev Khudanpur:
Topic Identification for Speech without ASR. CoRR abs/1703.07476 (2017) - [i1]Jan Trmal, Gaurav Kumar, Vimal Manohar, Sanjeev Khudanpur, Matt Post, Paul McNamee:
Using of heterogeneous corpora for training of an ASR system. CoRR abs/1706.00321 (2017) - 2016
- [c23]Eleanor Chodroff, Matthew Maciejewski, Jan Trmal, Sanjeev Khudanpur, John Godfrey:
New release of Mixer-6: Improved validity for phonetic study of speaker variation and identification. LREC 2016 - 2015
- [c22]Gaurav Kumar, Graeme W. Blackwood, Jan Trmal, Daniel Povey, Sanjeev Khudanpur:
A Coarse-Grained Model for Optimal Coupling of ASR and SMT Systems for Speech Translation. EMNLP 2015: 1902-1907 - 2014
- [c21]Xiaohui Zhang, Jan Trmal, Daniel Povey, Sanjeev Khudanpur:
Improving deep neural network acoustic models using generalized maxout networks. ICASSP 2014: 215-219 - [c20]Pegah Ghahremani, Bagher BabaAli, Daniel Povey, Korbinian Riedhammer, Jan Trmal, Sanjeev Khudanpur:
A pitch extraction algorithm tuned for automatic speech recognition. ICASSP 2014: 2494-2498 - [c19]Justin T. Chiu, Yun Wang, Jan Trmal, Daniel Povey, Guoguo Chen, Alexander I. Rudnicky:
Combination of FST and CN search in spoken term detection. INTERSPEECH 2014: 2784-2788 - [c18]Chunxi Liu, Aren Jansen, Guoguo Chen, Keith Kintzley, Jan Trmal, Sanjeev Khudanpur:
Low-resource open vocabulary keyword search using point process models. INTERSPEECH 2014: 2789-2793 - [c17]Jan Trmal, Guoguo Chen, Daniel Povey, Sanjeev Khudanpur, Pegah Ghahremani, Xiaohui Zhang, Vimal Manohar, Chunxi Liu, Aren Jansen, Dietrich Klakow, David Yarowsky, Florian Metze:
A keyword search system using open source software. SLT 2014: 530-535 - 2013
- [c16]Guoguo Chen, Oguz Yilmaz, Jan Trmal, Daniel Povey, Sanjeev Khudanpur:
Using proxies for OOV keywords in the keyword search task. ASRU 2013: 416-421 - [c15]Guoguo Chen, Sanjeev Khudanpur, Daniel Povey, Jan Trmal, David Yarowsky, Oguz Yilmaz:
Quantifying the value of pronunciation lexicons for keyword search in lowresource languages. ICASSP 2013: 8560-8564 - 2012
- [j1]Jan Vanek, Jan Trmal, Josef V. Psutka, Josef Psutka:
Optimized Acoustic Likelihoods Computation for NVIDIA and ATI/AMD Graphics Processors. IEEE Trans. Speech Audio Process. 20(6): 1818-1828 (2012) - [c14]Ales Prazák, Zdenek Loose, Jan Trmal, Josef V. Psutka, Josef Psutka:
Novel Approach to Live Captioning Through Re-speaking: Tailoring Speech Recognition to Re-speaker's Needs. INTERSPEECH 2012: 1372-1375 - [c13]Jan Vanek, Jan Trmal, Josef V. Psutka, Josef Psutka:
Full covariance Gaussian mixture models evaluation on GPU. ISSPIT 2012: 203-207 - [c12]Ales Prazák, Zdenek Loose, Jan Trmal, Josef V. Psutka, Josef Psutka:
Captioning of Live TV Programs through Speech Recognition and Re-speaking. TSD 2012: 513-519 - 2011
- [c11]Jan Vanek, Jan Trmal, Josef V. Psutka, Josef Psutka:
Optimization of the Gaussian Mixture Model Evaluation on GPU. INTERSPEECH 2011: 1737-1740 - 2010
- [c10]Jan Trmal, Jan Zelinka, Ludek Müller:
On speaker adaptive training of artificial neural networks. INTERSPEECH 2010: 554-557 - [c9]Jan Zelinka, Jan Trmal, Ludek Müller:
Low-dimensional space transforms of posteriors in speech recognition. INTERSPEECH 2010: 1193-1196 - [c8]Jan Trmal, Ales Prazák, Zdenek Loose, Josef Psutka:
Online TV Captioning of Czech Parliamentary Sessions. TSD 2010: 416-422 - [c7]Jan Trmal, Jan Zelinka, Ludek Müller:
Adaptation of a Feedforward Artificial Neural Network Using a Linear Transform. TSD 2010: 423-430 - [c6]Jan Zelinka, Lubos Smídl, Jan Trmal, Ludek Müller:
Posterior Estimates and Transforms for Speech Recognition. TSD 2010: 480-487
2000 – 2009
- 2009
- [c5]Jindrich Matousek, Radek Skarnitzl, Pavel Machac, Jan Trmal:
Identification and automatic detection of parasitic speech sounds. INTERSPEECH 2009: 876-879 - 2008
- [c4]Jan Trmal, Marek Hrúz, Jan Zelinka, Pavel Campr, Ludek Müller:
Feature space transforms for Czech sign-language recognition. INTERSPEECH 2008: 2036-2039 - [c3]Miroslav Nagy, Petr Hanzlícek, Jana Zvárová, Tatjana Dostálová, Michaela Seydlova, Radim Hippman, Lubos Smídl, Jan Trmal, Josef Psutka:
Voice-controlled Data Entry in Dental Electronic Health Record. MIE 2008: 529-534 - 2006
- [c2]Jan Trmal, Jan Vanek, Ludek Müller, Jan Zelinka:
Independent components for acoustic modeling. INTERSPEECH 2006 - [c1]Jan Trmal, Jan Zelinka, Jan Vanek, Ludek Müller:
Silence/Speech Detection Method Based on Set of Decision Graphs. TSD 2006: 539-546
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-06-13 21:00 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint