iBet uBet web content aggregator. Adding the entire web to your favor.

Link to original content: https://dblp.org/pid/03/7184.xml

Erica Cooper

Aidan Pine Erica Cooper David Guzmán Eric Joanis Anna Kazantseva Ross Krekoski Roland Kuhn 0001 Samuel Larkin Patrick Littell Delaney Lothian Akwiratékha' Martin Korin Richmond Marc Tessier Cassia Valentini-Botinhao Dan Wells Junichi Yamagishi Speech Generation for Indigenous Language Education. 101723 2025 90 Comput. Speech Lang. https://doi.org/10.1016/j.csl.2024.101723 db/journals/csl/csl90.html#PineCGJKKKLLLMRTVWY25 streams/journals/csl

Chang Zeng Xiaoxiao Miao Xin Wang 0037 Erica Cooper Junichi Yamagishi Joint speaker encoder and neural back-end model for fully end-to-end automatic speaker verification with multiple enrollment utterances. 101619 2024 86 Comput. Speech Lang. https://doi.org/10.1016/j.csl.2024.101619 db/journals/csl/csl86.html#ZengMWCY24 streams/journals/csl

Cheng Gong Xin Wang 0037 Erica Cooper Dan Wells Longbiao Wang Jianwu Dang 0001 Korin Richmond Junichi Yamagishi ZMM-TTS: Zero-Shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-Supervised Discrete Speech Representations. 4036-4051 2024 32 IEEE ACM Trans. Audio Speech Lang. Process. https://doi.org/10.1109/TASLP.2024.3451951 db/journals/taslp/taslp32.html#GongWCWWDRY24 streams/journals/taslp

Aditya Ravuri Erica Cooper Junichi Yamagishi Uncertainty as a Predictor: Leveraging Self-Supervised Learning for Zero-Shot MOS Prediction. 580-584 2024 ICASSP Workshops https://doi.org/10.1109/ICASSPW62465.2024.10626267 conf/icassp/2024w db/conf/icassp/icassp2024w.html#RavuriCY24 streams/conf/icassp Xiaoxiao Miao Xin Wang 0037 Erica Cooper Junichi Yamagishi Nicholas W. D. Evans Massimiliano Todisco Jean-François Bonastre Mickael Rouvier Synvox2: Towards A Privacy-Friendly Voxceleb2 Dataset. 11421-11425 2024 ICASSP https://doi.org/10.1109/ICASSP48485.2024.10446513 conf/icassp/2024 db/conf/icassp/icassp2024.html#MiaoWCYETBR24

Lin Zhang Xin Wang 0037 Erica Cooper Mireia Díez Federico Landini Nicholas W. D. Evans Junichi Yamagishi Spoof Diarization: "What Spoofed When" in Partially Spoofed Audio. 2024 abs/2406.07816 CoRR https://doi.org/10.48550/arXiv.2406.07816 db/journals/corr/corr2406.html#abs-2406-07816 streams/journals/corr

Zhengyang Chen Xuechen Liu Erica Cooper Junichi Yamagishi Yanmin Qian Generating Speakers by Prompting Listener Impressions for Pre-trained Multi-Speaker Text-to-Speech Systems. 2024 abs/2406.08812 CoRR https://doi.org/10.48550/arXiv.2406.08812 db/journals/corr/corr2406.html#abs-2406-08812

Cheng Gong Erica Cooper Xin Wang 0037 Chunyu Qiang Mengzhe Geng Dan Wells Longbiao Wang Jianwu Dang 0001 Marc Tessier Aidan Pine Korin Richmond Junichi Yamagishi An Initial Investigation of Language Adaptation for TTS Systems under Low-resource Scenarios. 2024 abs/2406.08911 CoRR https://doi.org/10.48550/arXiv.2406.08911 db/journals/corr/corr2406.html#abs-2406-08911

Chang Zeng Xiaoxiao Miao Xin Wang 0037 Erica Cooper Junichi Yamagishi Spoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches. 2024 abs/2409.06327 CoRR https://doi.org/10.48550/arXiv.2409.06327 db/journals/corr/corr2409.html#abs-2409-06327 streams/journals/corr

Wen-Chin Huang Szu-Wei Fu Erica Cooper Ryandhimas E. Zezario Tomoki Toda Hsin-Min Wang Junichi Yamagishi Yu Tsao 0001 The VoiceMOS Challenge 2024: Beyond Speech Quality Prediction. 2024 abs/2409.07001 CoRR https://doi.org/10.48550/arXiv.2409.07001 db/journals/corr/corr2409.html#abs-2409-07001 streams/journals/corr

Lin Zhang Xin Wang 0037 Erica Cooper Nicholas W. D. Evans Junichi Yamagishi The PartialSpoof Database and Countermeasures for the Detection of Short Fake Speech Segments Embedded in an Utterance. 813-825 2023 31 IEEE ACM Trans. Audio Speech Lang. Process. https://doi.org/10.1109/TASLP.2022.3233236 db/journals/taslp/taslp31.html#ZhangWCEY23

Xiaoxiao Miao Xin Wang 0037 Erica Cooper Junichi Yamagishi Natalia A. Tomashenko Speaker Anonymization Using Orthogonal Householder Neural Network. 3681-3695 2023 31 IEEE ACM Trans. Audio Speech Lang. Process. https://doi.org/10.1109/TASLP.2023.3313429 db/journals/taslp/taslp31.html#MiaoWCYT23

Lifan Zhong Erica Cooper Junichi Yamagishi Nobuaki Minematsu Exploring Isolated Musical Notes as Pre-training Data for Predominant Instrument Recognition in Polyphonic Music. 2312-2319 2023 APSIPA ASC https://doi.org/10.1109/APSIPAASC58517.2023.10317292 conf/apsipa/2023 db/conf/apsipa/apsipa2023.html#ZhongCYM23 Erica Cooper Wen-Chin Huang Yu Tsao 0001 Hsin-Min Wang Tomoki Toda Junichi Yamagishi The Voicemos Challenge 2023: Zero-Shot Subjective Speech Quality Prediction for Multiple Domains. 1-7 2023 ASRU https://doi.org/10.1109/ASRU57964.2023.10389763 conf/asru/2023 db/conf/asru/asru2023.html#CooperHTWTY23 Hemant Yadav Erica Cooper Junichi Yamagishi Sunayana Sitaram Rajiv Ratn Shah Partial Rank Similarity Minimization Method for Quality MOS Prediction of Unseen Speech Synthesis Systems in Zero-Shot and Semi-Supervised Setting. 1-7 2023 ASRU https://doi.org/10.1109/ASRU57964.2023.10389797 conf/asru/2023 db/conf/asru/asru2023.html#YadavCYSS23 Xuan Shi Erica Cooper Xin Wang 0037 Junichi Yamagishi Shrikanth Narayanan Can Knowledge of End-to-End Text-to-Speech Models Improve Neural Midi-to-Audio Synthesis Systems? 1-5 2023 ICASSP https://doi.org/10.1109/ICASSP49357.2023.10095848 conf/icassp/2023 db/conf/icassp/icassp2023.html#ShiCWYN23 Erica Cooper Junichi Yamagishi Investigating Range-Equalizing Bias in Mean Opinion Score Ratings of Synthesized Speech. 1104-1108 2023 INTERSPEECH https://doi.org/10.21437/Interspeech.2023-1076 conf/interspeech/2023 db/conf/interspeech/interspeech2023.html#CooperY23 Chang Zeng Xin Wang 0037 Xiaoxiao Miao Erica Cooper Junichi Yamagishi Improving Generalization Ability of Countermeasures for New Mismatch Scenario by Combining Multiple Advanced Regularization Terms. 1998-2002 2023 INTERSPEECH https://doi.org/10.21437/Interspeech.2023-125 conf/interspeech/2023 db/conf/interspeech/interspeech2023.html#Zeng0MCY23 Lin Zhang Xin Wang 0037 Erica Cooper Nicholas W. D. Evans Junichi Yamagishi Range-Based Equal Error Rate for Spoof Localization. 3212-3216 2023 INTERSPEECH https://doi.org/10.21437/Interspeech.2023-1214 conf/interspeech/2023 db/conf/interspeech/interspeech2023.html#Zhang0CEY23 Orian Sharoni Roee Shenberg Erica Cooper SASPEECH: A Hebrew Single Speaker Dataset for Text To Speech and Voice Conversion. 5566-5570 2023 INTERSPEECH https://doi.org/10.21437/Interspeech.2023-430 conf/interspeech/2023 db/conf/interspeech/interspeech2023.html#SharoniSC23

Lin Zhang Xin Wang 0037 Erica Cooper Nicholas W. D. Evans Junichi Yamagishi Range-Based Equal Error Rate for Spoof Localization. 2023 abs/2305.17739 CoRR https://doi.org/10.48550/arXiv.2305.17739 db/journals/corr/corr2305.html#abs-2305-17739

Xiaoxiao Miao Xin Wang 0037 Erica Cooper Junichi Yamagishi Natalia A. Tomashenko Language-independent speaker anonymization using orthogonal Householder neural network. 2023 abs/2305.18823 CoRR https://doi.org/10.48550/arXiv.2305.18823 db/journals/corr/corr2305.html#abs-2305-18823

Xiaoxiao Miao Xin Wang 0037 Erica Cooper Junichi Yamagishi Nicholas W. D. Evans Massimiliano Todisco Jean-François Bonastre Mickael Rouvier SynVox2: Towards a privacy-friendly VoxCeleb2 dataset. 2023 abs/2309.06141 CoRR https://doi.org/10.48550/arXiv.2309.06141 db/journals/corr/corr2309.html#abs-2309-06141

Nicolas Jonason Xin Wang 0037 Erica Cooper Lauri Juvela Bob L. T. Sturm Junichi Yamagishi DDSP-based Neural Waveform Synthesis of Polyphonic Guitar Performance from String-wise MIDI Input. 2023 abs/2309.07658 CoRR https://doi.org/10.48550/arXiv.2309.07658 db/journals/corr/corr2309.html#abs-2309-07658

Hemant Yadav Erica Cooper Junichi Yamagishi Sunayana Sitaram Rajiv Ratn Shah Partial Rank Similarity Minimization Method for Quality MOS Prediction of Unseen Speech Synthesis Systems in Zero-Shot and Semi-supervised setting. 2023 abs/2310.05078 CoRR https://doi.org/10.48550/arXiv.2310.05078 db/journals/corr/corr2310.html#abs-2310-05078

Xuechen Liu Xin Wang 0037 Erica Cooper Xiaoxiao Miao Junichi Yamagishi Speaker-Text Retrieval via Contrastive Learning. 2023 abs/2312.06055 CoRR https://doi.org/10.48550/arXiv.2312.06055 db/journals/corr/corr2312.html#abs-2312-06055

Cheng Gong Xin Wang 0037 Erica Cooper Dan Wells Longbiao Wang Jianwu Dang 0001 Korin Richmond Junichi Yamagishi ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations. 2023 abs/2312.14398 CoRR https://doi.org/10.48550/arXiv.2312.14398 db/journals/corr/corr2312.html#abs-2312-14398

Aditya Ravuri Erica Cooper Junichi Yamagishi Uncertainty as a Predictor: Leveraging Self-Supervised Learning for Zero-Shot MOS Prediction. 2023 abs/2312.15616 CoRR https://doi.org/10.48550/arXiv.2312.15616 db/journals/corr/corr2312.html#abs-2312-15616

Xuan Shi Erica Cooper Junichi Yamagishi Use of Speaker Recognition Approaches for Learning and Evaluating Embedding Representations of Musical Instrument Sounds. 367-377 2022 30 IEEE ACM Trans. Audio Speech Lang. Process. https://doi.org/10.1109/TASLP.2022.3140549 db/journals/taslp/taslp30.html#ShiCY22

Wen-Chin Huang Erica Cooper Junichi Yamagishi Tomoki Toda LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech. 896-900 2022 ICASSP https://doi.org/10.1109/ICASSP43922.2022.9747222 conf/icassp/2022 db/conf/icassp/icassp2022.html#HuangCYT22 Chang Zeng Xin Wang 0037 Erica Cooper Xiaoxiao Miao Junichi Yamagishi Attention Back-End for Automatic Speaker Verification with Multiple Enrollment Utterances. 6717-6721 2022 ICASSP https://doi.org/10.1109/ICASSP43922.2022.9746688 conf/icassp/2022 db/conf/icassp/icassp2022.html#ZengWCMY22 Erica Cooper Wen-Chin Huang Tomoki Toda Junichi Yamagishi Generalization Ability of MOS Prediction Networks. 8442-8446 2022 ICASSP https://doi.org/10.1109/ICASSP43922.2022.9746395 conf/icassp/2022 db/conf/icassp/icassp2022.html#CooperHTY22 Cheng-I Jeff Lai Erica Cooper Yang Zhang 0001 Shiyu Chang Kaizhi Qian Yi-Lun Liao Yung-Sung Chuang Alexander H. Liu Junichi Yamagishi David D. Cox James R. Glass On the Interplay between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis. 8447-8451 2022 ICASSP https://doi.org/10.1109/ICASSP43922.2022.9747728 conf/icassp/2022 db/conf/icassp/icassp2022.html#LaiCZCQLCLYCG22 Xiaoxiao Miao Xin Wang 0037 Erica Cooper Junichi Yamagishi Natalia A. Tomashenko Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions. 4426-4430 2022 INTERSPEECH https://doi.org/10.21437/Interspeech.2022-11065 conf/interspeech/2022 db/conf/interspeech/interspeech2022.html#MiaoWCYT22 Wen-Chin Huang Erica Cooper Yu Tsao 0001 Hsin-Min Wang Tomoki Toda Junichi Yamagishi The VoiceMOS Challenge 2022. 4536-4540 2022 INTERSPEECH https://doi.org/10.21437/Interspeech.2022-970 conf/interspeech/2022 db/conf/interspeech/interspeech2022.html#HuangC0WTY22 Xiaoxiao Miao Xin Wang 0037 Erica Cooper Junichi Yamagishi Natalia A. Tomashenko Language-Independent Speaker Anonymization Approach Using Self-Supervised Pre-Trained Models. 279-286 2022 Odyssey https://doi.org/10.21437/Odyssey.2022-39 conf/odyssey/2022 db/conf/odyssey/odyssey2022.html#Miao0CYT22

Xiaoxiao Miao Xin Wang 0037 Erica Cooper Junichi Yamagishi Natalia A. Tomashenko Language-Independent Speaker Anonymization Approach using Self-Supervised Pre-Trained Models. 2022 abs/2202.13097 CoRR https://arxiv.org/abs/2202.13097 db/journals/corr/corr2202.html#abs-2202-13097

Wen-Chin Huang Erica Cooper Yu Tsao 0001 Hsin-Min Wang Tomoki Toda Junichi Yamagishi The VoiceMOS Challenge 2022. 2022 abs/2203.11389 CoRR https://doi.org/10.48550/arXiv.2203.11389 db/journals/corr/corr2203.html#abs-2203-11389

Xiaoxiao Miao Xin Wang 0037 Erica Cooper Junichi Yamagishi Natalia A. Tomashenko Analyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions. 2022 abs/2203.14834 CoRR https://doi.org/10.48550/arXiv.2203.14834 db/journals/corr/corr2203.html#abs-2203-14834

Lin Zhang Xin Wang 0037 Erica Cooper Nicholas W. D. Evans Junichi Yamagishi The PartialSpoof Database and Countermeasures for the Detection of Short Generated Audio Segments Embedded in a Speech Utterance. 2022 abs/2204.05177 CoRR https://doi.org/10.48550/arXiv.2204.05177 db/journals/corr/corr2204.html#abs-2204-05177

Chang Zeng Xiaoxiao Miao Xin Wang 0037 Erica Cooper Junichi Yamagishi Joint Speaker Encoder and Neural Back-end Model for Fully End-to-End Automatic Speaker Verification with Multiple Enrollment Utterances. 2022 abs/2209.00485 CoRR https://doi.org/10.48550/arXiv.2209.00485 db/journals/corr/corr2209.html#abs-2209-00485

Xuan Shi Erica Cooper Xin Wang 0037 Junichi Yamagishi Shrikanth Narayanan Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems? 2022 abs/2211.13868 CoRR https://doi.org/10.48550/arXiv.2211.13868 db/journals/corr/corr2211.html#abs-2211-13868

Shuhei Kato Yusuke Yasuda Xin Wang 0037 Erica Cooper Junichi Yamagishi How Similar or Different is Rakugo Speech Synthesizer to Professional Performers? 6488-6492 2021 ICASSP https://doi.org/10.1109/ICASSP39728.2021.9414175 conf/icassp/2021 db/conf/icassp/icassp2021.html#KatoYWCY21 Jennifer Williams 0001 Yi Zhao 0006 Erica Cooper Junichi Yamagishi Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm. 7053-7057 2021 ICASSP https://doi.org/10.1109/ICASSP39728.2021.9413543 conf/icassp/2021 db/conf/icassp/icassp2021.html#WilliamsZCY21 Lin Zhang Xin Wang 0037 Erica Cooper Junichi Yamagishi Jose Patino 0001 Nicholas W. D. Evans An Initial Investigation for Detecting Partially Spoofed Audio. 4264-4268 2021 Interspeech https://doi.org/10.21437/Interspeech.2021-738 conf/interspeech/2021 db/conf/interspeech/interspeech2021.html#ZhangWCY0E21 Jennifer Williams 0001 Jason Fong Erica Cooper Junichi Yamagishi Exploring Disentanglement with Multilingual and Monolingual VQ-VAE. 124-129 2021 SSW https://doi.org/10.21437/SSW.2021-22 conf/ssw/2021 db/conf/ssw/ssw2021.html#0001FCY21 Erica Cooper Xin Wang 0037 Junichi Yamagishi Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis. 130-135 2021 SSW https://doi.org/10.21437/SSW.2021-23 conf/ssw/2021 db/conf/ssw/ssw2021.html#Cooper0Y21 Erica Cooper Junichi Yamagishi How do Voices from Past Speech Synthesis Challenges Compare Today? 183-188 2021 SSW https://doi.org/10.21437/SSW.2021-32 conf/ssw/2021 db/conf/ssw/ssw2021.html#CooperY21

Chang Zeng Xin Wang 0037 Erica Cooper Junichi Yamagishi Attention Back-end for Automatic Speaker Verification with Multiple Enrollment Utterances. 2021 abs/2104.01541 CoRR https://arxiv.org/abs/2104.01541 db/journals/corr/corr2104.html#abs-2104-01541

Lin Zhang Xin Wang 0037 Erica Cooper Junichi Yamagishi Jose Patino 0001 Nicholas W. D. Evans An Initial Investigation for Detecting Partially Spoofed Audio. 2021 abs/2104.02518 CoRR https://arxiv.org/abs/2104.02518 db/journals/corr/corr2104.html#abs-2104-02518

Erica Cooper Xin Wang 0037 Junichi Yamagishi Text-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis. 2021 abs/2104.12292 CoRR https://arxiv.org/abs/2104.12292 db/journals/corr/corr2104.html#abs-2104-12292

Jennifer Williams 0001 Jason Fong Erica Cooper Junichi Yamagishi Exploring Disentanglement with Multilingual and Monolingual VQ-VAE. 2021 abs/2105.01573 CoRR https://arxiv.org/abs/2105.01573 db/journals/corr/corr2105.html#abs-2105-01573

Erica Cooper Junichi Yamagishi How do Voices from Past Speech Synthesis Challenges Compare Today? 2021 abs/2105.02373 CoRR https://arxiv.org/abs/2105.02373 db/journals/corr/corr2105.html#abs-2105-02373

Xuan Shi Erica Cooper Junichi Yamagishi Use of speaker recognition approaches for learning timbre representations of musical instrument sounds from raw waveforms. 2021 abs/2107.11506 CoRR https://arxiv.org/abs/2107.11506 db/journals/corr/corr2107.html#abs-2107-11506

Lin Zhang Xin Wang 0037 Erica Cooper Junichi Yamagishi Multi-Task Learning in Utterance-Level and Segmental-Level Spoof Detection. 2021 abs/2107.14132 CoRR https://arxiv.org/abs/2107.14132 db/journals/corr/corr2107.html#abs-2107-14132

Cheng-I Jeff Lai Erica Cooper Yang Zhang 0001 Shiyu Chang Kaizhi Qian Yi-Lun Liao Yung-Sung Chuang Alexander H. Liu Junichi Yamagishi David D. Cox James R. Glass On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis. 2021 abs/2110.01147 CoRR https://arxiv.org/abs/2110.01147 db/journals/corr/corr2110.html#abs-2110-01147

Wen-Chin Huang Erica Cooper Junichi Yamagishi Tomoki Toda LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech. 2021 abs/2110.09103 CoRR https://arxiv.org/abs/2110.09103 db/journals/corr/corr2110.html#abs-2110-09103

Shuhei Kato Yusuke Yasuda Xin Wang 0037 Erica Cooper Shinji Takaki Junichi Yamagishi Modeling of Rakugo Speech and Its Limitations: Toward Speech Synthesis That Entertains Audiences. 138149-138161 2020 8 IEEE Access https://doi.org/10.1109/ACCESS.2020.3011975 db/journals/access/access8.html#KatoYWCTY20

Erica Cooper Cheng-I Lai Yusuke Yasuda Fuming Fang Xin Wang 0037 Nanxin Chen Junichi Yamagishi Zero-Shot Multi-Speaker Text-To-Speech with State-Of-The-Art Neural Speaker Embeddings. 6184-6188 2020 ICASSP https://doi.org/10.1109/ICASSP40776.2020.9054535 conf/icassp/2020 db/conf/icassp/icassp2020.html#CooperLYFWCY20 Erica Cooper Cheng-I Lai Yusuke Yasuda Junichi Yamagishi Can Speaker Augmentation Improve Multi-Speaker End-to-End TTS? 3979-3983 2020 INTERSPEECH https://doi.org/10.21437/Interspeech.2020-1229 conf/interspeech/2020 db/conf/interspeech/interspeech2020.html#CooperLYY20 Yi Zhao 0006 Haoyu Li Cheng-I Lai Jennifer Williams 0001 Erica Cooper Junichi Yamagishi Improved Prosody from Learned F0 Codebook Representations for VQ-VAE Speech Waveform Reconstruction. 4417-4421 2020 INTERSPEECH https://doi.org/10.21437/Interspeech.2020-1615 conf/interspeech/2020 db/conf/interspeech/interspeech2020.html#0006LLWCY20

Yi Zhao 0006 Haoyu Li Cheng-I Lai Jennifer Williams 0001 Erica Cooper Junichi Yamagishi Improved Prosody from Learned F0 Codebook Representations for VQ-VAE Speech Waveform Reconstruction. 2020 abs/2005.07884 CoRR https://arxiv.org/abs/2005.07884 db/journals/corr/corr2005.html#abs-2005-07884

Antoine Perquin Erica Cooper Junichi Yamagishi Grapheme or phoneme? An Analysis of Tacotron's Embedded Representations. 2020 abs/2010.10694 CoRR https://arxiv.org/abs/2010.10694 db/journals/corr/corr2010.html#abs-2010-10694

Jennifer Williams 0001 Yi Zhao 0006 Erica Cooper Junichi Yamagishi Learning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm. 2020 abs/2010.10727 CoRR https://arxiv.org/abs/2010.10727 db/journals/corr/corr2010.html#abs-2010-10727

Shuhei Kato Yusuke Yasuda Xin Wang 0037 Erica Cooper Junichi Yamagishi How Similar or Different Is Rakugo Speech Synthesizer to Professional Performers? 2020 abs/2010.11549 CoRR https://arxiv.org/abs/2010.11549 db/journals/corr/corr2010.html#abs-2010-11549

Erica Cooper Xin Wang 0037 Yi Zhao 0006 Yusuke Yasuda Junichi Yamagishi Pretraining Strategies, Waveform Model Choice, and Acoustic Configurations for Multi-Speaker End-to-End Speech Synthesis. 2020 abs/2011.04839 CoRR https://arxiv.org/abs/2011.04839 db/journals/corr/corr2011.html#abs-2011-04839

Erica Cooper Text-to-Speech Synthesis Using Found Data for Low-Resource Languages. Columbia University, USA 2019 https://doi.org/10.7916/d8-vdzp-j870 Shuhei Kato Yusuke Yasuda Xin Wang 0037 Erica Cooper Shinji Takaki Junichi Yamagishi Rakugo speech synthesis using segment-to-segment neural transduction and style tokens - toward speech synthesis for entertaining audiences. 111-116 2019 SSW https://doi.org/10.21437/SSW.2019-20 conf/ssw/2019 db/conf/ssw/ssw2019.html#KatoY0CTY19 Elshadai Tesfaye Biru Yishak Tofik Mohammed David Tofu Erica Cooper Julia Hirschberg Subset Selection, Adaptation, Gemination and Prosody Prediction for Amharic Text-to-Speech Synthesis. 205-210 2019 SSW https://doi.org/10.21437/SSW.2019-37 conf/ssw/2019 db/conf/ssw/ssw2019.html#BiruMTCH19 Kai-Zhan Lee Erica Cooper Julia Hirschberg A Comparison of Speaker-based and Utterance-based Data Selection for Text-to-Speech Synthesis. 2873-2877 2018 INTERSPEECH https://doi.org/10.21437/Interspeech.2018-1313 conf/interspeech/2018 db/conf/interspeech/interspeech2018.html#LeeCH18 Erica Cooper Xinyue Wang Alison Chang Yocheved Levitan Julia Hirschberg Utterance Selection for Optimizing Intelligibility of TTS Voices Trained on ASR Data. 3971-3975 2017 INTERSPEECH https://doi.org/10.21437/Interspeech.2017-465 conf/interspeech/2017 db/conf/interspeech/interspeech2017.html#CooperWCLH17 Gideon Mendels Erica Cooper Julia Hirschberg Babler - Data Collection from the Web to Support Speech Recognition and Keyword Search. 72-81 2016 WAC@ACL https://doi.org/10.18653/v1/W16-2609 conf/aclwac/2016 db/conf/aclwac/aclwac2016.html#MendelsCH16 Erica Cooper Alison Chang Yocheved Levitan Julia Hirschberg Data Selection and Adaptation for Naturalness in HMM-Based Speech Synthesis. 357-361 2016 INTERSPEECH https://doi.org/10.21437/Interspeech.2016-502 conf/interspeech/2016 db/conf/interspeech/interspeech2016.html#CooperCLH16 Gideon Mendels Erica Cooper Victor Soto Julia Hirschberg Mark J. F. Gales Kate M. Knill Anton Ragni Haipeng Wang Improving speech recognition and keyword search for low resource languages using web data. 829-833 2015 INTERSPEECH https://doi.org/10.21437/Interspeech.2015-260 conf/interspeech/2015 db/conf/interspeech/interspeech2015.html#MendelsCSHGKRW15 Victor Soto Erica Cooper Lidia Mangu Andrew Rosenberg Julia Hirschberg Rescoring Confusion Networks for Keyword Search. 7088-7092 2014 ICASSP https://doi.org/10.1109/ICASSP.2014.6854975 conf/icassp/2014 db/conf/icassp/icassp2014.html#SotoCMRH14 Victor Soto Erica Cooper Andrew Rosenberg Julia Hirschberg Cross-language phrase boundary detection. 8460-8464 2013 ICASSP https://doi.org/10.1109/ICASSP.2013.6639316 conf/icassp/2013 db/conf/icassp/icassp2013.html#SotoCRH13 Dogan Can Erica Cooper Abhinav Sethy Christopher M. White Bhuvana Ramabhadran Murat Saraclar Effect of pronounciations on OOV queries in spoken term detection. 3957-3960 2009 ICASSP https://doi.org/10.1109/ICASSP.2009.4960494 https://doi.ieeecomputersociety.org/10.1109/ICASSP.2009.4960494 conf/icassp/2009 db/conf/icassp/icassp2009.html#CanCSWRS09 Christopher M. White Abhinav Sethy Bhuvana Ramabhadran Patrick J. Wolfe Erica Cooper Murat Saraclar James K. Baker Unsupervised pronunciation validation. 4301-4304 2009 ICASSP https://doi.org/10.1109/ICASSP.2009.4960580 https://doi.ieeecomputersociety.org/10.1109/ICASSP.2009.4960580 conf/icassp/2009 db/conf/icassp/icassp2009.html#WhiteSRWCSB09 Dogan Can Erica Cooper Arnab Ghoshal Martin Jansche Sanjeev Khudanpur Bhuvana Ramabhadran Michael Riley 0001 Murat Saraclar Abhinav Sethy Morgan Ulinski Christopher M. White Web derived pronunciations for spoken term detection. 83-90 2009 SIGIR https://doi.org/10.1145/1571941.1571958 conf/sigir/2009 db/conf/sigir/sigir2009.html#CanCGJKRRSSUW09 James K. Baker Elshadai Tesfaye Biru Jean-François Bonastre Dogan Can Alison Chang Shiyu Chang Nanxin Chen Zhengyang Chen Yung-Sung Chuang David D. Cox Jianwu Dang 0001 Mireia Díez Nicholas W. D. Evans Fuming Fang Jason Fong Szu-Wei Fu Mark J. F. Gales Mengzhe Geng Arnab Ghoshal James R. Glass Cheng Gong David Guzmán Julia Hirschberg Wen-Chin Huang Martin Jansche Eric Joanis Nicolas Jonason Lauri Juvela Shuhei Kato Anna Kazantseva Sanjeev Khudanpur Kate M. Knill Ross Krekoski Roland Kuhn 0001 Cheng-I LaiCheng-I Jeff Lai Federico Landini Samuel Larkin Kai-Zhan Lee Yocheved Levitan Haoyu Li Yi-Lun Liao Patrick Littell Alexander H. Liu Xuechen Liu Delaney Lothian Lidia Mangu Akwiratékha' Martin Gideon Mendels Xiaoxiao Miao Nobuaki Minematsu Yishak Tofik Mohammed Shri NarayananShrikanth Narayanan Jose Patino 0001 Antoine Perquin Aidan Pine Kaizhi Qian Yanmin Qian Chunyu Qiang Anton Ragni Bhuvana Ramabhadran Aditya Ravuri Korin Richmond Michael Riley 0001 Andrew Rosenberg Mickael Rouvier Murat Saraclar Abhinav Sethy Rajiv Ratn Shah Orian Sharoni Roee Shenberg Xuan Shi Sunayana Sitaram Victor Soto Bob L. T. Sturm Shinji Takaki Marc Tessier Tomoki Toda Massimiliano Todisco David Tofu Natalia A. Tomashenko Yu Tsao 0001 Morgan Ulinski Cassia Valentini-Botinhao Haipeng Wang Hsin-Min Wang Longbiao Wang Xin Wang 0037 Xinyue Wang Dan Wells Christopher M. White Jennifer Williams 0001 Patrick J. Wolfe Hemant Yadav Junichi Yamagishi Yusuke Yasuda Chang Zeng Ryandhimas E. Zezario Lin Zhang Yang Zhang 0001 Yi Zhao 0006 Lifan Zhong