Erica CooperAidan PineErica CooperDavid GuzmánEric JoanisAnna KazantsevaRoss KrekoskiRoland Kuhn 0001Samuel LarkinPatrick LittellDelaney LothianAkwiratékha' MartinKorin RichmondMarc TessierCassia Valentini-BotinhaoDan WellsJunichi YamagishiSpeech Generation for Indigenous Language Education.101723202590Comput. Speech Lang.https://doi.org/10.1016/j.csl.2024.101723db/journals/csl/csl90.html#PineCGJKKKLLLMRTVWY25streams/journals/cslChang ZengXiaoxiao MiaoXin Wang 0037Erica CooperJunichi YamagishiJoint speaker encoder and neural back-end model for fully end-to-end automatic speaker verification with multiple enrollment utterances.101619202486Comput. Speech Lang.https://doi.org/10.1016/j.csl.2024.101619db/journals/csl/csl86.html#ZengMWCY24streams/journals/cslCheng GongXin Wang 0037Erica CooperDan WellsLongbiao WangJianwu Dang 0001Korin RichmondJunichi YamagishiZMM-TTS: Zero-Shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-Supervised Discrete Speech Representations.4036-4051202432IEEE ACM Trans. Audio Speech Lang. Process.https://doi.org/10.1109/TASLP.2024.3451951db/journals/taslp/taslp32.html#GongWCWWDRY24streams/journals/taslpAditya RavuriErica CooperJunichi YamagishiUncertainty as a Predictor: Leveraging Self-Supervised Learning for Zero-Shot MOS Prediction.580-5842024ICASSP Workshopshttps://doi.org/10.1109/ICASSPW62465.2024.10626267conf/icassp/2024wdb/conf/icassp/icassp2024w.html#RavuriCY24streams/conf/icasspXiaoxiao MiaoXin Wang 0037Erica CooperJunichi YamagishiNicholas W. D. EvansMassimiliano TodiscoJean-François BonastreMickael RouvierSynvox2: Towards A Privacy-Friendly Voxceleb2 Dataset.11421-114252024ICASSPhttps://doi.org/10.1109/ICASSP48485.2024.10446513conf/icassp/2024db/conf/icassp/icassp2024.html#MiaoWCYETBR24Lin ZhangXin Wang 0037Erica CooperMireia DíezFederico LandiniNicholas W. D. EvansJunichi YamagishiSpoof Diarization: "What Spoofed When" in Partially Spoofed Audio.2024abs/2406.07816CoRRhttps://doi.org/10.48550/arXiv.2406.07816db/journals/corr/corr2406.html#abs-2406-07816streams/journals/corrZhengyang ChenXuechen LiuErica CooperJunichi YamagishiYanmin QianGenerating Speakers by Prompting Listener Impressions for Pre-trained Multi-Speaker Text-to-Speech Systems.2024abs/2406.08812CoRRhttps://doi.org/10.48550/arXiv.2406.08812db/journals/corr/corr2406.html#abs-2406-08812Cheng GongErica CooperXin Wang 0037Chunyu QiangMengzhe GengDan WellsLongbiao WangJianwu Dang 0001Marc TessierAidan PineKorin RichmondJunichi YamagishiAn Initial Investigation of Language Adaptation for TTS Systems under Low-resource Scenarios.2024abs/2406.08911CoRRhttps://doi.org/10.48550/arXiv.2406.08911db/journals/corr/corr2406.html#abs-2406-08911Chang ZengXiaoxiao MiaoXin Wang 0037Erica CooperJunichi YamagishiSpoofing-Aware Speaker Verification Robust Against Domain and Channel Mismatches.2024abs/2409.06327CoRRhttps://doi.org/10.48550/arXiv.2409.06327db/journals/corr/corr2409.html#abs-2409-06327streams/journals/corrWen-Chin HuangSzu-Wei FuErica CooperRyandhimas E. ZezarioTomoki TodaHsin-Min WangJunichi YamagishiYu Tsao 0001The VoiceMOS Challenge 2024: Beyond Speech Quality Prediction.2024abs/2409.07001CoRRhttps://doi.org/10.48550/arXiv.2409.07001db/journals/corr/corr2409.html#abs-2409-07001streams/journals/corrLin ZhangXin Wang 0037Erica CooperNicholas W. D. EvansJunichi YamagishiThe PartialSpoof Database and Countermeasures for the Detection of Short Fake Speech Segments Embedded in an Utterance.813-825202331IEEE ACM Trans. Audio Speech Lang. Process.https://doi.org/10.1109/TASLP.2022.3233236db/journals/taslp/taslp31.html#ZhangWCEY23Xiaoxiao MiaoXin Wang 0037Erica CooperJunichi YamagishiNatalia A. TomashenkoSpeaker Anonymization Using Orthogonal Householder Neural Network.3681-3695202331IEEE ACM Trans. Audio Speech Lang. Process.https://doi.org/10.1109/TASLP.2023.3313429db/journals/taslp/taslp31.html#MiaoWCYT23Lifan ZhongErica CooperJunichi YamagishiNobuaki MinematsuExploring Isolated Musical Notes as Pre-training Data for Predominant Instrument Recognition in Polyphonic Music.2312-23192023APSIPA ASChttps://doi.org/10.1109/APSIPAASC58517.2023.10317292conf/apsipa/2023db/conf/apsipa/apsipa2023.html#ZhongCYM23Erica CooperWen-Chin HuangYu Tsao 0001Hsin-Min WangTomoki TodaJunichi YamagishiThe Voicemos Challenge 2023: Zero-Shot Subjective Speech Quality Prediction for Multiple Domains.1-72023ASRUhttps://doi.org/10.1109/ASRU57964.2023.10389763conf/asru/2023db/conf/asru/asru2023.html#CooperHTWTY23Hemant YadavErica CooperJunichi YamagishiSunayana SitaramRajiv Ratn ShahPartial Rank Similarity Minimization Method for Quality MOS Prediction of Unseen Speech Synthesis Systems in Zero-Shot and Semi-Supervised Setting.1-72023ASRUhttps://doi.org/10.1109/ASRU57964.2023.10389797conf/asru/2023db/conf/asru/asru2023.html#YadavCYSS23Xuan ShiErica CooperXin Wang 0037Junichi YamagishiShrikanth NarayananCan Knowledge of End-to-End Text-to-Speech Models Improve Neural Midi-to-Audio Synthesis Systems?1-52023ICASSPhttps://doi.org/10.1109/ICASSP49357.2023.10095848conf/icassp/2023db/conf/icassp/icassp2023.html#ShiCWYN23Erica CooperJunichi YamagishiInvestigating Range-Equalizing Bias in Mean Opinion Score Ratings of Synthesized Speech.1104-11082023INTERSPEECHhttps://doi.org/10.21437/Interspeech.2023-1076conf/interspeech/2023db/conf/interspeech/interspeech2023.html#CooperY23Chang ZengXin Wang 0037Xiaoxiao MiaoErica CooperJunichi YamagishiImproving Generalization Ability of Countermeasures for New Mismatch Scenario by Combining Multiple Advanced Regularization Terms.1998-20022023INTERSPEECHhttps://doi.org/10.21437/Interspeech.2023-125conf/interspeech/2023db/conf/interspeech/interspeech2023.html#Zeng0MCY23Lin ZhangXin Wang 0037Erica CooperNicholas W. D. EvansJunichi YamagishiRange-Based Equal Error Rate for Spoof Localization.3212-32162023INTERSPEECHhttps://doi.org/10.21437/Interspeech.2023-1214conf/interspeech/2023db/conf/interspeech/interspeech2023.html#Zhang0CEY23Orian SharoniRoee ShenbergErica CooperSASPEECH: A Hebrew Single Speaker Dataset for Text To Speech and Voice Conversion.5566-55702023INTERSPEECHhttps://doi.org/10.21437/Interspeech.2023-430conf/interspeech/2023db/conf/interspeech/interspeech2023.html#SharoniSC23Lin ZhangXin Wang 0037Erica CooperNicholas W. D. EvansJunichi YamagishiRange-Based Equal Error Rate for Spoof Localization.2023abs/2305.17739CoRRhttps://doi.org/10.48550/arXiv.2305.17739db/journals/corr/corr2305.html#abs-2305-17739Xiaoxiao MiaoXin Wang 0037Erica CooperJunichi YamagishiNatalia A. TomashenkoLanguage-independent speaker anonymization using orthogonal Householder neural network.2023abs/2305.18823CoRRhttps://doi.org/10.48550/arXiv.2305.18823db/journals/corr/corr2305.html#abs-2305-18823Lifan ZhongErica CooperJunichi YamagishiNobuaki MinematsuExploring Isolated Musical Notes as Pre-training Data for Predominant Instrument Recognition in Polyphonic Music.2023abs/2306.08850CoRRhttps://doi.org/10.48550/arXiv.2306.08850db/journals/corr/corr2306.html#abs-2306-08850Xiaoxiao MiaoXin Wang 0037Erica CooperJunichi YamagishiNicholas W. D. EvansMassimiliano TodiscoJean-François BonastreMickael RouvierSynVox2: Towards a privacy-friendly VoxCeleb2 dataset.2023abs/2309.06141CoRRhttps://doi.org/10.48550/arXiv.2309.06141db/journals/corr/corr2309.html#abs-2309-06141Nicolas JonasonXin Wang 0037Erica CooperLauri JuvelaBob L. T. SturmJunichi YamagishiDDSP-based Neural Waveform Synthesis of Polyphonic Guitar Performance from String-wise MIDI Input.2023abs/2309.07658CoRRhttps://doi.org/10.48550/arXiv.2309.07658db/journals/corr/corr2309.html#abs-2309-07658Hemant YadavErica CooperJunichi YamagishiSunayana SitaramRajiv Ratn ShahPartial Rank Similarity Minimization Method for Quality MOS Prediction of Unseen Speech Synthesis Systems in Zero-Shot and Semi-supervised setting.2023abs/2310.05078CoRRhttps://doi.org/10.48550/arXiv.2310.05078db/journals/corr/corr2310.html#abs-2310-05078Xuechen LiuXin Wang 0037Erica CooperXiaoxiao MiaoJunichi YamagishiSpeaker-Text Retrieval via Contrastive Learning.2023abs/2312.06055CoRRhttps://doi.org/10.48550/arXiv.2312.06055db/journals/corr/corr2312.html#abs-2312-06055Cheng GongXin Wang 0037Erica CooperDan WellsLongbiao WangJianwu Dang 0001Korin RichmondJunichi YamagishiZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations.2023abs/2312.14398CoRRhttps://doi.org/10.48550/arXiv.2312.14398db/journals/corr/corr2312.html#abs-2312-14398Aditya RavuriErica CooperJunichi YamagishiUncertainty as a Predictor: Leveraging Self-Supervised Learning for Zero-Shot MOS Prediction.2023abs/2312.15616CoRRhttps://doi.org/10.48550/arXiv.2312.15616db/journals/corr/corr2312.html#abs-2312-15616Xuan ShiErica CooperJunichi YamagishiUse of Speaker Recognition Approaches for Learning and Evaluating Embedding Representations of Musical Instrument Sounds.367-377202230IEEE ACM Trans. Audio Speech Lang. Process.https://doi.org/10.1109/TASLP.2022.3140549db/journals/taslp/taslp30.html#ShiCY22Wen-Chin HuangErica CooperJunichi YamagishiTomoki TodaLDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech.896-9002022ICASSPhttps://doi.org/10.1109/ICASSP43922.2022.9747222conf/icassp/2022db/conf/icassp/icassp2022.html#HuangCYT22Chang ZengXin Wang 0037Erica CooperXiaoxiao MiaoJunichi YamagishiAttention Back-End for Automatic Speaker Verification with Multiple Enrollment Utterances.6717-67212022ICASSPhttps://doi.org/10.1109/ICASSP43922.2022.9746688conf/icassp/2022db/conf/icassp/icassp2022.html#ZengWCMY22Erica CooperWen-Chin HuangTomoki TodaJunichi YamagishiGeneralization Ability of MOS Prediction Networks.8442-84462022ICASSPhttps://doi.org/10.1109/ICASSP43922.2022.9746395conf/icassp/2022db/conf/icassp/icassp2022.html#CooperHTY22Cheng-I Jeff LaiErica CooperYang Zhang 0001Shiyu ChangKaizhi QianYi-Lun LiaoYung-Sung ChuangAlexander H. LiuJunichi YamagishiDavid D. CoxJames R. GlassOn the Interplay between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis.8447-84512022ICASSPhttps://doi.org/10.1109/ICASSP43922.2022.9747728conf/icassp/2022db/conf/icassp/icassp2022.html#LaiCZCQLCLYCG22Xiaoxiao MiaoXin Wang 0037Erica CooperJunichi YamagishiNatalia A. TomashenkoAnalyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions.4426-44302022INTERSPEECHhttps://doi.org/10.21437/Interspeech.2022-11065conf/interspeech/2022db/conf/interspeech/interspeech2022.html#MiaoWCYT22Wen-Chin HuangErica CooperYu Tsao 0001Hsin-Min WangTomoki TodaJunichi YamagishiThe VoiceMOS Challenge 2022.4536-45402022INTERSPEECHhttps://doi.org/10.21437/Interspeech.2022-970conf/interspeech/2022db/conf/interspeech/interspeech2022.html#HuangC0WTY22Xiaoxiao MiaoXin Wang 0037Erica CooperJunichi YamagishiNatalia A. TomashenkoLanguage-Independent Speaker Anonymization Approach Using Self-Supervised Pre-Trained Models.279-2862022Odysseyhttps://doi.org/10.21437/Odyssey.2022-39conf/odyssey/2022db/conf/odyssey/odyssey2022.html#Miao0CYT22Xiaoxiao MiaoXin Wang 0037Erica CooperJunichi YamagishiNatalia A. TomashenkoLanguage-Independent Speaker Anonymization Approach using Self-Supervised Pre-Trained Models.2022abs/2202.13097CoRRhttps://arxiv.org/abs/2202.13097db/journals/corr/corr2202.html#abs-2202-13097Wen-Chin HuangErica CooperYu Tsao 0001Hsin-Min WangTomoki TodaJunichi YamagishiThe VoiceMOS Challenge 2022.2022abs/2203.11389CoRRhttps://doi.org/10.48550/arXiv.2203.11389db/journals/corr/corr2203.html#abs-2203-11389Xiaoxiao MiaoXin Wang 0037Erica CooperJunichi YamagishiNatalia A. TomashenkoAnalyzing Language-Independent Speaker Anonymization Framework under Unseen Conditions.2022abs/2203.14834CoRRhttps://doi.org/10.48550/arXiv.2203.14834db/journals/corr/corr2203.html#abs-2203-14834Lin ZhangXin Wang 0037Erica CooperNicholas W. D. EvansJunichi YamagishiThe PartialSpoof Database and Countermeasures for the Detection of Short Generated Audio Segments Embedded in a Speech Utterance.2022abs/2204.05177CoRRhttps://doi.org/10.48550/arXiv.2204.05177db/journals/corr/corr2204.html#abs-2204-05177Chang ZengXiaoxiao MiaoXin Wang 0037Erica CooperJunichi YamagishiJoint Speaker Encoder and Neural Back-end Model for Fully End-to-End Automatic Speaker Verification with Multiple Enrollment Utterances.2022abs/2209.00485CoRRhttps://doi.org/10.48550/arXiv.2209.00485db/journals/corr/corr2209.html#abs-2209-00485Xuan ShiErica CooperXin Wang 0037Junichi YamagishiShrikanth NarayananCan Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems?2022abs/2211.13868CoRRhttps://doi.org/10.48550/arXiv.2211.13868db/journals/corr/corr2211.html#abs-2211-13868Shuhei KatoYusuke YasudaXin Wang 0037Erica CooperJunichi YamagishiHow Similar or Different is Rakugo Speech Synthesizer to Professional Performers?6488-64922021ICASSPhttps://doi.org/10.1109/ICASSP39728.2021.9414175conf/icassp/2021db/conf/icassp/icassp2021.html#KatoYWCY21Jennifer Williams 0001Yi Zhao 0006Erica CooperJunichi YamagishiLearning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm.7053-70572021ICASSPhttps://doi.org/10.1109/ICASSP39728.2021.9413543conf/icassp/2021db/conf/icassp/icassp2021.html#WilliamsZCY21Lin ZhangXin Wang 0037Erica CooperJunichi YamagishiJose Patino 0001Nicholas W. D. EvansAn Initial Investigation for Detecting Partially Spoofed Audio.4264-42682021Interspeechhttps://doi.org/10.21437/Interspeech.2021-738conf/interspeech/2021db/conf/interspeech/interspeech2021.html#ZhangWCY0E21Jennifer Williams 0001Jason FongErica CooperJunichi YamagishiExploring Disentanglement with Multilingual and Monolingual VQ-VAE.124-1292021SSWhttps://doi.org/10.21437/SSW.2021-22conf/ssw/2021db/conf/ssw/ssw2021.html#0001FCY21Erica CooperXin Wang 0037Junichi YamagishiText-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis.130-1352021SSWhttps://doi.org/10.21437/SSW.2021-23conf/ssw/2021db/conf/ssw/ssw2021.html#Cooper0Y21Erica CooperJunichi YamagishiHow do Voices from Past Speech Synthesis Challenges Compare Today?183-1882021SSWhttps://doi.org/10.21437/SSW.2021-32conf/ssw/2021db/conf/ssw/ssw2021.html#CooperY21Chang ZengXin Wang 0037Erica CooperJunichi YamagishiAttention Back-end for Automatic Speaker Verification with Multiple Enrollment Utterances.2021abs/2104.01541CoRRhttps://arxiv.org/abs/2104.01541db/journals/corr/corr2104.html#abs-2104-01541Lin ZhangXin Wang 0037Erica CooperJunichi YamagishiJose Patino 0001Nicholas W. D. EvansAn Initial Investigation for Detecting Partially Spoofed Audio.2021abs/2104.02518CoRRhttps://arxiv.org/abs/2104.02518db/journals/corr/corr2104.html#abs-2104-02518Erica CooperXin Wang 0037Junichi YamagishiText-to-Speech Synthesis Techniques for MIDI-to-Audio Synthesis.2021abs/2104.12292CoRRhttps://arxiv.org/abs/2104.12292db/journals/corr/corr2104.html#abs-2104-12292Jennifer Williams 0001Jason FongErica CooperJunichi YamagishiExploring Disentanglement with Multilingual and Monolingual VQ-VAE.2021abs/2105.01573CoRRhttps://arxiv.org/abs/2105.01573db/journals/corr/corr2105.html#abs-2105-01573Erica CooperJunichi YamagishiHow do Voices from Past Speech Synthesis Challenges Compare Today?2021abs/2105.02373CoRRhttps://arxiv.org/abs/2105.02373db/journals/corr/corr2105.html#abs-2105-02373Xuan ShiErica CooperJunichi YamagishiUse of speaker recognition approaches for learning timbre representations of musical instrument sounds from raw waveforms.2021abs/2107.11506CoRRhttps://arxiv.org/abs/2107.11506db/journals/corr/corr2107.html#abs-2107-11506Lin ZhangXin Wang 0037Erica CooperJunichi YamagishiMulti-Task Learning in Utterance-Level and Segmental-Level Spoof Detection.2021abs/2107.14132CoRRhttps://arxiv.org/abs/2107.14132db/journals/corr/corr2107.html#abs-2107-14132Cheng-I Jeff LaiErica CooperYang Zhang 0001Shiyu ChangKaizhi QianYi-Lun LiaoYung-Sung ChuangAlexander H. LiuJunichi YamagishiDavid D. CoxJames R. GlassOn the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis.2021abs/2110.01147CoRRhttps://arxiv.org/abs/2110.01147db/journals/corr/corr2110.html#abs-2110-01147Wen-Chin HuangErica CooperJunichi YamagishiTomoki TodaLDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech.2021abs/2110.09103CoRRhttps://arxiv.org/abs/2110.09103db/journals/corr/corr2110.html#abs-2110-09103Shuhei KatoYusuke YasudaXin Wang 0037Erica CooperShinji TakakiJunichi YamagishiModeling of Rakugo Speech and Its Limitations: Toward Speech Synthesis That Entertains Audiences.138149-13816120208IEEE Accesshttps://doi.org/10.1109/ACCESS.2020.3011975db/journals/access/access8.html#KatoYWCTY20Erica CooperCheng-I LaiYusuke YasudaFuming FangXin Wang 0037Nanxin ChenJunichi YamagishiZero-Shot Multi-Speaker Text-To-Speech with State-Of-The-Art Neural Speaker Embeddings.6184-61882020ICASSPhttps://doi.org/10.1109/ICASSP40776.2020.9054535conf/icassp/2020db/conf/icassp/icassp2020.html#CooperLYFWCY20Erica CooperCheng-I LaiYusuke YasudaJunichi YamagishiCan Speaker Augmentation Improve Multi-Speaker End-to-End TTS?3979-39832020INTERSPEECHhttps://doi.org/10.21437/Interspeech.2020-1229conf/interspeech/2020db/conf/interspeech/interspeech2020.html#CooperLYY20Yi Zhao 0006Haoyu LiCheng-I LaiJennifer Williams 0001Erica CooperJunichi YamagishiImproved Prosody from Learned F0 Codebook Representations for VQ-VAE Speech Waveform Reconstruction.4417-44212020INTERSPEECHhttps://doi.org/10.21437/Interspeech.2020-1615conf/interspeech/2020db/conf/interspeech/interspeech2020.html#0006LLWCY20Yi Zhao 0006Haoyu LiCheng-I LaiJennifer Williams 0001Erica CooperJunichi YamagishiImproved Prosody from Learned F0 Codebook Representations for VQ-VAE Speech Waveform Reconstruction.2020abs/2005.07884CoRRhttps://arxiv.org/abs/2005.07884db/journals/corr/corr2005.html#abs-2005-07884Antoine PerquinErica CooperJunichi YamagishiGrapheme or phoneme? An Analysis of Tacotron's Embedded Representations.2020abs/2010.10694CoRRhttps://arxiv.org/abs/2010.10694db/journals/corr/corr2010.html#abs-2010-10694Jennifer Williams 0001Yi Zhao 0006Erica CooperJunichi YamagishiLearning Disentangled Phone and Speaker Representations in a Semi-Supervised VQ-VAE Paradigm.2020abs/2010.10727CoRRhttps://arxiv.org/abs/2010.10727db/journals/corr/corr2010.html#abs-2010-10727Shuhei KatoYusuke YasudaXin Wang 0037Erica CooperJunichi YamagishiHow Similar or Different Is Rakugo Speech Synthesizer to Professional Performers?2020abs/2010.11549CoRRhttps://arxiv.org/abs/2010.11549db/journals/corr/corr2010.html#abs-2010-11549Erica CooperXin Wang 0037Yi Zhao 0006Yusuke YasudaJunichi YamagishiPretraining Strategies, Waveform Model Choice, and Acoustic Configurations for Multi-Speaker End-to-End Speech Synthesis.2020abs/2011.04839CoRRhttps://arxiv.org/abs/2011.04839db/journals/corr/corr2011.html#abs-2011-04839Erica CooperText-to-Speech Synthesis Using Found Data for Low-Resource Languages.Columbia University, USA2019https://doi.org/10.7916/d8-vdzp-j870Shuhei KatoYusuke YasudaXin Wang 0037Erica CooperShinji TakakiJunichi YamagishiRakugo speech synthesis using segment-to-segment neural transduction and style tokens - toward speech synthesis for entertaining audiences.111-1162019SSWhttps://doi.org/10.21437/SSW.2019-20conf/ssw/2019db/conf/ssw/ssw2019.html#KatoY0CTY19Elshadai Tesfaye BiruYishak Tofik MohammedDavid TofuErica CooperJulia HirschbergSubset Selection, Adaptation, Gemination and Prosody Prediction for Amharic Text-to-Speech Synthesis.205-2102019SSWhttps://doi.org/10.21437/SSW.2019-37conf/ssw/2019db/conf/ssw/ssw2019.html#BiruMTCH19Kai-Zhan LeeErica CooperJulia HirschbergA Comparison of Speaker-based and Utterance-based Data Selection for Text-to-Speech Synthesis.2873-28772018INTERSPEECHhttps://doi.org/10.21437/Interspeech.2018-1313conf/interspeech/2018db/conf/interspeech/interspeech2018.html#LeeCH18Erica CooperXinyue WangAlison ChangYocheved LevitanJulia HirschbergUtterance Selection for Optimizing Intelligibility of TTS Voices Trained on ASR Data.3971-39752017INTERSPEECHhttps://doi.org/10.21437/Interspeech.2017-465conf/interspeech/2017db/conf/interspeech/interspeech2017.html#CooperWCLH17Gideon MendelsErica CooperJulia HirschbergBabler - Data Collection from the Web to Support Speech Recognition and Keyword Search.72-812016WAC@ACLhttps://doi.org/10.18653/v1/W16-2609conf/aclwac/2016db/conf/aclwac/aclwac2016.html#MendelsCH16Erica CooperAlison ChangYocheved LevitanJulia HirschbergData Selection and Adaptation for Naturalness in HMM-Based Speech Synthesis.357-3612016INTERSPEECHhttps://doi.org/10.21437/Interspeech.2016-502conf/interspeech/2016db/conf/interspeech/interspeech2016.html#CooperCLH16Gideon MendelsErica CooperVictor SotoJulia HirschbergMark J. F. GalesKate M. KnillAnton RagniHaipeng WangImproving speech recognition and keyword search for low resource languages using web data.829-8332015INTERSPEECHhttps://doi.org/10.21437/Interspeech.2015-260conf/interspeech/2015db/conf/interspeech/interspeech2015.html#MendelsCSHGKRW15Victor SotoErica CooperLidia ManguAndrew RosenbergJulia HirschbergRescoring Confusion Networks for Keyword Search.7088-70922014ICASSPhttps://doi.org/10.1109/ICASSP.2014.6854975conf/icassp/2014db/conf/icassp/icassp2014.html#SotoCMRH14Victor SotoErica CooperAndrew RosenbergJulia HirschbergCross-language phrase boundary detection.8460-84642013ICASSPhttps://doi.org/10.1109/ICASSP.2013.6639316conf/icassp/2013db/conf/icassp/icassp2013.html#SotoCRH13Dogan CanErica CooperAbhinav SethyChristopher M. WhiteBhuvana RamabhadranMurat SaraclarEffect of pronounciations on OOV queries in spoken term detection.3957-39602009ICASSPhttps://doi.org/10.1109/ICASSP.2009.4960494https://doi.ieeecomputersociety.org/10.1109/ICASSP.2009.4960494conf/icassp/2009db/conf/icassp/icassp2009.html#CanCSWRS09Christopher M. WhiteAbhinav SethyBhuvana RamabhadranPatrick J. WolfeErica CooperMurat SaraclarJames K. BakerUnsupervised pronunciation validation.4301-43042009ICASSPhttps://doi.org/10.1109/ICASSP.2009.4960580https://doi.ieeecomputersociety.org/10.1109/ICASSP.2009.4960580conf/icassp/2009db/conf/icassp/icassp2009.html#WhiteSRWCSB09Dogan CanErica CooperArnab GhoshalMartin JanscheSanjeev KhudanpurBhuvana RamabhadranMichael Riley 0001Murat SaraclarAbhinav SethyMorgan UlinskiChristopher M. WhiteWeb derived pronunciations for spoken term detection.83-902009SIGIRhttps://doi.org/10.1145/1571941.1571958conf/sigir/2009db/conf/sigir/sigir2009.html#CanCGJKRRSSUW09James K. BakerElshadai Tesfaye BiruJean-François BonastreDogan CanAlison ChangShiyu ChangNanxin ChenZhengyang ChenYung-Sung ChuangDavid D. CoxJianwu Dang 0001Mireia DíezNicholas W. D. EvansFuming FangJason FongSzu-Wei FuMark J. F. GalesMengzhe GengArnab GhoshalJames R. GlassCheng GongDavid GuzmánJulia HirschbergWen-Chin HuangMartin JanscheEric JoanisNicolas JonasonLauri JuvelaShuhei KatoAnna KazantsevaSanjeev KhudanpurKate M. KnillRoss KrekoskiRoland Kuhn 0001Cheng-I LaiCheng-I Jeff LaiFederico LandiniSamuel LarkinKai-Zhan LeeYocheved LevitanHaoyu LiYi-Lun LiaoPatrick LittellAlexander H. LiuXuechen LiuDelaney LothianLidia ManguAkwiratékha' MartinGideon MendelsXiaoxiao MiaoNobuaki MinematsuYishak Tofik MohammedShri NarayananShrikanth NarayananJose Patino 0001Antoine PerquinAidan PineKaizhi QianYanmin QianChunyu QiangAnton RagniBhuvana RamabhadranAditya RavuriKorin RichmondMichael Riley 0001Andrew RosenbergMickael RouvierMurat SaraclarAbhinav SethyRajiv Ratn ShahOrian SharoniRoee ShenbergXuan ShiSunayana SitaramVictor SotoBob L. T. SturmShinji TakakiMarc TessierTomoki TodaMassimiliano TodiscoDavid TofuNatalia A. TomashenkoYu Tsao 0001Morgan UlinskiCassia Valentini-BotinhaoHaipeng WangHsin-Min WangLongbiao WangXin Wang 0037Xinyue WangDan WellsChristopher M. WhiteJennifer Williams 0001Patrick J. WolfeHemant YadavJunichi YamagishiYusuke YasudaChang ZengRyandhimas E. ZezarioLin ZhangYang Zhang 0001Yi Zhao 0006Lifan Zhong