Link to original content: https://dblp.org/pid/248/9136.ris

Provider: Schloss Dagstuhl - Leibniz Center for Informatics
Database: dblp computer science bibliography
Content:text/plain; charset="utf-8"

TY  - CPAPER
ID  - DBLP:conf/eccv/LiWHOA24
AU  - Li, Tingle
AU  - Wang, Renhao
AU  - Huang, Po-Yao
AU  - Owens, Andrew
AU  - Anumanchipalli, Gopala
TI  - Self-Supervised Audio-Visual Soundscape Stylization.
BT  - Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part LXXX
SP  - 20
EP  - 40
PY  - 2024//
DO  - 10.1007/978-3-031-72989-8_2
UR  - https://doi.org/10.1007/978-3-031-72989-8_2
ER  -

TY  - Informal or Other Publication
ID  - DBLP:journals/corr/abs-2409-14340
AU  - Li, Tingle
AU  - Wang, Renhao
AU  - Huang, Po-Yao
AU  - Owens, Andrew
AU  - Anumanchipalli, Gopala
TI  - Self-Supervised Audio-Visual Soundscape Stylization.
JO  - CoRR
VL  - abs/2409.14340
PY  - 2024//
DO  - 10.48550/ARXIV.2409.14340
UR  - https://doi.org/10.48550/arXiv.2409.14340
ER  -

TY  - CPAPER
ID  - DBLP:conf/asru/LianFFLKCWNLA23
AU  - Lian, Jiachen
AU  - Feng, Carly
AU  - Farooqi, Naasir
AU  - Li, Steve
AU  - Kashyap, Anshul
AU  - Cho, Cheol Jun
AU  - Wu, Peter
AU  - Netzorg, Robbie
AU  - Li, Tingle
AU  - Anumanchipalli, Gopala Krishna
TI  - Unconstrained Dysfluency Modeling for Dysfluent Speech Transcription and Detection.
BT  - IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2023, Taipei, Taiwan, December 16-20, 2023
SP  - 1
EP  - 8
PY  - 2023//
DO  - 10.1109/ASRU57964.2023.10389771
UR  - https://doi.org/10.1109/ASRU57964.2023.10389771
ER  -

TY  - CPAPER
ID  - DBLP:conf/icml/DuTLLYWYZ23
AU  - Du, Chenzhuang
AU  - Teng, Jiaye
AU  - Li, Tingle
AU  - Liu, Yichen
AU  - Yuan, Tianyuan
AU  - Wang, Yue
AU  - Yuan, Yang
AU  - Zhao, Hang
TI  - On Uni-Modal Feature Learning in Supervised Multi-Modal Learning.
BT  - International Conference on Machine Learning, ICML 2023, 23-29 July 2023, Honolulu, Hawaii, USA.
SP  - 8632
EP  - 8656
PY  - 2023//
UR  - https://proceedings.mlr.press/v202/du23e.html
ER  -

TY  - CPAPER
ID  - DBLP:conf/interspeech/WuLLZLBG0A23
AU  - Wu, Peter
AU  - Li, Tingle
AU  - Lu, Yijing
AU  - Zhang, Yubin
AU  - Lian, Jiachen
AU  - Black, Alan W.
AU  - Goldstein, Louis
AU  - Watanabe, Shinji
AU  - Anumanchipalli, Gopala Krishna
TI  - Deep Speech Synthesis from MRI-Based Articulatory Representations.
BT  - 24th Annual Conference of the International Speech Communication Association, Interspeech 2023, Dublin, Ireland, August 20-24, 2023.
SP  - 5132
EP  - 5136
PY  - 2023//
DO  - 10.21437/INTERSPEECH.2023-2316
UR  - https://doi.org/10.21437/Interspeech.2023-2316
ER  -

TY  - Informal or Other Publication
ID  - DBLP:journals/corr/abs-2305-01233
AU  - Du, Chenzhuang
AU  - Teng, Jiaye
AU  - Li, Tingle
AU  - Liu, Yichen
AU  - Yuan, Tianyuan
AU  - Wang, Yue
AU  - Yuan, Yang
AU  - Zhao, Hang
TI  - On Uni-Modal Feature Learning in Supervised Multi-Modal Learning.
JO  - CoRR
VL  - abs/2305.01233
PY  - 2023//
DO  - 10.48550/ARXIV.2305.01233
UR  - https://doi.org/10.48550/arXiv.2305.01233
ER  -

TY  - Informal or Other Publication
ID  - DBLP:journals/corr/abs-2312-12810
AU  - Lian, Jiachen
AU  - Feng, Carly
AU  - Farooqi, Naasir
AU  - Li, Steve
AU  - Kashyap, Anshul
AU  - Cho, Cheol Jun
AU  - Wu, Peter
AU  - Netzorg, Robbie
AU  - Li, Tingle
AU  - Anumanchipalli, Gopala Krishna
TI  - Unconstrained Dysfluency Modeling for Dysfluent Speech Transcription and Detection.
JO  - CoRR
VL  - abs/2312.12810
PY  - 2023//
DO  - 10.48550/ARXIV.2312.12810
UR  - https://doi.org/10.48550/arXiv.2312.12810
ER  -

TY  - CPAPER
ID  - DBLP:conf/eccv/LiLOZ22
AU  - Li, Tingle
AU  - Liu, Yichen
AU  - Owens, Andrew
AU  - Zhao, Hang
TI  - Learning Visual Styles from Audio-Visual Associations.
BT  - Computer Vision - ECCV 2022 - 17th European Conference, Tel Aviv, Israel, October 23-27, 2022, Proceedings, Part XXXVII
SP  - 235
EP  - 252
PY  - 2022//
DO  - 10.1007/978-3-031-19836-6_14
UR  - https://doi.org/10.1007/978-3-031-19836-6_14
ER  -

TY  - CPAPER
ID  - DBLP:conf/interspeech/ZhaoYLZN22
AU  - Zhao, Running
AU  - Yu, Jiangtao
AU  - Li, Tingle
AU  - Zhao, Hang
AU  - Ngai, Edith C. H.
TI  - Radio2Speech: High Quality Speech Recovery from Radio Frequency Signals.
BT  - 23rd Annual Conference of the International Speech Communication Association, Interspeech 2022, Incheon, Korea, September 18-22, 2022.
SP  - 4666
EP  - 4670
PY  - 2022//
DO  - 10.21437/INTERSPEECH.2022-738
UR  - https://doi.org/10.21437/Interspeech.2022-738
ER  -

TY  - Informal or Other Publication
ID  - DBLP:journals/corr/abs-2205-05072
AU  - Li, Tingle
AU  - Liu, Yichen
AU  - Owens, Andrew
AU  - Zhao, Hang
TI  - Learning Visual Styles from Audio-Visual Associations.
JO  - CoRR
VL  - abs/2205.05072
PY  - 2022//
DO  - 10.48550/ARXIV.2205.05072
UR  - https://doi.org/10.48550/arXiv.2205.05072
ER  -

TY  - Informal or Other Publication
ID  - DBLP:journals/corr/abs-2206-11066
AU  - Zhao, Running
AU  - Yu, Jiangtao
AU  - Li, Tingle
AU  - Zhao, Hang
AU  - Ngai, Edith C. H.
TI  - Radio2Speech: High Quality Speech Recovery from Radio Frequency Signals.
JO  - CoRR
VL  - abs/2206.11066
PY  - 2022//
DO  - 10.48550/ARXIV.2206.11066
UR  - https://doi.org/10.48550/arXiv.2206.11066
ER  -

TY  - CPAPER
ID  - DBLP:conf/interspeech/LiLHZ21
AU  - Li, Tingle
AU  - Liu, Yichen
AU  - Hu, Chenxu
AU  - Zhao, Hang
TI  - CVC: Contrastive Learning for Non-Parallel Voice Conversion.
BT  - 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30 - September 3, 2021.
SP  - 1324
EP  - 1328
PY  - 2021//
DO  - 10.21437/INTERSPEECH.2021-137
UR  - https://doi.org/10.21437/Interspeech.2021-137
ER  -

TY  - CPAPER
ID  - DBLP:conf/iscslp/LiCHL21
AU  - Li, Tingle
AU  - Chen, Jiawei
AU  - Hou, Haowen
AU  - Li, Ming
TI  - Sams-Net: A Sliced Attention-based Neural Network for Music Source Separation.
BT  - 12th International Symposium on Chinese Spoken Language Processing, ISCSLP 2021, Hong Kong, January 24-27, 2021
SP  - 1
EP  - 5
PY  - 2021//
DO  - 10.1109/ISCSLP49672.2021.9362081
UR  - https://doi.org/10.1109/ISCSLP49672.2021.9362081
ER  -

TY  - CPAPER
ID  - DBLP:conf/nips/HuTLWWZ21
AU  - Hu, Chenxu
AU  - Tian, Qiao
AU  - Li, Tingle
AU  - Wang, Yuping
AU  - Wang, Yuxuan
AU  - Zhao, Hang
TI  - Neural Dubber: Dubbing for Videos According to Scripts.
BT  - Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual.
SP  - 16582
EP  - 16595
PY  - 2021//
UR  - https://proceedings.neurips.cc/paper/2021/hash/8a9c8ac001d3ef9e4ce39b1177295e03-Abstract.html
ER  -

TY  - Informal or Other Publication
ID  - DBLP:journals/corr/abs-2106-11059
AU  - Du, Chenzhuang
AU  - Li, Tingle
AU  - Liu, Yichen
AU  - Wen, Zixin
AU  - Hua, Tianyu
AU  - Wang, Yue
AU  - Zhao, Hang
TI  - Improving Multi-Modal Learning with Uni-Modal Teachers.
JO  - CoRR
VL  - abs/2106.11059
PY  - 2021//
UR  - https://arxiv.org/abs/2106.11059
ER  -

TY  - Informal or Other Publication
ID  - DBLP:journals/corr/abs-2110-08243
AU  - Hu, Chenxu
AU  - Tian, Qiao
AU  - Li, Tingle
AU  - Wang, Yuping
AU  - Wang, Yuxuan
AU  - Zhao, Hang
TI  - Neural Dubber: Dubbing for Silent Videos According to Scripts.
JO  - CoRR
VL  - abs/2110.08243
PY  - 2021//
UR  - https://arxiv.org/abs/2110.08243
ER  -

TY  - CPAPER
ID  - DBLP:conf/interspeech/LiLBL20
AU  - Li, Tingle
AU  - Lin, Qingjian
AU  - Bao, Yuanyuan
AU  - Li, Ming
TI  - Atss-Net: Target Speaker Separation via Attention-Based Neural Network.
BT  - 21st Annual Conference of the International Speech Communication Association, Interspeech 2020, Virtual Event, Shanghai, China, October 25-29, 2020.
SP  - 1411
EP  - 1415
PY  - 2020//
DO  - 10.21437/INTERSPEECH.2020-1436
UR  - https://doi.org/10.21437/Interspeech.2020-1436
ER  -

TY  - CPAPER
ID  - DBLP:conf/interspeech/LinLL20
AU  - Lin, Qingjian
AU  - Li, Tingle
AU  - Li, Ming
TI  - The DKU Speech Activity Detection and Speaker Identification Systems for Fearless Steps Challenge Phase-02.
BT  - 21st Annual Conference of the International Speech Communication Association, Interspeech 2020, Virtual Event, Shanghai, China, October 25-29, 2020.
SP  - 2607
EP  - 2611
PY  - 2020//
DO  - 10.21437/INTERSPEECH.2020-1915
UR  - https://doi.org/10.21437/Interspeech.2020-1915
ER  -

TY  - CPAPER
ID  - DBLP:conf/odyssey/LinLYWL20
AU  - Lin, Qingjian
AU  - Li, Tingle
AU  - Yang, Lin
AU  - Wang, Junjie
AU  - Li, Ming
TI  - Optimal Mapping Loss: A Faster Loss for End-to-End Speaker Diarization.
BT  - Odyssey 2020: The Speaker and Language Recognition Workshop, 1-5 November 2020, Tokyo, Japan
SP  - 125
EP  - 131
PY  - 2020//
DO  - 10.21437/ODYSSEY.2020-18
UR  - https://doi.org/10.21437/Odyssey.2020-18
ER  -

TY  - Informal or Other Publication
ID  - DBLP:journals/corr/abs-2005-09200
AU  - Li, Tingle
AU  - Lin, Qingjian
AU  - Bao, Yuanyuan
AU  - Li, Ming
TI  - Atss-Net: Target Speaker Separation via Attention-based Neural Network.
JO  - CoRR
VL  - abs/2005.09200
PY  - 2020//
UR  - https://arxiv.org/abs/2005.09200
ER  -

TY  - Informal or Other Publication
ID  - DBLP:journals/corr/abs-2011-00782
AU  - Li, Tingle
AU  - Liu, Yichen
AU  - Hu, Chenxu
AU  - Zhao, Hang
TI  - CVC: Contrastive Learning for Non-parallel Voice Conversion.
JO  - CoRR
VL  - abs/2011.00782
PY  - 2020//
UR  - https://arxiv.org/abs/2011.00782
ER  -

TY  - Informal or Other Publication
ID  - DBLP:journals/corr/abs-1909-05746
AU  - Li, Tingle
AU  - Chen, Jiawei
AU  - Hou, Haowen
AU  - Li, Ming
TI  - TF-Attention-Net: An End To End Neural Network For Singing Voice Separation.
JO  - CoRR
VL  - abs/1909.05746
PY  - 2019//
UR  - http://arxiv.org/abs/1909.05746
ER  -