dblp: Keyu An
https://dblp.org/pid/254/2051.html
dblp person page RSS feedSat, 30 Nov 2024 01:11:11 +0100en-USdaily1released under the CC0 1.0 licensedblp@dagstuhl.de (dblp team)dblp@dagstuhl.de (dblp team)Computers/Computer_Science/Publications/Bibliographieshttp://www.rssboard.org/rss-specificationhttps://dblp.org/img/logo.144x51.pngdblp: Keyu Anhttps://dblp.org/pid/254/2051.html14451FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs.https://doi.org/10.48550/arXiv.2407.04051Keyu An, Qian Chen, Chong Deng, Zhihao Du, Changfeng Gao, Zhifu Gao, Yue Gu, Ting He, Hangrui Hu, Kai Hu, Shengpeng Ji, Yabin Li, Zerui Li, Heng Lu, Haoneng Luo, Xiang Lv, Bin Ma, Ziyang Ma, Chongjia Ni, Changhe Song, Jiaqi Shi, Xian Shi, Hao Wang, Wen Wang, Yuxuan Wang, Zhangyu Xiao, Zhijie Yan, Yexin Yang, Bin Zhang, Qinglin Zhang, Shiliang Zhang, Nan Zhao, Siqi Zheng: FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs.CoRRabs/2407.04051 (2024)]]>https://dblp.org/rec/journals/corr/abs-2407-04051Mon, 01 Jan 2024 00:00:00 +0100Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition.https://doi.org/10.48550/arXiv.2409.17746Keyu An, Zerui Li, Zhifu Gao, Shiliang Zhang: Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition.CoRRabs/2409.17746 (2024)]]>https://dblp.org/rec/journals/corr/abs-2409-17746Mon, 01 Jan 2024 00:00:00 +0100Are Transformers in Pre-trained LM A Good ASR Encoder? An Empirical Study.https://doi.org/10.48550/arXiv.2409.17750Keyu An, Shiliang Zhang, Zhijie Yan: Are Transformers in Pre-trained LM A Good ASR Encoder? An Empirical Study.CoRRabs/2409.17750 (2024)]]>https://dblp.org/rec/journals/corr/abs-2409-17750Mon, 01 Jan 2024 00:00:00 +0100Analysis of Omni-Channel Evolution Game Strategy for E-Commerce Enterprises in the Context of Online and Offline Integration.https://doi.org/10.3390/systems11070321Yingying Cheng, Bo Xie, Keyu An: Analysis of Omni-Channel Evolution Game Strategy for E-Commerce Enterprises in the Context of Online and Offline Integration.Syst.11(7): 321 (2023)]]>https://dblp.org/rec/journals/systems/ChengXA23Sat, 01 Jul 2023 01:00:00 +0200BAT: Boundary aware transducer for memory-efficient and low-latency ASR.https://doi.org/10.21437/Interspeech.2023-770Keyu An, Xian Shi, Shiliang Zhang: BAT: Boundary aware transducer for memory-efficient and low-latency ASR.INTERSPEECH2023: 4963-4967]]>https://dblp.org/rec/conf/interspeech/AnSZ23Sun, 01 Jan 2023 00:00:00 +0100Exploring RWKV for Memory Efficient and Low Latency Streaming ASR.https://doi.org/10.48550/arXiv.2309.14758Keyu An, Shiliang Zhang: Exploring RWKV for Memory Efficient and Low Latency Streaming ASR.CoRRabs/2309.14758 (2023)]]>https://dblp.org/rec/journals/corr/abs-2309-14758Sun, 01 Jan 2023 00:00:00 +0100Advancing VAD Systems Based on Multi-Task Learning with Improved Model Structures.https://doi.org/10.48550/arXiv.2312.14860Lingyun Zuo, Keyu An, Shiliang Zhang, Zhijie Yan: Advancing VAD Systems Based on Multi-Task Learning with Improved Model Structures.CoRRabs/2312.14860 (2023)]]>https://dblp.org/rec/journals/corr/abs-2312-14860Sun, 01 Jan 2023 00:00:00 +0100Dynamic Research on Three-Player Evolutionary Game in Waste Product Recycling Supply Chain System.https://doi.org/10.3390/systems10050185Bo Xie, Keyu An, Yingying Cheng: Dynamic Research on Three-Player Evolutionary Game in Waste Product Recycling Supply Chain System.Syst.10(5): 185 (2022)]]>https://dblp.org/rec/journals/systems/XieAC22Sat, 01 Jan 2022 00:00:00 +0100CUSIDE: Chunking, Simulating Future Context and Decoding for Streaming ASR.https://doi.org/10.21437/Interspeech.2022-11214Keyu An, Huahuan Zheng, Zhijian Ou, Hongyu Xiang, Ke Ding, Guanglu Wan: CUSIDE: Chunking, Simulating Future Context and Decoding for Streaming ASR.INTERSPEECH2022: 2103-2107]]>https://dblp.org/rec/conf/interspeech/AnZOXDW22Sat, 01 Jan 2022 00:00:00 +0100An Empirical Study of Language Model Integration for Transducer based Speech Recognition.https://doi.org/10.21437/Interspeech.2022-10576Huahuan Zheng, Keyu An, Zhijian Ou, Chen Huang, Ke Ding, Guanglu Wan: An Empirical Study of Language Model Integration for Transducer based Speech Recognition.INTERSPEECH2022: 3904-3908]]>https://dblp.org/rec/conf/interspeech/ZhengAOHDW22Sat, 01 Jan 2022 00:00:00 +0100Exploiting Single-Channel Speech for Multi-Channel End-to-End Speech Recognition: A Comparative Study.https://doi.org/10.1109/ISCSLP57327.2022.10038153Keyu An, Ji Xiao, Zhijian Ou: Exploiting Single-Channel Speech for Multi-Channel End-to-End Speech Recognition: A Comparative Study.ISCSLP2022: 180-184]]>https://dblp.org/rec/conf/iscslp/AnXO22Sat, 01 Jan 2022 00:00:00 +0100Exploiting Single-Channel Speech for Multi-Channel End-to-End Speech Recognition: A Comparative Study.https://doi.org/10.48550/arXiv.2203.16757Keyu An, Zhijian Ou: Exploiting Single-Channel Speech for Multi-Channel End-to-End Speech Recognition: A Comparative Study.CoRRabs/2203.16757 (2022)]]>https://dblp.org/rec/journals/corr/abs-2203-16757Sat, 01 Jan 2022 00:00:00 +0100CUSIDE: Chunking, Simulating Future Context and Decoding for Streaming ASR.https://doi.org/10.48550/arXiv.2203.16758Keyu An, Huahuan Zheng, Zhijian Ou, Hongyu Xiang, Ke Ding, Guanglu Wan: CUSIDE: Chunking, Simulating Future Context and Decoding for Streaming ASR.CoRRabs/2203.16758 (2022)]]>https://dblp.org/rec/journals/corr/abs-2203-16758Sat, 01 Jan 2022 00:00:00 +0100An Empirical Study of Language Model Integration for Transducer based Speech Recognition.https://doi.org/10.48550/arXiv.2203.16776Huahuan Zheng, Keyu An, Zhijian Ou, Chen Huang, Ke Ding, Guanglu Wan: An Empirical Study of Language Model Integration for Transducer based Speech Recognition.CoRRabs/2203.16776 (2022)]]>https://dblp.org/rec/journals/corr/abs-2203-16776Sat, 01 Jan 2022 00:00:00 +0100Multilingual and Crosslingual Speech Recognition Using Phonological-Vector Based Phone Embeddings.https://doi.org/10.1109/ASRU51503.2021.9687966Chengrui Zhu, Keyu An, Huahuan Zheng, Zhijian Ou: Multilingual and Crosslingual Speech Recognition Using Phonological-Vector Based Phone Embeddings.ASRU2021: 1034-1041]]>https://dblp.org/rec/conf/asru/ZhuAZO21Fri, 01 Jan 2021 00:00:00 +0100Deformable TDNN with Adaptive Receptive Fields for Speech Recognition.https://doi.org/10.21437/Interspeech.2021-387Keyu An, Yi Zhang, Zhijian Ou: Deformable TDNN with Adaptive Receptive Fields for Speech Recognition.Interspeech2021: 2067-2071]]>https://dblp.org/rec/conf/interspeech/AnZO21Fri, 01 Jan 2021 00:00:00 +0100Efficient Neural Architecture Search for End-to-End Speech Recognition Via Straight-Through Gradients.https://doi.org/10.1109/SLT48900.2021.9383527Huahuan Zheng, Keyu An, Zhijian Ou: Efficient Neural Architecture Search for End-to-End Speech Recognition Via Straight-Through Gradients.SLT2021: 60-67]]>https://dblp.org/rec/conf/slt/ZhengAO21Fri, 01 Jan 2021 00:00:00 +0100The SLT 2021 Children Speech Recognition Challenge: Open Datasets, Rules and Baselines.https://doi.org/10.1109/SLT48900.2021.9383608Fan Yu, Zhuoyuan Yao, Xiong Wang, Keyu An, Lei Xie, Zhijian Ou, Bo Liu, Xiulin Li, Guanqiong Miao: The SLT 2021 Children Speech Recognition Challenge: Open Datasets, Rules and Baselines.SLT2021: 1117-1123]]>https://dblp.org/rec/conf/slt/YuYWAXOLLM21Fri, 01 Jan 2021 00:00:00 +0100Deformable TDNN with adaptive receptive fields for speech recognition.https://arxiv.org/abs/2104.14791Keyu An, Yi Zhang, Zhijian Ou: Deformable TDNN with adaptive receptive fields for speech recognition.CoRRabs/2104.14791 (2021)]]>https://dblp.org/rec/journals/corr/abs-2104-14791Fri, 01 Jan 2021 00:00:00 +0100Multilingual and crosslingual speech recognition using phonological-vector based phone embeddings.https://arxiv.org/abs/2107.05038Chengrui Zhu, Keyu An, Huahuan Zheng, Zhijian Ou: Multilingual and crosslingual speech recognition using phonological-vector based phone embeddings.CoRRabs/2107.05038 (2021)]]>https://dblp.org/rec/journals/corr/abs-2107-05038Fri, 01 Jan 2021 00:00:00 +0100Sequential Deformation for Accurate Scene Text Detection.https://doi.org/10.1007/978-3-030-58526-6_7Shanyu Xiao, Liangrui Peng, Ruijie Yan, Keyu An, Gang Yao, Jaesik Min: Sequential Deformation for Accurate Scene Text Detection.ECCV (29)2020: 108-124]]>https://dblp.org/rec/conf/eccv/XiaoPYAYM20Wed, 01 Jan 2020 00:00:00 +0100CAT: A CTC-CRF Based ASR Toolkit Bridging the Hybrid and the End-to-End Approaches Towards Data Efficiency and Low Latency.https://doi.org/10.21437/Interspeech.2020-2732Keyu An, Hongyu Xiang, Zhijian Ou: CAT: A CTC-CRF Based ASR Toolkit Bridging the Hybrid and the End-to-End Approaches Towards Data Efficiency and Low Latency.INTERSPEECH2020: 566-570]]>https://dblp.org/rec/conf/interspeech/AnXO20Wed, 01 Jan 2020 00:00:00 +0100CAT: A CTC-CRF based ASR Toolkit Bridging the Hybrid and the End-to-end Approaches towards Data Efficiency and Low Latency.https://arxiv.org/abs/2005.13326Keyu An, Hongyu Xiang, Zhijian Ou: CAT: A CTC-CRF based ASR Toolkit Bridging the Hybrid and the End-to-end Approaches towards Data Efficiency and Low Latency.CoRRabs/2005.13326 (2020)]]>https://dblp.org/rec/journals/corr/abs-2005-13326Wed, 01 Jan 2020 00:00:00 +0100Efficient Neural Architecture Search for End-to-end Speech Recognition via Straight-Through Gradients.https://arxiv.org/abs/2011.05649Huahuan Zheng, Keyu An, Zhijian Ou: Efficient Neural Architecture Search for End-to-end Speech Recognition via Straight-Through Gradients.CoRRabs/2011.05649 (2020)]]>https://dblp.org/rec/journals/corr/abs-2011-05649Wed, 01 Jan 2020 00:00:00 +0100The SLT 2021 children speech recognition challenge: Open datasets, rules and baselines.https://arxiv.org/abs/2011.06724Fan Yu, Zhuoyuan Yao, Xiong Wang, Keyu An, Lei Xie, Zhijian Ou, Bo Liu, Xiulin Li, Guanqiong Miao: The SLT 2021 children speech recognition challenge: Open datasets, rules and baselines.CoRRabs/2011.06724 (2020)]]>https://dblp.org/rec/journals/corr/abs-2011-06724Wed, 01 Jan 2020 00:00:00 +0100CAT: CRF-based ASR Toolkit.http://arxiv.org/abs/1911.08747Keyu An, Hongyu Xiang, Zhijian Ou: CAT: CRF-based ASR Toolkit.CoRRabs/1911.08747 (2019)]]>https://dblp.org/rec/journals/corr/abs-1911-08747Tue, 01 Jan 2019 00:00:00 +0100