default search action
Atsuhiko Kai
Person information
SPARQL queries
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2023
- [c37]Yoshiki Niimura, Jun Takemoto, Atsuhiko Kai, Seiichi Nakagawa:
Attention-based CNN and Relative Phase Feature Modeling for Improved Imagined Speech Recognition. APSIPA ASC 2023: 8-14 - [c36]Shogo Miwa, Atsuhiko Kai:
Dialect Speech Recognition Modeling using Corpus of Japanese Dialects and Self-Supervised Learning-based Model XLSR. INTERSPEECH 2023: 4928-4932 - 2022
- [j11]Raufun Nahar, Shogo Miwa, Atsuhiko Kai:
Domain Adaptation with Augmented Data by Deep Neural Network Based Method Using Re-Recorded Speech for Automatic Speech Recognition in Real Environment. Sensors 22(24): 9945 (2022) - 2021
- [c35]Takumi Kurokawa, Atsuhiko Kai:
Retrieval-oriented E2E ASR Modeling for Improved Query-by-example Spoken Term Detection. APSIPA ASC 2021: 1037-1042 - [c34]Takumi Kurokawa, Atsuhiko Kai:
Robust Query-by-example Spoken Term Detection for Unknown Words Using Speech Retrieval-oriented E2E ASR Modeling. GCCE 2021: 316-317 - [c33]Ryota Sakai, Atsuhiko Kai, Seiichi Nakagawa:
Classification of Imagined and Heard Speech Using Amplitude Spectrum and Relative Phase of EEG. LifeTech 2021: 373-375 - 2020
- [c32]Takumi Kurokawa, Atsuhiko Kai, Hiroki Kondo:
Effects of End-to-end ASR and Score Fusion Model Learning for Improved Query-by-example Spoken Term Detection. APSIPA 2020: 654-661 - [c31]Raufun Nahar, Atsuhiko Kai:
Effect of Data Augmentation on DNN-Based VAD for Automatic Speech Recognition in Noisy Environment. GCCE 2020: 368-372
2010 – 2019
- 2018
- [c30]Raufun Nahar, Takashi Kawai, Atsuhiko Kai:
Multi-Condition Training of Denoising Autoencoder by Augmenting Simulated Reverberant Speech Data. GCCE 2018: 334-338 - 2017
- [c29]Yuji Terada, Kenta Tamiya, Atsuhiko Kai:
Investigation of efficient semi-automatic correction method using STD for automatic captioning. GCCE 2017: 1-2 - 2016
- [j10]Bo Ren, Longbiao Wang, Liang Lu, Yuma Ueda, Atsuhiko Kai:
Combination of bottleneck feature extraction and dereverberation for distant-talking speech recognition. Multim. Tools Appl. 75(9): 5093-5108 (2016) - [j9]Yuma Ueda, Longbiao Wang, Atsuhiko Kai, Xiong Xiao, Engsiong Chng, Haizhou Li:
Single-channel Dereverberation for Distant-Talking Speech Recognition by Combining Denoising Autoencoder and Temporal Structure Normalization. J. Signal Process. Syst. 82(2): 151-161 (2016) - [c28]Shuji Oishi, Tatsuya Matsuba, Mitsuaki Makino, Atsuhiko Kai:
Combining State-Level Spotting and Posterior-Based Acoustic Match for Improved Query-by-Example Spoken Term Detection. INTERSPEECH 2016: 740-744 - [c27]Shuji Oishi, Tatsuya Matsuba, Mitsuaki Makino, Atsuhiko Kai:
Combining State-level and DNN-based Acoustic Matches for Efficient Spoken Term Detection in NTCIR-12 SpokenQuery&Doc-2 Task. NTCIR 2016 - 2015
- [j8]Zhaofeng Zhang, Longbiao Wang, Atsuhiko Kai, Takanori Yamada, Weifeng Li, Masahiro Iwahashi:
Deep neural network-based bottleneck feature and denoising autoencoder-based dereverberation for distant-talking speaker identification. EURASIP J. Audio Speech Music. Process. 2015: 12 (2015) - [j7]Yuma Ueda, Longbiao Wang, Atsuhiko Kai, Bo Ren:
Environment-dependent denoising autoencoder for distant-talking speech recognition. EURASIP J. Adv. Signal Process. 2015: 92 (2015) - [c26]Bo Ren, Longbiao Wang, Atsuhiko Kai, Zhaofeng Zhang:
Speech selection and environmental adaptation for asynchronous speech recognition. APSIPA 2015: 119-124 - 2014
- [j6]Zhaofeng Zhang, Longbiao Wang, Atsuhiko Kai:
Distant-talking speaker identification by generalized spectral subtraction-based dereverberation and its efficient computation. EURASIP J. Audio Speech Music. Process. 2014: 15 (2014) - [c25]Longbiao Wang, Bo Ren, Yuma Ueda, Atsuhiko Kai, Shunta Teraoka, Taku Fukushima:
Denoising autoencoder and environment adaptation for distant-talking speech recognition with asynchronous speech recording. APSIPA 2014: 1-5 - [c24]Mitsuaki Makino, Naoki Yamamoto, Atsuhiko Kai:
Utilizing state-level distance vector representation for improved spoken term detection by text and spoken queries. INTERSPEECH 2014: 1732-1736 - [c23]Ikuya Hirano, Kong-Aik Lee, Zhaofeng Zhang, Longbiao Wang, Atsuhiko Kai:
Single-sided approach to discriminative PLDA training for text-independent speaker verification without using expanded i-vector. ISCSLP 2014: 59-63 - [c22]Yuma Ueda, Longbiao Wang, Atsuhiko Kai, Xiong Xiao, Engsiong Chng, Haizhou Li:
Single-channel dereverberation for distant-talking speech recognition by combining denoising autoencoder and temporal structure normalization. ISCSLP 2014: 379-383 - [c21]Satoshi Shiota, Longbiao Wang, Kyohei Odani, Atsuhiko Kai, Weifeng Li:
Distant-talking speech recognition using multi-channel LMS and multiple-step linear prediction. ISCSLP 2014: 384-388 - [c20]Mitsuaki Makino, Atsuhiko Kai:
Combining Subword and State-level Dissimilarity Measures for Improved Spoken Term Detection in NTCIR-11 SpokenQuery&Doc Task. NTCIR 2014 - [c19]Yuta Kawakami, Longbiao Wang, Atsuhiko Kai, Seiichi Nakagawa:
Speaker Identification by Combining Various Vocal Tract and Vocal Source Features. TSD 2014: 382-389 - 2013
- [c18]Longbiao Wang, Kyohei Odani, Atsuhiko Kai, Weifeng Li:
Speech recognition using blind source separation and dereverberation method for mixed sound of speech and music. APSIPA 2013: 1-4 - [c17]Naoki Yamamoto, Atsuhiko Kai:
Using acoustic dissimilarity measures based on state-level distance vector representation for improved spoken term detection. APSIPA 2013: 1-4 - [c16]Longbiao Wang, Zhaofeng Zhang, Atsuhiko Kai:
Hands-free speaker identification based on spectral subtraction using a multi-channel least mean square approach. ICASSP 2013: 7224-7228 - [c15]Takanori Yamada, Longbiao Wang, Atsuhiko Kai:
Improvement of distant-talking speaker identification using bottleneck features of DNN. INTERSPEECH 2013: 3661-3664 - [c14]Naoki Yamamoto, Atsuhiko Kai:
Spoken Term Detection Using Distance-Vector based Dissimilarity Measures and Its Evaluation on the NTCIR-10 SpokenDoc-2 Task. NTCIR 2013 - 2012
- [j5]Longbiao Wang, Kyohei Odani, Atsuhiko Kai:
Dereverberation and denoising based on generalized spectral subtraction by multi-channel LMS algorithm using a small-scale microphone array. EURASIP J. Adv. Signal Process. 2012: 12 (2012) - [c13]Ikuya Hirano, Longbiao Wang, Atsuhiko Kai, Seiichi Nakagawa:
On the use of phase information-based joint factor analysis for speaker verification under channel mismatch condition. APSIPA 2012: 1-4 - [c12]Longbiao Wang, Zhaofeng Zhang, Atsuhiko Kai, Yoshiki Kishi:
Distant-talking speaker identification using a reverberation model with various artificial room impulse responses. APSIPA 2012: 1-4 - [c11]Zhaofeng Zhang, Longbiao Wang, Atsuhiko Kai:
Dereverberantion based on generalized spectral subtraction for distant-talking speaker recognition. APSIPA 2012: 1-4 - [c10]Kyohei Odani, Longbiao Wang, Atsuhiko Kai:
Speech Recognition by Denoising and Dereverberation Based on Spectral Subtraction in a Real Noisy Reverberant Environment. INTERSPEECH 2012: 1251-1254 - 2011
- [c9]Longbiao Wang, Kyohei Odani, Atsuhiko Kai:
Evaluation of Hands-Free Large Vocabulary Continuous Speech Recognition by Blind Dereverberation Based on Spectral Subtraction by Multi-channel LMS Algorithm. TSD 2011: 131-138
2000 – 2009
- 2007
- [j4]Noriki Fujiwara, Toshihiko Itoh, Kenji Araki, Atsuhiko Kai, Tatsuhiro Konishi, Yukihiro Itoh:
Spoken language understanding method using confidence measure and dialogue history. Syst. Comput. Jpn. 38(9): 21-31 (2007) - 2004
- [c8]Toshihiko Itoh, Atsuhiko Kai, Yukihiro Itoh, Tatsuhiro Konishi:
An understanding strategy based on plausibility score in recognition history using CSR confidence measure. INTERSPEECH 2004: 2133-2136 - [p1]Shinichi Kawamoto, Hiroshi Shimodaira, Tsuneo Nitta, Takuya Nishimoto, Satoshi Nakamura, Katsunobu Itou, Shigeo Morishima, Tatsuo Yotsukura, Atsuhiko Kai, Akinobu Lee, Yoichi Yamashita, Takao Kobayashi, Keiichi Tokuda, Keikichi Hirose, Nobuaki Minematsu, Atsushi Yamada, Yasuharu Den, Takehito Utsuro, Shigeki Sagayama:
Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents. Life-like characters 2004: 187-212 - 2002
- [c7]Toshihiko Itoh, Atsuhiko Kai, Tatsuhiro Konishi, Yukihiro Itoh:
Linguistic and acoustic changes of user²s utterances caused by different dialogue situations. INTERSPEECH 2002: 545-548 - [c6]Atsuhiko Kai, Yukari Nonomura, Toshihiko Itoh, Tatsuhiro Konishi, Yukihiro Itoh:
Influence of different dialogue situations on user²s behavior in spoken corrections. INTERSPEECH 2002: 1189-1192 - 2000
- [c5]Atsuhiko Kai, Takahiro Nakano, Seiichi Nakagawa:
Usability of Browser-Based Pen-Touch/Speech User Interfaces for Form-Based Application in Mobile Environment. ICMI 2000: 549-556
1990 – 1999
- 1998
- [j3]Atsuhiko Kai, Seiichi Nakagawa:
Comparison of continuous speech recognition systems with unknown-word processing for speech disfluencies. Syst. Comput. Jpn. 29(9): 43-53 (1998) - [c4]Atsuhiko Kai, Yoshifumi Hirose, Seiichi Nakagawa:
Dealing with out-of-vocabulary words and speech disfluencies in an n-gram based speech understanding system. ICSLP 1998 - 1995
- [j2]Atsuhiko Kai, Seiichi Nakagawa:
Relationship among Recognition Rate, Rejection Rate and False Alarm Rate in a Spoken Word Recognition System. IEICE Trans. Inf. Syst. 78-D(6): 698-704 (1995) - [c3]Atsuhiko Kai, Seiichi Nakagawa:
Investigation on unknown word processing and strategies for spontaneous speech understanding. EUROSPEECH 1995: 2095-2098 - 1994
- [j1]Seiichi Nakagawa, Atsuhiko Kai:
A context-free grammar-driven, one-pass HMM-based continuous speech recognition method. Syst. Comput. Jpn. 25(4): 92-102 (1994) - [c2]Atsuhiko Kai, Seiichi Nakagawa:
Evaluation of unknown word processing in a spoken word recognition system. ICSLP 1994: 2151-2154 - 1992
- [c1]Atsuhiko Kai, Seiichi Nakagawa:
A frame-synchronous continuous speech recognition algorithm using a top-down parsing of context-free grammar. ICSLP 1992: 257-260
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-06-17 00:42 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint