Link to original content: https://dblp.uni-trier.de/pid/36/5709.html

dblp: Atsuhiko Kai

2020 – today

2023
[c37]
persistent URL: https://dblp.org/rec/conf/apsipa/NiimuraTKN23
Yoshiki Niimura, Jun Takemoto, Atsuhiko Kai, Seiichi Nakagawa:
Attention-based CNN and Relative Phase Feature Modeling for Improved Imagined Speech Recognition. APSIPA ASC 2023: 8-14
[c36]
persistent URL: https://dblp.org/rec/conf/interspeech/MiwaK23
Shogo Miwa, Atsuhiko Kai:
Dialect Speech Recognition Modeling using Corpus of Japanese Dialects and Self-Supervised Learning-based Model XLSR. INTERSPEECH 2023: 4928-4932
2022
[j11]
persistent URL: https://dblp.org/rec/journals/sensors/NaharMK22
Raufun Nahar, Shogo Miwa, Atsuhiko Kai:
Domain Adaptation with Augmented Data by Deep Neural Network Based Method Using Re-Recorded Speech for Automatic Speech Recognition in Real Environment. Sensors 22(24): 9945 (2022)
2021
[c35]
persistent URL: https://dblp.org/rec/conf/apsipa/KurokawaK21
Takumi Kurokawa, Atsuhiko Kai:
Retrieval-oriented E2E ASR Modeling for Improved Query-by-example Spoken Term Detection. APSIPA ASC 2021: 1037-1042
[c34]
persistent URL: https://dblp.org/rec/conf/gcce/KurokawaK21
Takumi Kurokawa, Atsuhiko Kai:
Robust Query-by-example Spoken Term Detection for Unknown Words Using Speech Retrieval-oriented E2E ASR Modeling. GCCE 2021: 316-317
[c33]
persistent URL: https://dblp.org/rec/conf/lifetech/SakaiKN21
Ryota Sakai, Atsuhiko Kai, Seiichi Nakagawa:
Classification of Imagined and Heard Speech Using Amplitude Spectrum and Relative Phase of EEG. LifeTech 2021: 373-375
2020
[c32]
persistent URL: https://dblp.org/rec/conf/apsipa/KurokawaKK20
Takumi Kurokawa, Atsuhiko Kai, Hiroki Kondo:
Effects of End-to-end ASR and Score Fusion Model Learning for Improved Query-by-example Spoken Term Detection. APSIPA 2020: 654-661
[c31]
persistent URL: https://dblp.org/rec/conf/gcce/NaharK20
Raufun Nahar, Atsuhiko Kai:
Effect of Data Augmentation on DNN-Based VAD for Automatic Speech Recognition in Noisy Environment. GCCE 2020: 368-372

2010 – 2019

2018
[c30]
persistent URL: https://dblp.org/rec/conf/gcce/NaharKK18
Raufun Nahar, Takashi Kawai, Atsuhiko Kai:
Multi-Condition Training of Denoising Autoencoder by Augmenting Simulated Reverberant Speech Data. GCCE 2018: 334-338
2017
[c29]
persistent URL: https://dblp.org/rec/conf/gcce/TeradaTK17
Yuji Terada, Kenta Tamiya, Atsuhiko Kai:
Investigation of efficient semi-automatic correction method using STD for automatic captioning. GCCE 2017: 1-2
2016
[j10]
persistent URL: https://dblp.org/rec/journals/mta/RenWLUK16
Bo Ren, Longbiao Wang, Liang Lu, Yuma Ueda, Atsuhiko Kai:
Combination of bottleneck feature extraction and dereverberation for distant-talking speech recognition. Multim. Tools Appl. 75(9): 5093-5108 (2016)
[j9]
persistent URL: https://dblp.org/rec/journals/vlsisp/UedaWKXCL16
Yuma Ueda, Longbiao Wang, Atsuhiko Kai, Xiong Xiao, Engsiong Chng, Haizhou Li:
Single-channel Dereverberation for Distant-Talking Speech Recognition by Combining Denoising Autoencoder and Temporal Structure Normalization. J. Signal Process. Syst. 82(2): 151-161 (2016)
[c28]
persistent URL: https://dblp.org/rec/conf/interspeech/OishiMMK16
Shuji Oishi, Tatsuya Matsuba, Mitsuaki Makino, Atsuhiko Kai:
Combining State-Level Spotting and Posterior-Based Acoustic Match for Improved Query-by-Example Spoken Term Detection. INTERSPEECH 2016: 740-744
[c27]
persistent URL: https://dblp.org/rec/conf/ntcir/OishiMMK16
Shuji Oishi, Tatsuya Matsuba, Mitsuaki Makino, Atsuhiko Kai:
Combining State-level and DNN-based Acoustic Matches for Efficient Spoken Term Detection in NTCIR-12 SpokenQuery&Doc-2 Task. NTCIR 2016
2015
[j8]
persistent URL: https://dblp.org/rec/journals/ejasmp/ZhangWKYLI15
Zhaofeng Zhang, Longbiao Wang, Atsuhiko Kai, Takanori Yamada, Weifeng Li, Masahiro Iwahashi:
Deep neural network-based bottleneck feature and denoising autoencoder-based dereverberation for distant-talking speaker identification. EURASIP J. Audio Speech Music. Process. 2015: 12 (2015)
[j7]
persistent URL: https://dblp.org/rec/journals/ejasp/UedaWKR15
Yuma Ueda, Longbiao Wang, Atsuhiko Kai, Bo Ren:
Environment-dependent denoising autoencoder for distant-talking speech recognition. EURASIP J. Adv. Signal Process. 2015: 92 (2015)
[c26]
persistent URL: https://dblp.org/rec/conf/apsipa/RenWKZ15
Bo Ren, Longbiao Wang, Atsuhiko Kai, Zhaofeng Zhang:
Speech selection and environmental adaptation for asynchronous speech recognition. APSIPA 2015: 119-124
2014
[j6]
persistent URL: https://dblp.org/rec/journals/ejasmp/ZhangWK14
Zhaofeng Zhang, Longbiao Wang, Atsuhiko Kai:
Distant-talking speaker identification by generalized spectral subtraction-based dereverberation and its efficient computation. EURASIP J. Audio Speech Music. Process. 2014: 15 (2014)
[c25]
persistent URL: https://dblp.org/rec/conf/apsipa/WangRUKTF14
Longbiao Wang, Bo Ren, Yuma Ueda, Atsuhiko Kai, Shunta Teraoka, Taku Fukushima:
Denoising autoencoder and environment adaptation for distant-talking speech recognition with asynchronous speech recording. APSIPA 2014: 1-5
[c24]
persistent URL: https://dblp.org/rec/conf/interspeech/MakinoYK14
Mitsuaki Makino, Naoki Yamamoto, Atsuhiko Kai:
Utilizing state-level distance vector representation for improved spoken term detection by text and spoken queries. INTERSPEECH 2014: 1732-1736
[c23]
persistent URL: https://dblp.org/rec/conf/iscslp/HiranoLZWK14
Ikuya Hirano, Kong-Aik Lee, Zhaofeng Zhang, Longbiao Wang, Atsuhiko Kai:
Single-sided approach to discriminative PLDA training for text-independent speaker verification without using expanded i-vector. ISCSLP 2014: 59-63
[c22]
persistent URL: https://dblp.org/rec/conf/iscslp/UedaWKXCL14
Yuma Ueda, Longbiao Wang, Atsuhiko Kai, Xiong Xiao, Engsiong Chng, Haizhou Li:
Single-channel dereverberation for distant-talking speech recognition by combining denoising autoencoder and temporal structure normalization. ISCSLP 2014: 379-383
[c21]
persistent URL: https://dblp.org/rec/conf/iscslp/ShiotaWOKL14
Satoshi Shiota, Longbiao Wang, Kyohei Odani, Atsuhiko Kai, Weifeng Li:
Distant-talking speech recognition using multi-channel LMS and multiple-step linear prediction. ISCSLP 2014: 384-388
[c20]
persistent URL: https://dblp.org/rec/conf/ntcir/MakinoK14
Mitsuaki Makino, Atsuhiko Kai:
Combining Subword and State-level Dissimilarity Measures for Improved Spoken Term Detection in NTCIR-11 SpokenQuery&Doc Task. NTCIR 2014
[c19]
persistent URL: https://dblp.org/rec/conf/tsd/KawakamiWKN14
Yuta Kawakami, Longbiao Wang, Atsuhiko Kai, Seiichi Nakagawa:
Speaker Identification by Combining Various Vocal Tract and Vocal Source Features. TSD 2014: 382-389
2013
[c18]
persistent URL: https://dblp.org/rec/conf/apsipa/WangOKL13
Longbiao Wang, Kyohei Odani, Atsuhiko Kai, Weifeng Li:
Speech recognition using blind source separation and dereverberation method for mixed sound of speech and music. APSIPA 2013: 1-4
[c17]
persistent URL: https://dblp.org/rec/conf/apsipa/YamamotoK13
Naoki Yamamoto, Atsuhiko Kai:
Using acoustic dissimilarity measures based on state-level distance vector representation for improved spoken term detection. APSIPA 2013: 1-4
[c16]
persistent URL: https://dblp.org/rec/conf/icassp/WangZK13
Longbiao Wang, Zhaofeng Zhang, Atsuhiko Kai:
Hands-free speaker identification based on spectral subtraction using a multi-channel least mean square approach. ICASSP 2013: 7224-7228
[c15]
persistent URL: https://dblp.org/rec/conf/interspeech/YamadaWK13
Takanori Yamada, Longbiao Wang, Atsuhiko Kai:
Improvement of distant-talking speaker identification using bottleneck features of DNN. INTERSPEECH 2013: 3661-3664
[c14]
persistent URL: https://dblp.org/rec/conf/ntcir/YamamotoK13
Naoki Yamamoto, Atsuhiko Kai:
Spoken Term Detection Using Distance-Vector based Dissimilarity Measures and Its Evaluation on the NTCIR-10 SpokenDoc-2 Task. NTCIR 2013
2012
[j5]
persistent URL: https://dblp.org/rec/journals/ejasp/WangOK12
Longbiao Wang, Kyohei Odani, Atsuhiko Kai:
Dereverberation and denoising based on generalized spectral subtraction by multi-channel LMS algorithm using a small-scale microphone array. EURASIP J. Adv. Signal Process. 2012: 12 (2012)
[c13]
persistent URL: https://dblp.org/rec/conf/apsipa/HiranoWKN12
Ikuya Hirano, Longbiao Wang, Atsuhiko Kai, Seiichi Nakagawa:
On the use of phase information-based joint factor analysis for speaker verification under channel mismatch condition. APSIPA 2012: 1-4
[c12]
persistent URL: https://dblp.org/rec/conf/apsipa/WangZKK12
Longbiao Wang, Zhaofeng Zhang, Atsuhiko Kai, Yoshiki Kishi:
Distant-talking speaker identification using a reverberation model with various artificial room impulse responses. APSIPA 2012: 1-4
[c11]
persistent URL: https://dblp.org/rec/conf/apsipa/ZhangWK12
Zhaofeng Zhang, Longbiao Wang, Atsuhiko Kai:
Dereverberation based on generalized spectral subtraction for distant-talking speaker recognition. APSIPA 2012: 1-4
[c10]
persistent URL: https://dblp.org/rec/conf/interspeech/OdaniWK12
Kyohei Odani, Longbiao Wang, Atsuhiko Kai:
Speech Recognition by Denoising and Dereverberation Based on Spectral Subtraction in a Real Noisy Reverberant Environment. INTERSPEECH 2012: 1251-1254
2011
[c9]
persistent URL: https://dblp.org/rec/conf/tsd/WangOK11
Longbiao Wang, Kyohei Odani, Atsuhiko Kai:
Evaluation of Hands-Free Large Vocabulary Continuous Speech Recognition by Blind Dereverberation Based on Spectral Subtraction by Multi-channel LMS Algorithm. TSD 2011: 131-138

2000 – 2009

2007
[j4]
persistent URL: https://dblp.org/rec/journals/scjapan/FujiwaraIAKKI07
Noriki Fujiwara, Toshihiko Itoh, Kenji Araki, Atsuhiko Kai, Tatsuhiro Konishi, Yukihiro Itoh:
Spoken language understanding method using confidence measure and dialogue history. Syst. Comput. Jpn. 38(9): 21-31 (2007)
2004
[c8]
persistent URL: https://dblp.org/rec/conf/interspeech/ItohKIK04
Toshihiko Itoh, Atsuhiko Kai, Yukihiro Itoh, Tatsuhiro Konishi:
An understanding strategy based on plausibility score in recognition history using CSR confidence measure. INTERSPEECH 2004: 2133-2136
[p1]
persistent URL: https://dblp.org/rec/series/cogtech/KawamotoSNNNIMYKLYKTHMYDUS04
Shinichi Kawamoto, Hiroshi Shimodaira, Tsuneo Nitta, Takuya Nishimoto, Satoshi Nakamura, Katsunobu Itou, Shigeo Morishima, Tatsuo Yotsukura, Atsuhiko Kai, Akinobu Lee, Yoichi Yamashita, Takao Kobayashi, Keiichi Tokuda, Keikichi Hirose, Nobuaki Minematsu, Atsushi Yamada, Yasuharu Den, Takehito Utsuro, Shigeki Sagayama:
Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents. Life-like characters 2004: 187-212
2002
[c7]
persistent URL: https://dblp.org/rec/conf/interspeech/ItohKKI02
Toshihiko Itoh, Atsuhiko Kai, Tatsuhiro Konishi, Yukihiro Itoh:
Linguistic and acoustic changes of user's utterances caused by different dialogue situations. INTERSPEECH 2002: 545-548
[c6]
persistent URL: https://dblp.org/rec/conf/interspeech/KaiNIKI02
Atsuhiko Kai, Yukari Nonomura, Toshihiko Itoh, Tatsuhiro Konishi, Yukihiro Itoh:
Influence of different dialogue situations on user's behavior in spoken corrections. INTERSPEECH 2002: 1189-1192
2000
[c5]
persistent URL: https://dblp.org/rec/conf/icmi/KaiNN00
Atsuhiko Kai, Takahiro Nakano, Seiichi Nakagawa:
Usability of Browser-Based Pen-Touch/Speech User Interfaces for Form-Based Application in Mobile Environment. ICMI 2000: 549-556

1990 – 1999

1998
[j3]
persistent URL: https://dblp.org/rec/journals/scjapan/KaiN98
Atsuhiko Kai, Seiichi Nakagawa:
Comparison of continuous speech recognition systems with unknown-word processing for speech disfluencies. Syst. Comput. Jpn. 29(9): 43-53 (1998)
[c4]
persistent URL: https://dblp.org/rec/conf/interspeech/KaiHN98
Atsuhiko Kai, Yoshifumi Hirose, Seiichi Nakagawa:
Dealing with out-of-vocabulary words and speech disfluencies in an n-gram based speech understanding system. ICSLP 1998
1995
[j2]
persistent URL: https://dblp.org/rec/journals/ieicet/KaiN95
Atsuhiko Kai, Seiichi Nakagawa:
Relationship among Recognition Rate, Rejection Rate and False Alarm Rate in a Spoken Word Recognition System. IEICE Trans. Inf. Syst. 78-D(6): 698-704 (1995)
[c3]
persistent URL: https://dblp.org/rec/conf/interspeech/KaiN95
Atsuhiko Kai, Seiichi Nakagawa:
Investigation on unknown word processing and strategies for spontaneous speech understanding. EUROSPEECH 1995: 2095-2098
1994
[j1]
persistent URL: https://dblp.org/rec/journals/scjapan/NakagawaK94
Seiichi Nakagawa, Atsuhiko Kai:
A context-free grammar-driven, one-pass HMM-based continuous speech recognition method. Syst. Comput. Jpn. 25(4): 92-102 (1994)
[c2]
persistent URL: https://dblp.org/rec/conf/interspeech/KaiN94
Atsuhiko Kai, Seiichi Nakagawa:
Evaluation of unknown word processing in a spoken word recognition system. ICSLP 1994: 2151-2154
1992
[c1]
persistent URL: https://dblp.org/rec/conf/interspeech/KaiN92
Atsuhiko Kai, Seiichi Nakagawa:
A frame-synchronous continuous speech recognition algorithm using a top-down parsing of context-free grammar. ICSLP 1992: 257-260
