Taichi Asami
2020 – today
- 2024
- [c40] Takanori Ashihara, Marc Delcroix, Takafumi Moriya, Kohei Matsuura, Taichi Asami, Yusuke Ijima:
What Do Self-Supervised Speech and Speaker Models Learn? New Findings from a Cross Model Layer-Wise Analysis. ICASSP 2024: 10166-10170
- [i3] Takanori Ashihara, Marc Delcroix, Takafumi Moriya, Kohei Matsuura, Taichi Asami, Yusuke Ijima:
What Do Self-Supervised Speech and Speaker Models Learn? New Findings From a Cross Model Layer-Wise Analysis. CoRR abs/2401.17632 (2024)
- [i2] Takafumi Moriya, Takanori Ashihara, Masato Mimura, Hiroshi Sato, Kohei Matsuura, Ryo Masumura, Taichi Asami:
Boosting Hybrid Autoregressive Transducer-based ASR with Internal Acoustic Model Training and Dual Blank Thresholding. CoRR abs/2409.20313 (2024)
- 2023
- [c39] Yuya Hikima, Yasunori Akagi, Hideaki Kim, Taichi Asami:
An Improved Approximation Algorithm for Wage Determination and Online Task Allocation in Crowd-Sourcing. AAAI 2023: 3977-3986
- [c38] Yuki Kitagishi, Hosana Kamiyama, Naohiro Tawara, Atsunori Ogawa, Noboru Miyazaki, Taichi Asami:
Coarse-Age Loss: A New Training Method Using Coarse-Age Labeled Data for Speaker Age Estimation. APSIPA ASC 2023: 2213-2220
- [c37] Takafumi Moriya, Hiroshi Sato, Tsubasa Ochiai, Marc Delcroix, Takanori Ashihara, Kohei Matsuura, Tomohiro Tanaka, Ryo Masumura, Atsunori Ogawa, Taichi Asami:
Knowledge Distillation for Neural Transducer-based Target-Speaker ASR: Exploiting Parallel Mixture/Single-Talker Speech Data. INTERSPEECH 2023: 899-903
- [c36] Yuki Kitagishi, Naohiro Tawara, Atsunori Ogawa, Ryo Masumura, Taichi Asami:
What are differences? Comparing DNN and Human by Their Performance and Characteristics in Speaker Age Estimation. INTERSPEECH 2023: 1873-1877
- [c35] Takanori Ashihara, Takafumi Moriya, Kohei Matsuura, Tomohiro Tanaka, Yusuke Ijima, Taichi Asami, Marc Delcroix, Yukinori Honma:
SpeechGLUE: How Well Can Self-Supervised Speech Models Capture Linguistic Knowledge? INTERSPEECH 2023: 2888-2892
- [i1] Takanori Ashihara, Takafumi Moriya, Kohei Matsuura, Tomohiro Tanaka, Yusuke Ijima, Taichi Asami, Marc Delcroix, Yukinori Honma:
SpeechGLUE: How Well Can Self-Supervised Speech Models Capture Linguistic Knowledge? CoRR abs/2306.08374 (2023)
- 2022
- [c34] Hideaki Kim, Taichi Asami, Hiroyuki Toda:
Fast Bayesian Estimation of Point Process Intensity as Function of Covariates. NeurIPS 2022
- 2021
- [j12] Toshiaki Nishio, Yuichiro Yoshikawa, Kazuki Sakai, Takamasa Iio, Mariko Chiba, Taichi Asami, Yoshinori Isoda, Hiroshi Ishiguro:
The Effects of Physically Embodied Multiple Conversation Robots on the Elderly. Frontiers Robotics AI 8: 633045 (2021)
- [j11] Ryo Masumura, Taichi Asami, Takanobu Oba, Sumitaka Sakauchi:
Hierarchical Latent Words Language Models for Automatic Speech Recognition. J. Inf. Process. 29: 360-369 (2021)
- [c33] Takafumi Moriya, Tomohiro Tanaka, Takanori Ashihara, Tsubasa Ochiai, Hiroshi Sato, Atsushi Ando, Ryo Masumura, Marc Delcroix, Taichi Asami:
Streaming End-to-End Speech Recognition for Hybrid RNN-T/Attention Architecture. Interspeech 2021: 1787-1791
2010 – 2019
- 2019
- [j10] Taichi Asami, Ryo Masumura, Yushi Aono, Koichi Shinoda:
Recurrent out-of-vocabulary word detection based on distribution of features. Comput. Speech Lang. 58: 247-259 (2019)
- [j9] Ryo Masumura, Taichi Asami, Takanobu Oba, Sumitaka Sakauchi, Akinori Ito:
Latent Words Recurrent Neural Network Language Models for Automatic Speech Recognition. IEICE Trans. Inf. Syst. 102-D(12): 2557-2567 (2019)
- [j8] Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi:
Viterbi Approximation of Latent Words Language Models for Automatic Speech Recognition. J. Inf. Process. 27: 168-176 (2019)
- 2018
- [j7] Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito:
Domain Adaptation Based on Mixture of Latent Words Language Models for Automatic Speech Recognition. IEICE Trans. Inf. Syst. 101-D(6): 1581-1590 (2018)
- [c32] Takafumi Moriya, Ryo Masumura, Taichi Asami, Yusuke Shinohara, Marc Delcroix, Yoshikazu Yamaguchi, Yushi Aono:
Progressive Neural Network-based Knowledge Transfer in Acoustic Models. APSIPA 2018: 998-1002
- [c31] Ryo Masumura, Yusuke Ijima, Taichi Asami, Hirokazu Masataki, Ryuichiro Higashinaka:
Neural Confnet Classification: Fully Neural Network Based Spoken Utterance Classification Using Word Confusion Networks. ICASSP 2018: 6039-6043
- 2017
- [c30] Ryo Masumura, Taichi Asami, Hirokazu Masataki, Yushi Aono:
Joint unsupervised adaptation of n-gram and RNN language models via LDA-based hybrid mixture modeling. APSIPA 2017: 1588-1591
- [c29] Go Irie, Taichi Asami, Shuhei Tarashima, Takayuki Kurozumi, Tetsuya Kinebuchi:
Cross-modal transfer with neural word vectors for image feature learning. ICASSP 2017: 2916-2920
- [c28] Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Atsunori Ogawa, Taichi Asami, Shigeru Katagiri, Tomohiro Nakatani:
Cumulative moving averaged bottleneck speaker vectors for online speaker adaptation of CNN-based acoustic models. ICASSP 2017: 5175-5179
- [c27] Taichi Asami, Ryo Masumura, Yoshikazu Yamaguchi, Hirokazu Masataki, Yushi Aono:
Domain adaptation of DNN acoustic models using knowledge distillation. ICASSP 2017: 5185-5189
- [c26] Ryo Masumura, Taichi Asami, Hirokazu Masataki, Yushi Aono:
Parallel phonetically aware DNNs and LSTM-RNNS for frame-by-frame discriminative modeling of spoken language identification. ICASSP 2017: 5260-5264
- [c25] Ryo Masumura, Taichi Asami, Hirokazu Masataki, Kugatsu Sadamitsu, Kyosuke Nishida, Ryuichiro Higashinaka:
Hyperspherical Query Likelihood Models with Word Embeddings. IJCNLP(2) 2017: 210-216
- [c24] Yusuke Ijima, Nobukatsu Hojo, Ryo Masumura, Taichi Asami:
Prosody Aware Word-Level Encoder Based on BLSTM-RNNs for DNN-Based Speech Synthesis. INTERSPEECH 2017: 764-768
- [c23] Ryo Masumura, Taichi Asami, Hirokazu Masataki, Ryo Ishii, Ryuichiro Higashinaka:
Online End-of-Turn Detection from Speech Based on Stacked Time-Asynchronous Sequential Networks. INTERSPEECH 2017: 1661-1665
- 2016
- [j6] Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito:
Investigation of Combining Various Major Language Model Technologies including Data Expansion and Adaptation. IEICE Trans. Inf. Syst. 99-D(10): 2452-2461 (2016)
- [j5] Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Satoshi Takahashi:
N-gram Approximation of Latent Words Language Models for Domain Robust Automatic Speech Recognition. IEICE Trans. Inf. Syst. 99-D(10): 2462-2470 (2016)
- [c22] Atsushi Ando, Taichi Asami, Yoshikazu Yamaguchi, Yushi Aono:
Speaker recognition in duration-mismatched condition using bootstrapped i-vectors. APSIPA 2016: 1-4
- [c21] Yusuke Ijima, Taichi Asami, Hideyuki Mizuno:
Objective Evaluation Using Association Between Dimensions Within Spectral Features for Statistical Parametric Speech Synthesis. INTERSPEECH 2016: 337-341
- [c20] Taichi Asami, Ryo Masumura, Yushi Aono, Koichi Shinoda:
Recurrent Out-of-Vocabulary Word Detection Using Distribution of Features. INTERSPEECH 2016: 1320-1324
- [c19] Ryo Masumura, Taichi Asami, Hirokazu Masataki, Yushi Aono, Sumitaka Sakauchi:
Language Identification Based on Generative Modeling of Posteriorgram Sequences Extracted from Frame-by-Frame DNNs and LSTM-RNNs. INTERSPEECH 2016: 3275-3279
- 2015
- [c18] Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito:
Hierarchical Latent Words Language Models for Robust Modeling to Out-Of Domain Tasks. EMNLP 2015: 1896-1901
- [c17] Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito:
Combinations of various language model technologies including data expansion and adaptation in spontaneous speech recognition. INTERSPEECH 2015: 463-467
- [c16] Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi, Akinori Ito:
Latent words recurrent neural network language models. INTERSPEECH 2015: 2380-2384
- [c15] Atsushi Ando, Taichi Asami, Manabu Okamoto, Hirokazu Masataki, Sumitaka Sakauchi:
Agreement and disagreement utterance detection in conversational speech by extracting and integrating local features. INTERSPEECH 2015: 2494-2498
- [c14] Taichi Asami, Ryo Masumura, Hirokazu Masataki, Manabu Okamoto, Sumitaka Sakauchi:
Training data selection for acoustic modeling via submodular optimization of joint kullback-leibler divergence. INTERSPEECH 2015: 3645-3649
- 2014
- [j4] Satoshi Kobashikawa, Taichi Asami, Yoshikazu Yamaguchi, Hirokazu Masataki, Satoshi Takahashi:
Efficient data selection for speech recognition based on prior confidence estimation using speech and monophone models. Comput. Speech Lang. 28(6): 1287-1297 (2014)
- [c13] Ryo Masumura, Taichi Asami, Takanobu Oba, Hirokazu Masataki, Sumitaka Sakauchi:
Mixture of latent words language models for domain adaptation. INTERSPEECH 2014: 1425-1429
- [c12] Taichi Asami, Ryo Masumura, Hirokazu Masataki, Sumitaka Sakauchi:
Read and spontaneous speech classification based on variance of GMM supervectors. INTERSPEECH 2014: 2375-2379
- 2013
- [j3] Satoshi Kobashikawa, Atsunori Ogawa, Taichi Asami, Yoshikazu Yamaguchi, Hirokazu Masataki, Satoshi Takahashi:
Fast unsupervised adaptation based on efficient statistics accumulation using frame independent confidence within monophone states. Comput. Speech Lang. 27(1): 369-379 (2013)
- [j2] Tomoko Izumi, Kenji Imamura, Taichi Asami, Kuniko Saito, Gen-ichiro Kikui, Satoshi Sato:
Normalizing Complex Functional Expressions in Japanese Predicates: Linguistically-Directed Rule-Based Paraphrasing and Its Application. ACM Trans. Asian Lang. Inf. Process. 12(3): 11:1-11:20 (2013)
- [c11] Taichi Asami, Satoshi Kobashikawa, Hirokazu Masataki, Osamu Yoshioka, Satoshi Takahashi:
Unsupervised confidence calibration using examples of recognized words and their contexts. INTERSPEECH 2013: 2217-2221
- 2012
- [c10] Satoshi Kobashikawa, Takaaki Hori, Yoshikazu Yamaguchi, Taichi Asami, Hirokazu Masataki, Satoshi Takahashi:
Efficient Beam Width Control to Suppress Excessive Speech Recognition Computation Time Based on Prior Score Range Normalization. INTERSPEECH 2012: 1011-1014
- [c9] Taichi Asami, Satoshi Kobashikawa, Hirokazu Masataki, Osamu Yoshioka, Satoshi Takahashi:
Speech Data Clustering Based on Phoneme Error Trend for Unsupervised Acoustic Model Adaptation. INTERSPEECH 2012: 1760-1763
- [c8] Satoshi Kobashikawa, Takaaki Hori, Yoshikazu Yamaguchi, Taichi Asami, Hirokazu Masataki, Satoshi Takahashi:
Efficient prior and incremental beam width control to suppress excessive speech recognition time based on score range estimation. SLT 2012: 125-130
- 2011
- [c7] Takaaki Fukutomi, Satoshi Kobashikawa, Taichi Asami, Tsubasa Shinozaki, Hirokazu Masataki, Satoshi Takahashi:
Extracting call-reason segments from contact center dialogs by using automatically acquired boundary expressions. ICASSP 2011: 5584-5587
- [c6] Taichi Asami, Narichika Nomoto, Satoshi Kobashikawa, Yoshikazu Yamaguchi, Hirokazu Masataki, Satoshi Takahashi:
Spoken Document Confidence Estimation Using Contextual Coherence. INTERSPEECH 2011: 1961-1964
- 2010
- [c5] Satoshi Kobashikawa, Taichi Asami, Yoshikazu Yamaguchi, Hirokazu Masataki, Satoshi Takahashi:
Efficient data selection for speech recognition based on prior confidence estimation using speech and context independent models. INTERSPEECH 2010: 238-241
- [c4] Satoshi Kobashikawa, Taichi Asami, Yoshikazu Yamaguchi, Hirokazu Masataki, Satoshi Takahashi:
Efficient data selection for spoken document retrieval based on prior confidence estimation using speech and context independent models. SLT 2010: 200-205
2000 – 2009
- 2008
- [j1] Taichi Asami, Koji Iwano, Sadaoki Furui:
Evaluation of a Noise-Robust Multi-Stream Speaker Verification Method Using F0 Information. IEICE Trans. Inf. Syst. 91-D(3): 549-557 (2008)
- 2006
- [c3] Taichi Asami, Koji Iwano, Sadaoki Furui:
A Stream-Weight and Threshold Estimation Method Using Adaboost for Multi-Stream Speaker Verification. ICASSP (5) 2006: 1081-1084
- 2005
- [c2] Taichi Asami, Koji Iwano, Sadaoki Furui:
Stream-weight optimization by LDA and adaboost for multi-stream speaker verification. INTERSPEECH 2005: 2185-2188
- 2004
- [c1] Koji Iwano, Taichi Asami, Sadaoki Furui:
Noise-robust speaker verification using F0 features. INTERSPEECH 2004: 1417-1420
last updated on 2024-10-22 20:15 CEST by the dblp team
all metadata released as open data under CC0 1.0 license