default search action

combined dblp search
author search
venue search
publication search

ask others

Takaaki Hori

> Home > Persons

Person information

Other persons with a similar name

see FAQ

Why are some names followed by a four digit number?

SPARQL queries

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j24]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/PrabhavalkarHSSW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/PrabhavalkarHSSW24
Rohit Prabhavalkar, Takaaki Hori, Tara N. Sainath, Ralf Schlüter, Shinji Watanabe:
End-to-End Speech Recognition: A Survey. IEEE ACM Trans. Audio Speech Lang. Process. 32: 325-351 (2024)
2023
[c108]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SwietojanskiBCSGHHMMSTZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SwietojanskiBCSGHHMMSTZ23
Pawel Swietojanski, Stefan Braun, Dogan Can, Thiago Fraga da Silva, Arnab Ghoshal, Takaaki Hori, Roger Hsiao, Henry Mason, Erik McDermott, Honza Silovsky, Ruchir Travadi, Xiaodan Zhuang:
Variable Attention Masking for Configurable Transformer Transducer Speech Recognition. ICASSP 2023: 1-5
[i39]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-03329
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-03329
Rohit Prabhavalkar, Takaaki Hori, Tara N. Sainath, Ralf Schlüter, Shinji Watanabe:
End-to-End Speech Recognition: A Survey. CoRR abs/2303.03329 (2023)
2022
[j23]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/jstsp/HiguchiMRH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/HiguchiMRH22
Yosuke Higuchi, Niko Moritz, Jonathan Le Roux, Takaaki Hori:
Momentum Pseudo-Labeling: Semi-Supervised ASR With Continuously Improving Pseudo-Labels. IEEE J. Sel. Top. Signal Process. 16(6): 1424-1438 (2022)
[c107]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MoritzHWR22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MoritzHWR22
Niko Moritz, Takaaki Hori, Shinji Watanabe, Jonathan Le Roux:
Sequence Transduction with Graph-Based Supervision. ICASSP 2022: 7212-7216
[c106]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChangMHWR22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChangMHWR22
Xuankai Chang, Niko Moritz, Takaaki Hori, Shinji Watanabe, Jonathan Le Roux:
Extended Graph Temporal Classification for Multi-Speaker End-to-End ASR. ICASSP 2022: 7322-7326
[c105]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HiguchiMRH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HiguchiMRH22
Yosuke Higuchi, Niko Moritz, Jonathan Le Roux, Takaaki Hori:
Advancing Momentum Pseudo-Labeling with Conformer and Initialization Strategy. ICASSP 2022: 7672-7676
[c104]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ShahGGCHMRH22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ShahGGCHMRH22
Ankit P. Shah, Shijie Geng, Peng Gao, Anoop Cherian, Takaaki Hori, Tim K. Marks, Jonathan Le Roux, Chiori Hori:
Audio-Visual Scene-Aware Dialog and Reasoning Using Audio-Visual Transformers with Joint Student-Teacher Learning. ICASSP 2022: 7732-7736
[c103]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HoriHR22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HoriHR22
Chiori Hori, Takaaki Hori, Jonathan Le Roux:
Low-Latency Online Streaming VideoQA Using Audio-Visual Transformers. INTERSPEECH 2022: 4511-4515
[i38]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-00232
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-00232
Xuankai Chang, Niko Moritz, Takaaki Hori, Shinji Watanabe, Jonathan Le Roux:
Extended Graph Temporal Classification for Multi-Speaker End-to-End ASR. CoRR abs/2203.00232 (2022)
[i37]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-01438
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-01438
Pawel Swietojanski, Stefan Braun, Dogan Can, Thiago Fraga da Silva, Arnab Ghoshal, Takaaki Hori, Roger Hsiao, Henry Mason, Erik McDermott, Honza Silovsky, Ruchir Travadi, Xiaodan Zhuang:
Variable Attention Masking for Configurable Transformer Transducer Speech Recognition. CoRR abs/2211.01438 (2022)
2021
[c102]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MoritzHR21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MoritzHR21
Niko Moritz, Takaaki Hori, Jonathan Le Roux:
Capturing Multi-Resolution Context by Dilated Self-Attention. ICASSP 2021: 5869-5873
[c101]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MoritzHR21a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MoritzHR21a
Niko Moritz, Takaaki Hori, Jonathan Le Roux:
Semi-Supervised Speech Recognition Via Graph-Based Temporal Classification. ICASSP 2021: 6548-6552
[c100]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KhuranaMHR21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KhuranaMHR21
Sameer Khurana, Niko Moritz, Takaaki Hori, Jonathan Le Roux:
Unsupervised Domain Adaptation for Speech Recognition via Uncertainty Driven Self-Training. ICASSP 2021: 6553-6557
[c99]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HoriHR21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HoriHR21
Chiori Hori, Takaaki Hori, Jonathan Le Roux:
Optimizing Latency for Online Video Captioning Using Audio-Visual Transformers. Interspeech 2021: 586-590
[c98]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HiguchiMRH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HiguchiMRH21
Yosuke Higuchi, Niko Moritz, Jonathan Le Roux, Takaaki Hori:
Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition. Interspeech 2021: 726-730
[c97]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MoritzHR21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MoritzHR21
Niko Moritz, Takaaki Hori, Jonathan Le Roux:
Dual Causal/Non-Causal Self-Attention for Streaming End-to-End Speech Recognition. Interspeech 2021: 1822-1826
[c96]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HoriMHR21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HoriMHR21
Takaaki Hori, Niko Moritz, Chiori Hori, Jonathan Le Roux:
Advanced Long-Context End-to-End Speech Recognition Using Context-Expanded Transformers. Interspeech 2021: 2097-2101
[i36]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2104-02858
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-02858
Niko Moritz, Takaaki Hori, Jonathan Le Roux:
Capturing Multi-Resolution Context by Dilated Self-Attention. CoRR abs/2104.02858 (2021)
[i35]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2104-09426
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-09426
Takaaki Hori, Niko Moritz, Chiori Hori, Jonathan Le Roux:
Advanced Long-context End-to-end Speech Recognition Using Context-expanded Transformers. CoRR abs/2104.09426 (2021)
[i34]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2106-08922
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-08922
Yosuke Higuchi, Niko Moritz, Jonathan Le Roux, Takaaki Hori:
Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition. CoRR abs/2106.08922 (2021)
[i33]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2107-01269
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-01269
Niko Moritz, Takaaki Hori, Jonathan Le Roux:
Dual Causal/Non-Causal Self-Attention for Streaming End-to-End Speech Recognition. CoRR abs/2107.01269 (2021)
[i32]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2108-02147
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-02147
Chiori Hori, Takaaki Hori, Jonathan Le Roux:
Optimizing Latency for Online Video CaptioningUsing Audio-Visual Transformers. CoRR abs/2108.02147 (2021)
[i31]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2110-04948
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-04948
Yosuke Higuchi, Niko Moritz, Jonathan Le Roux, Takaaki Hori:
Advancing Momentum Pseudo-Labeling with Conformer and Initialization Strategy. CoRR abs/2110.04948 (2021)
[i30]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2110-06894
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-06894
Ankit P. Shah, Shijie Geng, Peng Gao, Anoop Cherian, Takaaki Hori, Tim K. Marks, Jonathan Le Roux, Chiori Hori:
Audio-Visual Scene-Aware Dialog and Reasoning using Audio-Visual Transformers with Joint Student-Teacher Learning. CoRR abs/2110.06894 (2021)
[i29]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2111-01272
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-01272
Niko Moritz, Takaaki Hori, Shinji Watanabe, Jonathan Le Roux:
Sequence Transduction with Graph-based Supervision. CoRR abs/2111.01272 (2021)
2020
[j22]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LiWMWHH20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LiWMWHH20
Ruizhi Li, Xiaofei Wang, Sri Harish Mallidi, Shinji Watanabe, Takaaki Hori, Hynek Hermansky:
Multi-Stream End-to-End Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 28: 646-655 (2020)
[c95]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MoritzHR20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MoritzHR20
Niko Moritz, Takaaki Hori, Jonathan Le Roux:
Streaming Automatic Speech Recognition with the Transformer Model. ICASSP 2020: 6074-6078
[c94]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SariMHR20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SariMHR20
Leda Sari, Niko Moritz, Takaaki Hori, Jonathan Le Roux:
Unsupervised Speaker Adaptation Using Attention-Based Speaker Memory for End-to-End ASR. ICASSP 2020: 7384-7388
[c93]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MoritzWHR20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MoritzWHR20
Niko Moritz, Gordon Wichern, Takaaki Hori, Jonathan Le Roux:
All-in-One Transformer: Unifying Speech Recognition, Audio Tagging, and Event Detection. INTERSPEECH 2020: 3112-3116
[c92]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HoriMHR20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HoriMHR20
Takaaki Hori, Niko Moritz, Chiori Hori, Jonathan Le Roux:
Transformer-Based Long-Context End-to-End Speech Recognition. INTERSPEECH 2020: 5011-5015
[i28]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2001-02674
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-02674
Niko Moritz, Takaaki Hori, Jonathan Le Roux:
Streaming automatic speech recognition with the transformer model. CoRR abs/2001.02674 (2020)
[i27]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2002-06165
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-06165
Leda Sari, Niko Moritz, Takaaki Hori, Jonathan Le Roux:
Unsupervised Speaker Adaptation using Attention-based Speaker Memory for End-to-End ASR. CoRR abs/2002.06165 (2020)
[i26]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2009-11382
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-11382
Peng Gao, Chiori Hori, Shijie Geng, Takaaki Hori, Jonathan Le Roux:
Multi-Pass Transformer for Machine Translation. CoRR abs/2009.11382 (2020)
[i25]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2010-15653
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-15653
Niko Moritz, Takaaki Hori, Jonathan Le Roux:
Semi-Supervised Speech Recognition via Graph-based Temporal Classification. CoRR abs/2010.15653 (2020)
[i24]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2011-13439
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-13439
Sameer Khurana, Niko Moritz, Takaaki Hori, Jonathan Le Roux:
Unsupervised Domain Adaptation for Speech Recognition via Uncertainty Driven Self-Training. CoRR abs/2011.13439 (2020)
[i23]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2012-13006
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-13006
Shinji Watanabe, Florian Boyer, Xuankai Chang, Pengcheng Guo, Tomoki Hayashi, Yosuke Higuchi, Takaaki Hori, Wen-Chin Huang, Hirofumi Inaguma, Naoyuki Kamo, Shigeki Karita, Chenda Li, Jing Shi, Aswin Shanmugam Subramanian, Wangyou Zhang:
The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans. CoRR abs/2012.13006 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[j21]
- view
  authority control:
- export record
  dblp key:
  - journals/csl/HoriWKHHH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/HoriWKHHH19
Takaaki Hori, Wen Wang, Yusuke Koji, Chiori Hori, Bret Harsham, John R. Hershey:
Adversarial training and decoding strategies for end-to-end neural conversation models. Comput. Speech Lang. 54: 122-139 (2019)
[j20]
- view
  authority control:
- export record
  dblp key:
  - journals/csl/HoriPHHBITTYK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/HoriPHHBITTYK19
Chiori Hori, Julien Perez, Ryuichiro Higashinaka, Takaaki Hori, Y-Lan Boureau, Michimasa Inaba, Yuiko Tsunomori, Tetsuro Takahashi, Koichiro Yoshino, Seokhwan Kim:
Overview of the sixth dialog system technology challenge: DSTC6. Comput. Speech Lang. 55: 1-25 (2019)
[c91]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/KaritaWWYZCHHIJ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/KaritaWWYZCHHIJ19
Shigeki Karita, Xiaofei Wang, Shinji Watanabe, Takenori Yoshimura, Wangyou Zhang, Nanxin Chen, Tomoki Hayashi, Takaaki Hori, Hirofumi Inaguma, Ziyan Jiang, Masao Someki, Nelson Enrique Yalta Soplin, Ryuichi Yamamoto:
A Comparative Study on Transformer vs RNN in Speech Applications. ASRU 2019: 449-456
[c90]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/MoritzHR19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/MoritzHR19
Niko Moritz, Takaaki Hori, Jonathan Le Roux:
Streaming End-to-End Speech Recognition with Joint CTC-Attention Based Models. ASRU 2019: 936-943
[c89]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/YaltaWHNO19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/YaltaWHNO19
Nelson Yalta, Shinji Watanabe, Takaaki Hori, Kazuhiro Nakadai, Tetsuya Ogata:
CNN-based Multichannel End-to-End Speech Recognition for Everyday Home Environments^*. EUSIPCO 2019: 1-5
[c88]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HoriA0WHCMCLDEB19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HoriA0WHCMCLDEB19
Chiori Hori, Huda AlAmri, Jue Wang, Gordon Wichern, Takaaki Hori, Anoop Cherian, Tim K. Marks, Vincent Cartillier, Raphael Gontijo Lopes, Abhishek Das, Irfan Essa, Dhruv Batra, Devi Parikh:
End-to-end Audio Visual Scene-aware Dialog Using Multimodal Attention-based Video Features. ICASSP 2019: 2352-2356
[c87]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BaskarBWKHC19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BaskarBWKHC19
Murali Karthick Baskar, Lukás Burget, Shinji Watanabe, Martin Karafiát, Takaaki Hori, Jan Honza Cernocký:
Promising Accurate Prefix Boosting for Sequence-to-sequence ASR. ICASSP 2019: 5646-5650
[c86]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MoritzHR19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MoritzHR19
Niko Moritz, Takaaki Hori, Jonathan Le Roux:
Triggered Attention for End-to-end Speech Recognition. ICASSP 2019: 5666-5670
[c85]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChoWHBIVD19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChoWHBIVD19
Jaejin Cho, Shinji Watanabe, Takaaki Hori, Murali Karthick Baskar, Hirofumi Inaguma, Jesús Villalba, Najim Dehak:
Language Model Integration Based on Memory Control for Sequence to Sequence Speech Recognition. ICASSP 2019: 6191-6195
[c84]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HoriAHZWR19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HoriAHZWR19
Takaaki Hori, Ramón Fernandez Astudillo, Tomoki Hayashi, Yu Zhang, Shinji Watanabe, Jonathan Le Roux:
Cycle-consistency Training for End-to-end Speech Recognition. ICASSP 2019: 6271-6275
[c83]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangLMHWH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangLMHWH19
Xiaofei Wang, Ruizhi Li, Sri Harish Mallidi, Takaaki Hori, Shinji Watanabe, Hynek Hermansky:
Stream Attention-based Multi-array End-to-end Speech Recognition. ICASSP 2019: 7105-7109
[c82]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MoritzHR19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MoritzHR19
Niko Moritz, Takaaki Hori, Jonathan Le Roux:
Unidirectional Neural Network Architectures for End-to-End Automatic Speech Recognition. INTERSPEECH 2019: 76-80
[c81]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HoriCMH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HoriCMH19
Chiori Hori, Anoop Cherian, Tim K. Marks, Takaaki Hori:
Joint Student-Teacher Learning for Audio-Visual Scene-Aware Dialog. INTERSPEECH 2019: 1886-1890
[c80]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KarafiatBWHWC19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KarafiatBWHWC19
Martin Karafiát, Murali Karthick Baskar, Shinji Watanabe, Takaaki Hori, Matthew Wiesner, Jan Cernocký:
Analysis of Multilingual Sequence-to-Sequence Speech Recognition Systems. INTERSPEECH 2019: 2220-2224
[c79]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SekiHWRH19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SekiHWRH19
Hiroshi Seki, Takaaki Hori, Shinji Watanabe, Jonathan Le Roux, John R. Hershey:
End-to-End Multilingual Multi-Speaker Speech Recognition. INTERSPEECH 2019: 3755-3759
[c78]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BaskarWAHBC19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BaskarWAHBC19
Murali Karthick Baskar, Shinji Watanabe, Ramón Fernandez Astudillo, Takaaki Hori, Lukás Burget, Jan Cernocký:
Semi-Supervised Sequence-to-Sequence ASR Using Unpaired Speech and Text. INTERSPEECH 2019: 3790-3794
[c77]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SekiHWMR19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SekiHWMR19
Hiroshi Seki, Takaaki Hori, Shinji Watanabe, Niko Moritz, Jonathan Le Roux:
Vectorized Beam Search for CTC-Attention-Based Speech Recognition. INTERSPEECH 2019: 3825-3829
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1905-01152
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1905-01152
Murali Karthick Baskar, Shinji Watanabe, Ramón Fernandez Astudillo, Takaaki Hori, Lukás Burget, Jan Cernocký:
Self-supervised Sequence-to-sequence ASR using Unpaired Speech and Text. CoRR abs/1905.01152 (2019)
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1906-08041
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1906-08041
Ruizhi Li, Xiaofei Wang, Sri Harish Mallidi, Shinji Watanabe, Takaaki Hori, Hynek Hermansky:
Multi-Stream End-to-End Speech Recognition. CoRR abs/1906.08041 (2019)
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1909-06317
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-06317
Shigeki Karita, Nanxin Chen, Tomoki Hayashi, Takaaki Hori, Hirofumi Inaguma, Ziyan Jiang, Masao Someki, Nelson Enrique Yalta Soplin, Ryuichi Yamamoto, Xiaofei Wang, Shinji Watanabe, Takenori Yoshimura, Wangyou Zhang:
A Comparative Study on Transformer vs RNN in Speech Applications. CoRR abs/1909.06317 (2019)
2018
[c76]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/WatanabeRHSH18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/WatanabeRHSH18
Hiroshi Seki, Takaaki Hori, Shinji Watanabe, Jonathan Le Roux, John R. Hershey:
A Purely End-to-End System for Multi-speaker Speech Recognition. ACL (1) 2018: 2620-2630
[c75]
- view
  - electronic edition @ thecvf.com (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/cvpr/HoriHWWLCM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/HoriHWWLCM18
Chiori Hori, Takaaki Hori, Gordon Wichern, Jue Wang, Teng-Yok Lee, Anoop Cherian, Tim K. Marks:
Multimodal Attention for Fusion of Audio and Spatiotemporal Features for Video Description. CVPR Workshops 2018: 2528-2531
[c74]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SettleRHWH18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SettleRHWH18
Shane Settle, Jonathan Le Roux, Takaaki Hori, Shinji Watanabe, John R. Hershey:
End-to-End Multi-Speaker Speech Recognition. ICASSP 2018: 4819-4823
[c73]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SekiWHRH18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SekiWHRH18
Hiroshi Seki, Shinji Watanabe, Takaaki Hori, Jonathan Le Roux, John R. Hershey:
An End-to-End Language-Tracking Speech Recognizer for Mixed-Language Speech. ICASSP 2018: 4919-4923
[c72]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/OchiaiWKHH18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/OchiaiWKHH18
Tsubasa Ochiai, Shinji Watanabe, Shigeru Katagiri, Takaaki Hori, John R. Hershey:
Speaker Adaptation for Multichannel End-to-End Speech Recognition. ICASSP 2018: 6707-6711
[c71]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WatanabeHKHNUSH18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WatanabeHKHNUSH18
Shinji Watanabe, Takaaki Hori, Shigeki Karita, Tomoki Hayashi, Jiro Nishitoba, Yuya Unno, Nelson Enrique Yalta Soplin, Jahn Heymann, Matthew Wiesner, Nanxin Chen, Adithya Renduchintala, Tsubasa Ochiai:
ESPnet: End-to-End Speech Processing Toolkit. INTERSPEECH 2018: 2207-2211
[c70]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/HoriCW18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/HoriCW18
Takaaki Hori, Jaejin Cho, Shinji Watanabe:
End-to-end Speech Recognition With Word-Based Rnn Language Models. SLT 2018: 389-396
[c69]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/HayashiWZTHAT18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/HayashiWZTHAT18
Tomoki Hayashi, Shinji Watanabe, Yu Zhang, Tomoki Toda, Takaaki Hori, Ramón Fernandez Astudillo, Kazuya Takeda:
Back-Translation-Style Data Augmentation for end-to-end ASR. SLT 2018: 426-433
[c68]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/ChoBLWMYKWH18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/ChoBLWMYKWH18
Jaejin Cho, Murali Karthick Baskar, Ruizhi Li, Matthew Wiesner, Sri Harish Mallidi, Nelson Yalta, Martin Karafiát, Shinji Watanabe, Takaaki Hori:
Multilingual Sequence-to-Sequence Speech Recognition: Architecture, Transfer Learning, and Language Modeling. SLT 2018: 521-527
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1804-00015
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1804-00015
Shinji Watanabe, Takaaki Hori, Shigeki Karita, Tomoki Hayashi, Jiro Nishitoba, Yuya Unno, Nelson Enrique Yalta Soplin, Jahn Heymann, Matthew Wiesner, Nanxin Chen, Adithya Renduchintala, Tsubasa Ochiai:
ESPnet: End-to-End Speech Processing Toolkit. CoRR abs/1804.00015 (2018)
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1805-05826
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1805-05826
Hiroshi Seki, Takaaki Hori, Shinji Watanabe, Jonathan Le Roux, John R. Hershey:
A Purely End-to-end System for Multi-speaker Speech Recognition. CoRR abs/1805.05826 (2018)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1806-08409
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1806-08409
Chiori Hori, Huda AlAmri, Jue Wang, Gordon Wichern, Takaaki Hori, Anoop Cherian, Tim K. Marks, Vincent Cartillier, Raphael Gontijo Lopes, Abhishek Das, Irfan Essa, Dhruv Batra, Devi Parikh:
End-to-End Audio Visual Scene-Aware Dialog using Multimodal Attention-Based Video Features. CoRR abs/1806.08409 (2018)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1807-10893
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1807-10893
Tomoki Hayashi, Shinji Watanabe, Yu Zhang, Tomoki Toda, Takaaki Hori, Ramón Fernandez Astudillo, Kazuya Takeda:
Back-Translation-Style Data Augmentation for End-to-End ASR. CoRR abs/1807.10893 (2018)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1808-02608
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1808-02608
Takaaki Hori, Jaejin Cho, Shinji Watanabe:
End-to-end Speech Recognition with Word-based RNN Language Models. CoRR abs/1808.02608 (2018)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1810-03459
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1810-03459
Jaejin Cho, Murali Karthick Baskar, Ruizhi Li, Matthew Wiesner, Sri Harish Reddy Mallidi, Nelson Yalta, Martin Karafiát, Shinji Watanabe, Takaaki Hori:
Multilingual sequence-to-sequence speech recognition: architecture, transfer learning, and language modeling. CoRR abs/1810.03459 (2018)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1811-01690
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-01690
Takaaki Hori, Ramón Fernandez Astudillo, Tomoki Hayashi, Yu Zhang, Shinji Watanabe, Jonathan Le Roux:
Cycle-consistency training for end-to-end speech recognition. CoRR abs/1811.01690 (2018)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1811-02162
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-02162
Jaejin Cho, Shinji Watanabe, Takaaki Hori, Murali Karthick Baskar, Hirofumi Inaguma, Jesús Villalba, Najim Dehak:
Language model integration based on memory control for sequence to sequence speech recognition. CoRR abs/1811.02162 (2018)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1811-02735
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-02735
Nelson Yalta, Shinji Watanabe, Takaaki Hori, Kazuhiro Nakadai, Tetsuya Ogata:
CNN-based MultiChannel End-to-End Speech Recognition for everyday home environments. CoRR abs/1811.02735 (2018)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1811-02770
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-02770
Murali Karthick Baskar, Lukás Burget, Shinji Watanabe, Martin Karafiát, Takaaki Hori, Jan Honza Cernocký:
Promising Accurate Prefix Boosting for sequence-to-sequence ASR. CoRR abs/1811.02770 (2018)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1811-03451
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-03451
Martin Karafiát, Murali Karthick Baskar, Shinji Watanabe, Takaaki Hori, Matthew Wiesner, Jan Honza Cernocký:
Analysis of Multilingual Sequence-to-Sequence speech recognition systems. CoRR abs/1811.03451 (2018)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1811-04568
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-04568
Hiroshi Seki, Takaaki Hori, Shinji Watanabe:
Vectorization of hypotheses and speech for faster beam search in encoder decoder-based speech recognition. CoRR abs/1811.04568 (2018)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1811-04897
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-04897
Ruizhi Li, Xiaofei Wang, Sri Harish Reddy Mallidi, Takaaki Hori, Shinji Watanabe, Hynek Hermansky:
Multi-encoder multi-resolution framework for end-to-end speech recognition. CoRR abs/1811.04897 (2018)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1811-04903
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1811-04903
Xiaofei Wang, Ruizhi Li, Sri Harish Mallidi, Takaaki Hori, Shinji Watanabe, Hynek Hermansky:
Stream attention-based multi-array end-to-end speech recognition. CoRR abs/1811.04903 (2018)
2017
[j19]
- view
  authority control:
- export record
  dblp key:
  - journals/csl/HoriCEHRMW17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/HoriCEHRMW17
Takaaki Hori, Zhuo Chen, Hakan Erdogan, John R. Hershey, Jonathan Le Roux, Vikramjit Mitra, Shinji Watanabe:
Multi-microphone speech recognition integrating beamforming, robust feature extraction, and advanced DNN/RNN backend. Comput. Speech Lang. 46: 401-418 (2017)
[j18]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/WatanabeHKHH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/WatanabeHKHH17
Shinji Watanabe, Takaaki Hori, Suyoun Kim, John R. Hershey, Tomoki Hayashi:
Hybrid CTC/Attention Architecture for End-to-End Speech Recognition. IEEE J. Sel. Top. Signal Process. 11(8): 1240-1253 (2017)
[j17]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/OchiaiWHHX17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/OchiaiWHHX17
Tsubasa Ochiai, Shinji Watanabe, Takaaki Hori, John R. Hershey, Xiong Xiao:
Unified Architecture for Multichannel End-to-End Speech Recognition With Neural Beamforming. IEEE J. Sel. Top. Signal Process. 11(8): 1274-1288 (2017)
[j16]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/OgawaH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/OgawaH17
Atsunori Ogawa, Takaaki Hori:
Error detection and accuracy estimation in automatic speech recognition using deep bidirectional recurrent neural networks. Speech Commun. 89: 70-83 (2017)
[j15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/HayashiWTHRT17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/HayashiWTHRT17
Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Takaaki Hori, Jonathan Le Roux, Kazuya Takeda:
Duration-Controlled LSTM for Polyphonic Sound Event Detection. IEEE ACM Trans. Audio Speech Lang. Process. 25(11): 2059-2070 (2017)
[c67]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/acl/HoriWH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/HoriWH17
Takaaki Hori, Shinji Watanabe, John R. Hershey:
Joint CTC/attention decoding for end-to-end speech recognition. ACL (1) 2017: 518-529
[c66]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/WatanabeHH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/WatanabeHH17
Shinji Watanabe, Takaaki Hori, John R. Hershey:
Language independent end-to-end architecture for joint language identification and speech recognition. ASRU 2017: 265-271
[c65]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/HoriWH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/HoriWH17
Takaaki Hori, Shinji Watanabe, John R. Hershey:
Multi-level language modeling and decoding for open vocabulary end-to-end speech recognition. ASRU 2017: 287-293
[c64]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/HoriHMH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/HoriHMH17
Chiori Hori, Takaaki Hori, Tim K. Marks, John R. Hershey:
Early and late integration of audio features for automatic video description. ASRU 2017: 430-436
[c63]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HayashiWTHRT17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HayashiWTHRT17
Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Takaaki Hori, Jonathan Le Roux, Kazuya Takeda:
BLSTM-HMM hybrid system combined with sound activity detection network for polyphonic Sound Event Detection. ICASSP 2017: 766-770
[c62]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KimHW17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KimHW17
Suyoun Kim, Takaaki Hori, Shinji Watanabe:
Joint CTC-attention based end-to-end speech recognition using multi-task learning. ICASSP 2017: 4835-4839
[c61]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WatanabeHRH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WatanabeHRH17
Shinji Watanabe, Takaaki Hori, Jonathan Le Roux, John R. Hershey:
Student-teacher network learning with enhanced features. ICASSP 2017: 5275-5279
[c60]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/HoriHLZHHMS17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/HoriHLZHHMS17
Chiori Hori, Takaaki Hori, Teng-Yok Lee, Ziming Zhang, Bret Harsham, John R. Hershey, Tim K. Marks, Kazuhiro Sumi:
Attention-Based Multimodal Fusion for Video Description. ICCV 2017: 4203-4212
[c59]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/OchiaiWHH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/OchiaiWHH17
Tsubasa Ochiai, Shinji Watanabe, Takaaki Hori, John R. Hershey:
Multichannel End-to-end Speech Recognition. ICML 2017: 2632-2641
[c58]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HoriWZC17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HoriWZC17
Takaaki Hori, Shinji Watanabe, Yu Zhang, William Chan:
Advances in Joint CTC-Attention Based End-to-End Speech Recognition with a Deep CNN Encoder and RNN-LM. INTERSPEECH 2017: 949-953
[p1]
- view
  authority control:
- export record
  dblp key:
  - books/sp/17/WatanabeHMDMH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/books/sp/17/WatanabeHMDMH17
Shinji Watanabe, Takaaki Hori, Yajie Miao, Marc Delcroix, Florian Metze, John R. Hershey:
Toolkits for Robust Speech Processing. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 369-382
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/HoriHLSHM17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/HoriHLSHM17
Chiori Hori, Takaaki Hori, Teng-Yok Lee, Kazuhiro Sumi, John R. Hershey, Tim K. Marks:
Attention-Based Multimodal Fusion for Video Description. CoRR abs/1701.03126 (2017)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/OchiaiWHH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/OchiaiWHH17
Tsubasa Ochiai, Shinji Watanabe, Takaaki Hori, John R. Hershey:
Multichannel End-to-end Speech Recognition. CoRR abs/1703.04783 (2017)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/HoriWZC17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/HoriWZC17
Takaaki Hori, Shinji Watanabe, Yu Zhang, William Chan:
Advances in Joint CTC-Attention based End-to-End Speech Recognition with a Deep CNN Encoder and RNN-LM. CoRR abs/1706.02737 (2017)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/HoriH17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/HoriH17
Chiori Hori, Takaaki Hori:
End-to-end Conversation Modeling Track in DSTC6. CoRR abs/1706.07440 (2017)
2016
[j14]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/OgawaHN16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/OgawaHN16
Atsunori Ogawa, Takaaki Hori, Atsushi Nakamura:
Estimating Speech Recognition Accuracy Based on Error Type Classification. IEEE ACM Trans. Audio Speech Lang. Process. 24(12): 2400-2413 (2016)
[c57]
- view
  - electronic edition @ dcase.community (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/dcase/HayashiWTHRT16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/HayashiWTHRT16
Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Takaaki Hori, Jonathan Le Roux, Kazuya Takeda:
Bidirectional LSTM-HMM Hybrid System for Polyphonic Sound Event Detection. DCASE 2016: 35-39
[c56]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HoriHWH16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HoriHWH16
Takaaki Hori, Chiori Hori, Shinji Watanabe, John R. Hershey:
Minimum word error training of long short-term memory recurrent neural network language models for speech recognition. ICASSP 2016: 5990-5994
[c55]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/HoriWHHHKFF16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/HoriWHHHKFF16
Chiori Hori, Shinji Watanabe, Takaaki Hori, Bret A. Harsham, John R. Hershey, Yusuke Koji, Youichi Fujii, Yuki Furumoto:
Driver confusion status detection using recurrent neural networks. ICME 2016: 1-6
[c54]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HoriHWH16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HoriHWH16
Chiori Hori, Takaaki Hori, Shinji Watanabe, John R. Hershey:
Context-Sensitive and Role-Dependent Spoken Language Understanding Using Bidirectional and Attention LSTMs. INTERSPEECH 2016: 3236-3240
[c53]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/HoriWHWHRHKJZA16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/HoriWHWHRHKJZA16
Takaaki Hori, Hai Wang, Chiori Hori, Shinji Watanabe, Bret Harsham, Jonathan Le Roux, John R. Hershey, Yusuke Koji, Yi Jing, Zhaocheng Zhu, Takeyuki Aikawa:
Dialog state tracking with attention-based sequence-to-sequence learning. SLT 2016: 552-558
[c52]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/TanakaMSWHD16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/TanakaMSWHD16
Tomohiro Tanaka, Takafumi Moriya, Takahiro Shinozaki, Shinji Watanabe, Takaaki Hori, Kevin Duh:
Automated structure discovery and parameter tuning of neural network language model based on evolution strategy. SLT 2016: 665-671
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/KimHW16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/KimHW16
Suyoun Kim, Takaaki Hori, Shinji Watanabe:
Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning. CoRR abs/1609.06773 (2016)
2015
[j13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/ejasp/DelcroixYOKFIKE15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasp/DelcroixYOKFIKE15
Marc Delcroix, Takuya Yoshioka, Atsunori Ogawa, Yotaro Kubo, Masakiyo Fujimoto, Nobutaka Ito, Keisuke Kinoshita, Miquel Espi, Shoko Araki, Takaaki Hori, Tomohiro Nakatani:
Strategies for distant speech recognitionin reverberant environments. EURASIP J. Adv. Signal Process. 2015: 60 (2015)
[c51]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/HoriCEHRMW15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/HoriCEHRMW15
Takaaki Hori, Zhuo Chen, Hakan Erdogan, John R. Hershey, Jonathan Le Roux, Vikramjit Mitra, Shinji Watanabe:
The MERL/SRI system for the 3RD CHiME challenge using beamforming, robust feature extraction, and advanced speech recognition. ASRU 2015: 475-481
[c50]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/OgawaH15a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/OgawaH15a
Atsunori Ogawa, Takaaki Hori:
ASR error detection and recognition rate estimation using deep bidirectional recurrent neural networks. ICASSP 2015: 4370-4374
[c49]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DelcroixKHN15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DelcroixKHN15
Marc Delcroix, Keisuke Kinoshita, Takaaki Hori, Tomohiro Nakatani:
Context adaptive deep neural networks for fast acoustic model adaptation. ICASSP 2015: 4535-4539
[c48]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DoNDH15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DoNDH15
Quoc Truong Do, Satoshi Nakamura, Marc Delcroix, Takaaki Hori:
WFST-based structural classification integrating dnn acoustic features and RNN language features for speech recognition. ICASSP 2015: 4959-4963
[c47]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/AoyamaOHH15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/AoyamaOHH15
Kazuo Aoyama, Atsunori Ogawa, Takashi Hattori, Takaaki Hori:
Double-layer neighborhood graph based similarity search for fast query-by-example spoken term detection. ICASSP 2015: 5216-5220
[c46]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MoriokaIHK15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MoriokaIHK15
Tsuyoshi Morioka, Tomoharu Iwata, Takaaki Hori, Tetsunori Kobayashi:
Multiscale recurrent neural network based language model. INTERSPEECH 2015: 2366-2370
2014
[c45]
- view
  authority control:
- export record
  dblp key:
  - conf/globalsip/DelcroixYOKFIKE14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/globalsip/DelcroixYOKFIKE14
Marc Delcroix, Takuya Yoshioka, Atsunori Ogawa, Yotaro Kubo, Masakiyo Fujimoto, Nobutaka Ito, Keisuke Kinoshita, Miquel Espi, Shoko Araki, Takaaki Hori, Tomohiro Nakatani:
Defeating reverberation: Advanced dereverberation and recognition techniques for hands-free speech recognition. GlobalSIP 2014: 522-526
[c44]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/OgawaKHNN14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/OgawaKHNN14
Atsunori Ogawa, Keisuke Kinoshita, Takaaki Hori, Tomohiro Nakatani, Atsushi Nakamura:
Fast segment search for corpus-based speech enhancement based on speech recognition technology. ICASSP 2014: 1557-1561
[c43]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HoriKN14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HoriKN14
Takaaki Hori, Yotaro Kubo, Atsushi Nakamura:
Real-time one-pass decoding with recurrent neural network language model for speech recognition. ICASSP 2014: 6364-6368
[c42]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/AoyamaOHHN14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/AoyamaOHHN14
Kazuo Aoyama, Atsunori Ogawa, Takashi Hattori, Takaaki Hori, Atsushi Nakamura:
Zero-resource spoken term detection using hierarchical graph-based similarity search. ICASSP 2014: 7093-7097
[c41]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KuboSHN14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KuboSHN14
Yotaro Kubo, Jun Suzuki, Takaaki Hori, Atsushi Nakamura:
Restructuring output layers of deep neural networks using minimum risk parameter clustering. INTERSPEECH 2014: 1068-1072
2013
[b1]
- view
  authority control:
- export record
  dblp key:
  - series/synthesis/2013Hori
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/series/synthesis/2013Hori
Takaaki Hori, Atsushi Nakamura:
Speech Recognition Algorithms Based on Weighted Finite-State Transducers. Synthesis Lectures on Speech and Audio Processing, Morgan & Claypool Publishers 2013, ISBN 9781608454730
[j12]
- view
  authority control:
- export record
  dblp key:
  - journals/csl/DelcroixKNAOHWFYOKSHN13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/DelcroixKNAOHWFYOKSHN13
Marc Delcroix, Keisuke Kinoshita, Tomohiro Nakatani, Shoko Araki, Atsunori Ogawa, Takaaki Hori, Shinji Watanabe, Masakiyo Fujimoto, Takuya Yoshioka, Takanobu Oba, Yotaro Kubo, Mehrez Souden, Seong-Jun Hahm, Atsushi Nakamura:
Speech recognition in living rooms: Integrated speech enhancement and recognition system based on spatial, spectral and temporal modeling of sounds. Comput. Speech Lang. 27(3): 851-873 (2013)
[j11]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/HahmWOFHN13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/HahmWOFHN13
Seong-Jun Hahm, Shinji Watanabe, Atsunori Ogawa, Masakiyo Fujimoto, Takaaki Hori, Atsushi Nakamura:
Prior-shared feature and model space speaker adaptation by consistently employing map estimation. Speech Commun. 55(3): 415-431 (2013)
[c40]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/OgawaHN13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/OgawaHN13
Atsunori Ogawa, Takaaki Hori, Atsushi Nakamura:
Discriminative recognition rate estimation for N-best list and its application to N-best rescoring. ICASSP 2013: 6832-6836
[c39]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/NakataniSAYHO13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/NakataniSAYHO13
Tomohiro Nakatani, Mehrez Souden, Shoko Araki, Takuya Yoshioka, Takaaki Hori, Atsunori Ogawa:
Coupling beamforming with spatial and spectral feature based spectral enhancement and its application to meeting recognition. ICASSP 2013: 7249-7253
[c38]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KuboHN13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KuboHN13
Yotaro Kubo, Takaaki Hori, Atsushi Nakamura:
Large vocabulary continuous speech recognition based on WFST structured classifiers and deep bottleneck features. ICASSP 2013: 7629-7633
[c37]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HahmODFHN13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HahmODFHN13
Seong-Jun Hahm, Atsunori Ogawa, Marc Delcroix, Masakiyo Fujimoto, Takaaki Hori, Atsushi Nakamura:
Feature space variational Bayesian linear regression and its combination with model space VBLR. ICASSP 2013: 7898-7902
[c36]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/AoyamaOHHN13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/AoyamaOHHN13
Kazuo Aoyama, Atsunori Ogawa, Takashi Hattori, Takaaki Hori, Atsushi Nakamura:
Graph index based query-by-example search on a large speech data set. ICASSP 2013: 8520-8524
[c35]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KuboHN13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KuboHN13
Yotaro Kubo, Takaaki Hori, Atsushi Nakamura:
A method for structure estimation of weighted finite-state transducers and its application to grapheme-to-phoneme conversion. INTERSPEECH 2013: 647-651
[c34]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ObaOHMN13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ObaOHMN13
Takanobu Oba, Atsunori Ogawa, Takaaki Hori, Hirokazu Masataki, Atsushi Nakamura:
Unsupervised discriminative language modeling using error rate estimator. INTERSPEECH 2013: 1223-1227
2012
[j10]
- view
  authority control:
- export record
  dblp key:
  - journals/ieicet/ObaHNI12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ieicet/ObaHNI12
Takanobu Oba, Takaaki Hori, Atsushi Nakamura, Akinori Ito:
Model Shrinkage for Discriminative Language Models. IEICE Trans. Inf. Syst. 95-D(5): 1465-1474 (2012)
[j9]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/ObaHN12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/ObaHN12
Takanobu Oba, Takaaki Hori, Atsushi Nakamura:
Efficient training of discriminative language models by sample selection. Speech Commun. 54(6): 791-800 (2012)
[j8]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/HoriAYFWOOOMKNNY12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/HoriAYFWOOOMKNNY12
Takaaki Hori, Shoko Araki, Takuya Yoshioka, Masakiyo Fujimoto, Shinji Watanabe, Takanobu Oba, Atsunori Ogawa, Kazuhiro Otsuka, Dan Mikami, Keisuke Kinoshita, Tomohiro Nakatani, Atsushi Nakamura, Junji Yamato:
Low-Latency Real-Time Meeting Recognition and Understanding Using Distant Microphones and Omni-Directional Camera. IEEE Trans. Speech Audio Process. 20(2): 499-513 (2012)
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ObaHNI12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ObaHNI12
Takanobu Oba, Takaaki Hori, Atsushi Nakamura, Akinori Ito:
Round-Robin Duel Discriminative Language Models. IEEE Trans. Speech Audio Process. 20(4): 1244-1255 (2012)
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/KuboWHN12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/KuboWHN12
Yotaro Kubo, Shinji Watanabe, Takaaki Hori, Atsushi Nakamura:
Structural Classification Methods Based on Weighted Finite-State Transducers for Automatic Speech Recognition. IEEE Trans. Speech Audio Process. 20(8): 2240-2251 (2012)
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WatanabeKOHN12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WatanabeKOHN12
Shinji Watanabe, Yotaro Kubo, Takanobu Oba, Takaaki Hori, Atsushi Nakamura:
Bag Of ARCS: New representation of speech segment features based on finite state machines. ICASSP 2012: 4201-4204
[c32]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/OgawaHN12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/OgawaHN12
Atsunori Ogawa, Takaaki Hori, Atsushi Nakamura:
Error type classification and word accuracy estimation using alignment features from word confusion network. ICASSP 2012: 4925-4928
[c31]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChuangsuwanichWHIG12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChuangsuwanichWHIG12
Ekapol Chuangsuwanich, Shinji Watanabe, Takaaki Hori, Tomoharu Iwata, James R. Glass:
Handling uncertain observations in unsupervised topic-mixture language model adaptation. ICASSP 2012: 5033-5036
[c30]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ObaHNI12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ObaHNI12
Takanobu Oba, Takaaki Hori, Atsushi Nakamura, Akinori Ito:
Spoken document retrieval by discriminative modeling in a high dimensional feature space. ICASSP 2012: 5153-5156
[c29]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HahmOFHN12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HahmOFHN12
Seong-Jun Hahm, Atsunori Ogawa, Masakiyo Fujimoto, Takaaki Hori, Atsushi Nakamura:
Speaker Adaptation Using Variational Bayesian Linear Regression in Normalized Feature Space. INTERSPEECH 2012: 803-806
[c28]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KobashikawaHYAMT12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KobashikawaHYAMT12
Satoshi Kobashikawa, Takaaki Hori, Yoshikazu Yamaguchi, Taichi Asami, Hirokazu Masataki, Satoshi Takahashi:
Efficient Beam Width Control to Suppress Excessive Speech Recognition Computation Time Based on Prior Score Range Normalization. INTERSPEECH 2012: 1011-1014
[c27]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KuboHN12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KuboHN12
Yotaro Kubo, Takaaki Hori, Atsushi Nakamura:
Integrating Deep Neural Networks into Structural Classification Approach based on Weighted Finite-State Transducers. INTERSPEECH 2012: 2594-2597
[c26]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/OgawaHN12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/OgawaHN12
Atsunori Ogawa, Takaaki Hori, Atsushi Nakamura:
Recognition rate estimation based on word alignment network and discriminative error type classification. SLT 2012: 113-118
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/SatoshiHYAHT12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/SatoshiHYAHT12
Satoshi Kobashikawa, Takaaki Hori, Yoshikazu Yamaguchi, Taichi Asami, Hirokazu Masataki, Satoshi Takahashi:
Efficient prior and incremental beam width control to suppress excessive speech recognition time based on score range estimation. SLT 2012: 125-130
2011
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/csl/WatanabeITSA11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/WatanabeITSA11
Shinji Watanabe, Tomoharu Iwata, Takaaki Hori, Atsushi Sako, Yasuo Ariki:
Topic tracking language model for speech recognition. Comput. Speech Lang. 25(2): 440-461 (2011)
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WatanabeMHN11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WatanabeMHN11
Shinji Watanabe, Daichi Mochihashi, Takaaki Hori, Atsushi Nakamura:
Gibbs sampling based Multi-scale Mixture Model for speaker clustering. ICASSP 2011: 4524-4527
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ObaHIN11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ObaHIN11
Takanobu Oba, Takaaki Hori, Akinori Ito, Atsushi Nakamura:
Round-robin duel discriminative language models in one-pass decoding with on-the-fly error correction. ICASSP 2011: 5588-5591
2010
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/ieicet/ObaHN10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ieicet/ObaHN10
Takanobu Oba, Takaaki Hori, Atsushi Nakamura:
Improved Sequential Dependency Analysis Integrating Labeling-Based Sentence Boundary Detection. IEICE Trans. Inf. Syst. 93-D(5): 1272-1281 (2010)
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WatanabeHMN10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WatanabeHMN10
Shinji Watanabe, Takaaki Hori, Erik McDermott, Atsushi Nakamura:
A discriminative model for continuous speech recognition based on Weighted Finite State Transducers. ICASSP 2010: 4922-4925
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HoriWN10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HoriWN10
Takaaki Hori, Shinji Watanabe, Atsushi Nakamura:
Search error risk minimization in Viterbi beam search for speech recognition. ICASSP 2010: 4934-4937
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ObaHN10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ObaHN10
Takanobu Oba, Takaaki Hori, Atsushi Nakamura:
A comparative study on methods of Weighted language model training for reranking lvcsr N-best hypotheses. ICASSP 2010: 5126-5129
[c19]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WatanabeHN10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WatanabeHN10
Shinji Watanabe, Takaaki Hori, Atsushi Nakamura:
Large vocabulary continuous speech recognition using WFST-based linear classifier for structured data. INTERSPEECH 2010: 346-349
[c18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HoriWN10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HoriWN10
Takaaki Hori, Shinji Watanabe, Atsushi Nakamura:
Improvements of search error risk minimization in viterbi beam search for speech recognition. INTERSPEECH 2010: 1962-1965
[c17]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ObaHN10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ObaHN10
Takanobu Oba, Takaaki Hori, Atsushi Nakamura:
Round-robin discrimination model for reranking ASR hypotheses. INTERSPEECH 2010: 2446-2449
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/WatanabeIHSA10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/WatanabeIHSA10
Shinji Watanabe, Tomoharu Iwata, Takaaki Hori, Atsushi Sako, Yasuo Ariki:
Application of topic tracking model to language model adaptation and meeting analysis. SLT 2010: 378-383
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/HoriAYFWOOOMKNNY10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/HoriAYFWOOOMKNNY10
Takaaki Hori, Shoko Araki, Takuya Yoshioka, Masakiyo Fujimoto, Shinji Watanabe, Takanobu Oba, Atsunori Ogawa, Kazuhiro Otsuka, Dan Mikami, Keisuke Kinoshita, Tomohiro Nakatani, Atsushi Nakamura, Junji Yamato:
Real-time meeting recognition and understanding using distant microphones and omni-directional camera. SLT 2010: 424-429

2000 – 2009

see FAQ

What is the meaning of the colors in the publication lists?

2008
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/ObaHN08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/ObaHN08
Takanobu Oba, Takaaki Hori, Atsushi Nakamura:
Sequential dependency analysis for online spontaneous speech processing. Speech Commun. 50(7): 616-625 (2008)
2007
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/HoriHMN07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/HoriHMN07
Takaaki Hori, Chiori Hori, Yasuhiro Minami, Atsushi Nakamura:
Efficient WFST-Based One-Pass Decoding With On-The-Fly Hypothesis Rescoring in Extremely Large Vocabulary Continuous Speech Recognition. IEEE Trans. Speech Audio Process. 15(4): 1352-1365 (2007)
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HoriHHG07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HoriHHG07
Takaaki Hori, I. Lee Hetherington, Timothy J. Hazen, James R. Glass:
Open-Vocabulary Spoken Utterance Retrieval using Confusion Networks. ICASSP (4) 2007: 73-76
[c13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ObaHN07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ObaHN07
Takanobu Oba, Takaaki Hori, Atsushi Nakamura:
An approach to efficient generation of high-accuracy and compact error-corrective models for speech recognition. INTERSPEECH 2007: 1753-1756
2006
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/cim/NakamuraWHMK06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/cim/NakamuraWHMK06
Atsushi Nakamura, Shinji Watanabe, Takaaki Hori, Erik McDermott, Shigeru Katagiri:
Advanced computational models and learning theories for spoken language processing. IEEE Comput. Intell. Mag. 1(2): 5-9 (2006)
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HoriN06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HoriN06
Takaaki Hori, Atsushi Nakamura:
An Extremely Large Vocabulary Approach to Named Entity Extraction from Speech. ICASSP (1) 2006: 973-976
[c11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ObaHN06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ObaHN06
Takanobu Oba, Takaaki Hori, Atsushi Nakamura:
Sentence boundary detection using sequential dependency analysis combined with CRF-based chunking. INTERSPEECH 2006
2005
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SchusterH05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SchusterH05
Mike Schuster, Takaaki Hori:
Efficient Generation of high-order context-dependent Weighted Finite State Transducers for Speech Recognition. ICASSP (1) 2005: 201-204
[c9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HoriN05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HoriN05
Takaaki Hori, Atsushi Nakamura:
Generalized fast on-the-fly composition algorithm for WFST-based speech recognition. INTERSPEECH 2005: 557-560
[c8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SchusterHN05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SchusterHN05
Mike Schuster, Takaaki Hori, Atsushi Nakamura:
Experiments with probabilistic principal component analysis in LVCSR. INTERSPEECH 2005: 1685-1688
2004
[c7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HoriHM04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HoriHM04
Takaaki Hori, Chiori Hori, Yasuhiro Minami:
Fast on-the-fly composition for weighted finite-state transducers in 1.8 million-word vocabulary continuous speech recognition. INTERSPEECH 2004: 289-292
2003
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/HoriHTISM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/HoriHTISM03
Chiori Hori, Takaaki Hori, Hajime Tsukada, Hideki Isozaki, Yutaka Sasaki, Eisaku Maeda:
Spoken Interactive ODQA System: SPIQA. ACL (Companion) 2003: 153-156
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HoriWM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HoriWM03
Takaaki Hori, Daniel Willett, Yasuhiro Minami:
Language model adaptation using WFST-based speaking-style translation. ICASSP (1) 2003: 228-231
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HoriHIMKF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HoriHIMKF03
Chiori Hori, Takaaki Hori, Hideki Isozaki, Eisaku Maeda, Shigeru Katagiri, Sadaoki Furui:
Deriving disambiguous queries in a spoken interactive ODQA system. ICASSP (1) 2003: 624-627
[c3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HoriHM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HoriHM03
Takaaki Hori, Chiori Hori, Yasuhiro Minami:
Speech summarization using weighted finite-state transducers. INTERSPEECH 2003: 2817-2820
[c2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HoriHF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HoriHF03
Chiori Hori, Takaaki Hori, Sadaoki Furui:
Evaluation method for automatic speech summarization. INTERSPEECH 2003: 2825-2828
2001
[c1]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HoriNM01
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HoriNM01
Takaaki Hori, Yoshiaki Noda, Shoichi Matsunaga:
Improved phoneme-history-dependent search for large-vocabulary continuous-speech recognition. INTERSPEECH 2001: 1809-1813

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.