default search action
Jean-Marc Valin
Person information
SPARQL queries
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j9]Jean-Marc Valin, Ahmed Mustafa, Jan Büthe:
Very Low Complexity Speech Synthesis Using Framewise Autoregressive GAN (FARGAN) With Pitch Prediction. IEEE Signal Process. Lett. 31: 2115-2119 (2024) - [c58]Masahito Togami, Jean-Marc Valin, Karim Helwani, Ritwik Giri, Umut Isik, Michael M. Goodwin:
Real-Time Stereo Speech Enhancement with Spatial-Cue Preservation Based on Dual-Path Structure. ICASSP 2024: 71-75 - [c57]Jan Büthe, Ahmed Mustafa, Jean-Marc Valin, Karim Helwani, Michael M. Goodwin:
NOLACE: Improving Low-Complexity Speech Codec Enhancement Through Adaptive Temporal Shaping. ICASSP 2024: 476-480 - [c56]Krishna Subramani, Jean-Marc Valin, Jan Büthe, Paris Smaragdis, Mike Goodwin:
Noise-Robust DSP-Assisted Neural Pitch Estimation With Very Low Complexity. ICASSP 2024: 11851-11855 - 2023
- [c55]Ahmed Mustafa, Jean-Marc Valin, Jan Büthe, Paris Smaragdis, Mike Goodwin:
Framewise Wavegan: High Speed Adversarial Vocoder In Time Domain With Very Low Computational Complexity. ICASSP 2023: 1-5 - [c54]Jean-Marc Valin, Jan Büthe, Ahmed Mustafa:
Low-Bitrate Redundancy Coding of Speech Using A Rate-Distortion-Optimized Variational Autoencoder. ICASSP 2023: 1-5 - [c53]Zhepei Wang, Ritwik Giri, Devansh Shah, Jean-Marc Valin, Michael M. Goodwin, Paris Smaragdis:
A Framework for Unified Real-Time Personalized and Non-Personalized Speech Enhancement. ICASSP 2023: 1-5 - [c52]Jan Büthe, Jean-Marc Valin, Ahmed Mustafa:
Lace: A Light-Weight, Causal Model for Enhancing Coded Speech Through Adaptive Convolutions. WASPAA 2023: 1-5 - [i54]Zhepei Wang, Ritwik Giri, Devansh Shah, Jean-Marc Valin, Michael M. Goodwin, Paris Smaragdis:
A Framework for Unified Real-time Personalized and Non-Personalized Speech Enhancement. CoRR abs/2302.11768 (2023) - [i53]Krishna Subramani, Jean-Marc Valin, Jan Büthe, Paris Smaragdis, Mike Goodwin:
Noise-Robust DSP-Assisted Neural Pitch Estimation with Very Low Complexity. CoRR abs/2309.14507 (2023) - [i52]Jan Büthe, Ahmed Mustafa, Jean-Marc Valin, Karim Helwani, Michael M. Goodwin:
NoLACE: Improving Low-Complexity Speech Codec Enhancement Through Adaptive Temporal Shaping. CoRR abs/2309.14521 (2023) - 2022
- [c51]Siyuan Yuan, Zhepei Wang, Umut Isik, Ritwik Giri, Jean-Marc Valin, Michael M. Goodwin, Arvindh Krishnaswamy:
Improved Singing Voice Separation with Chromagram-Based Pitch-Aware Remixing. ICASSP 2022: 111-115 - [c50]Jean-Marc Valin, Umut Isik, Paris Smaragdis, Arvindh Krishnaswamy:
Neural Speech Synthesis on a Shoestring: Improving the Efficiency of Lpcnet. ICASSP 2022: 8437-8441 - [c49]Jean-Marc Valin, Ahmed Mustafa, Christopher Montgomery, Timothy B. Terriberry, Michael Klingbeil, Paris Smaragdis, Arvindh Krishnaswamy:
Real-Time Packet Loss Concealment With Mixed Generative and Predictive Model. INTERSPEECH 2022: 570-574 - [c48]Krishna Subramani, Jean-Marc Valin, Umut Isik, Paris Smaragdis, Arvindh Krishnaswamy:
End-to-end LPCNet: A Neural Vocoder With Fully-Differentiable LPC Estimation. INTERSPEECH 2022: 818-822 - [i51]Jean-Marc Valin, Umut Isik, Paris Smaragdis, Arvindh Krishnaswamy:
Neural Speech Synthesis on a Shoestring: Improving the Efficiency of LPCNet. CoRR abs/2202.11169 (2022) - [i50]Krishna Subramani, Jean-Marc Valin, Umut Isik, Paris Smaragdis, Arvindh Krishnaswamy:
End-to-end LPCNet: A Neural Vocoder With Fully-Differentiable LPC Estimation. CoRR abs/2202.11301 (2022) - [i49]Siyuan Yuan, Zhepei Wang, Umut Isik, Ritwik Giri, Jean-Marc Valin, Michael M. Goodwin, Arvindh Krishnaswamy:
Improved singing voice separation with chromagram-based pitch-aware remixing. CoRR abs/2203.15092 (2022) - [i48]Jean-Marc Valin, Ahmed Mustafa, Christopher Montgomery, Timothy B. Terriberry, Michael Klingbeil, Paris Smaragdis, Arvindh Krishnaswamy:
Real-Time Packet Loss Concealment With Mixed Generative and Predictive Model. CoRR abs/2205.05785 (2022) - [i47]Jean-Marc Valin, Ritwik Giri, Shrikant Venkataramani, Umut Isik, Arvindh Krishnaswamy:
To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets. CoRR abs/2206.07917 (2022) - [i46]Zhepei Wang, Ritwik Giri, Shrikant Venkataramani, Umut Isik, Jean-Marc Valin, Paris Smaragdis, Michael M. Goodwin, Arvindh Krishnaswamy:
Semi-supervised Time Domain Target Speaker Extraction with Attention. CoRR abs/2206.09072 (2022) - [i45]Ahmed Mustafa, Jean-Marc Valin, Jan Büthe, Paris Smaragdis, Mike Goodwin:
Framewise WaveGAN: High Speed Adversarial Vocoder in Time Domain with Very Low Computational Complexity. CoRR abs/2212.04532 (2022) - 2021
- [c47]Zhepei Wang, Ritwik Giri, Umut Isik, Jean-Marc Valin, Arvindh Krishnaswamy:
Semi-Supervised Singing Voice Separation With Noisy Self-Training. ICASSP 2021: 31-35 - [c46]Jonah Casebeer, Vinjai Vale, Umut Isik, Jean-Marc Valin, Ritwik Giri, Arvindh Krishnaswamy:
Enhancing into the Codec: Noise Robust Speech Coding with Vector-Quantized Autoencoders. ICASSP 2021: 711-715 - [c45]Jean-Marc Valin, Srikanth V. Tenneti, Karim Helwani, Umut Isik, Arvindh Krishnaswamy:
Low-Complexity, Real-Time Joint Neural Echo Control and Speech Enhancement Based On Percepnet. ICASSP 2021: 7133-7137 - [c44]Ritwik Giri, Shrikant Venkataramani, Jean-Marc Valin, Umut Isik, Arvindh Krishnaswamy:
Personalized PercepNet: Real-Time, Low-Complexity Target Voice Separation and Enhancement. Interspeech 2021: 1124-1128 - [c43]Lukas Drude, Jahn Heymann, Andreas Schwarz, Jean-Marc Valin:
Multi-Channel Opus Compression for Far-Field Automatic Speech Recognition with a Fixed Bitrate Budget. Interspeech 2021: 1669-1673 - [i44]Jonah Casebeer, Vinjai Vale, Umut Isik, Jean-Marc Valin, Ritwik Giri, Arvindh Krishnaswamy:
Enhancing into the codec: Noise Robust Speech Coding with Vector-Quantized Autoencoders. CoRR abs/2102.06610 (2021) - [i43]Lukas Drude, Jahn Heymann, Andreas Schwarz, Jean-Marc Valin:
Multi-channel Opus compression for far-field automatic speech recognition with a fixed bitrate budget. CoRR abs/2106.07994 (2021) - 2020
- [c42]Jean-Marc Valin, Umut Isik, Neerad Phansalkar, Ritwik Giri, Karim Helwani, Arvindh Krishnaswamy:
A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech. INTERSPEECH 2020: 2482-2486 - [c41]Umut Isik, Ritwik Giri, Neerad Phansalkar, Jean-Marc Valin, Karim Helwani, Arvindh Krishnaswamy:
PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss. INTERSPEECH 2020: 2487-2491 - [c40]Jan Skoglund, Jean-Marc Valin:
Improving Opus Low Bit Rate Quality with Neural Speech Synthesis. INTERSPEECH 2020: 2847-2851 - [i42]Umut Isik, Ritwik Giri, Neerad Phansalkar, Jean-Marc Valin, Karim Helwani, Arvindh Krishnaswamy:
PoCoNet: Better Speech Enhancement with Frequency-Positional Embeddings, Semi-Supervised Conversational Data, and Biased Loss. CoRR abs/2008.04470 (2020)
2010 – 2019
- 2019
- [c39]Jean-Marc Valin, Jan Skoglund:
LPCNET: Improving Neural Speech Synthesis through Linear Prediction. ICASSP 2019: 5891-5895 - [c38]Jean-Marc Valin, Jan Skoglund:
A Real-Time Wideband Neural Vocoder at 1.6kb/s Using LPCNet. INTERSPEECH 2019: 3406-3410 - [i41]Jean-Marc Valin, Jan Skoglund:
A Real-Time Wideband Neural Vocoder at 1.6 kb/s Using LPCNet. CoRR abs/1903.12087 (2019) - [i40]Jan Skoglund, Jean-Marc Valin:
Improving Opus Low Bit Rate Quality with Neural Speech Synthesis. CoRR abs/1905.04628 (2019) - 2018
- [c37]Steinar Midtskogen, Jean-Marc Valin:
The Av1 Constrained Directional Enhancement Filter (Cdef). ICASSP 2018: 1193-1197 - [c36]Jean-Marc Valin:
A Hybrid DSP/Deep Learning Approach to Real-Time Full-Band Speech Enhancement. MMSP 2018: 1-5 - [c35]Yue Chen, Debargha Mukherjee, Jingning Han, Adrian Grange, Yaowu Xu, Zoe Liu, Sarah Parker, Cheng Chen, Hui Su, Urvang Joshi, Ching-Han Chiang, Yunqing Wang, Paul Wilkins, Jim Bankoski, Luc N. Trudeau, Nathan E. Egge, Jean-Marc Valin, Thomas Davies, Steinar Midtskogen, Andrey Norkin, Peter De Rivaz:
An Overview of Core Coding Tools in the AV1 Video Codec. PCS 2018: 41-45 - [i39]Jean-Marc Valin, Jan Skoglund:
LPCNet: Improving Neural Speech Synthesis Through Linear Prediction. CoRR abs/1810.11846 (2018) - 2017
- [i38]Jean-Marc Valin:
A Hybrid DSP/Deep Learning Approach to Real-Time Full-Band Speech Enhancement. CoRR abs/1709.08243 (2017) - [i37]Jean-Marc Valin, Koen Vos:
Updates to the Opus Audio Codec. RFC 8251: 1-12 (2017) - 2016
- [c34]Jean-Marc Valin, Nathan E. Egge, Thomas J. Daede, Timothy B. Terriberry, Christopher Montgomery:
Daala: A perceptually-driven still picture codec. ICIP 2016: 76-80 - [c33]Jean-Marc Valin, Timothy B. Terriberry, Nathan E. Egge, Thomas J. Daede, Yushin Cho, Christopher Montgomery, Michael Bebenita:
Daala: Building a next-generation video codec from unconventional technology. MMSP 2016: 1-6 - [i36]Jean-Marc Valin, Gregory Maxwell, Timothy B. Terriberry, Koen Vos:
High-Quality, Low-Delay Music Coding in the Opus Codec. CoRR abs/1602.04845 (2016) - [i35]Jean-Marc Valin, Timothy B. Terriberry:
Perceptual Vector Quantization For Video Coding. CoRR abs/1602.05209 (2016) - [i34]Jean-Marc Valin, Timothy B. Terriberry, Gregory Maxwell:
A Full-Bandwidth Audio Codec With Low Complexity And Very Low Delay. CoRR abs/1602.05311 (2016) - [i33]Jean-Marc Valin, Timothy B. Terriberry, Christopher Montgomery, Gregory Maxwell:
A High-Quality Speech and Audio Codec With Less Than 10 ms Delay. CoRR abs/1602.05526 (2016) - [i32]Jean-Marc Valin, Daniel V. Smith, Christopher Montgomery, Timothy B. Terriberry:
An Iterative Linearised Solution to the Sinusoidal Parameter Estimation Problem. CoRR abs/1602.05900 (2016) - [i31]Jean-Marc Valin:
The Daala Directional Deringing Filter. CoRR abs/1602.05975 (2016) - [i30]Jean-Marc Valin, Shun'ichi Yamamoto, Jean Rouat, François Michaud, Kazuhiro Nakadai, Hiroshi G. Okuno:
Robust Recognition of Simultaneous Speech By a Mobile Robot. CoRR abs/1602.06442 (2016) - [i29]Jean-Marc Valin:
Auditory System for a Mobile Robot. CoRR abs/1602.06652 (2016) - [i28]Jean-Marc Valin:
On Adjusting the Learning Rate in Frequency Domain Echo Cancellation With Double-Talk. CoRR abs/1602.08044 (2016) - [i27]Jean-Marc Valin, Iain B. Collings:
Interference-Normalised Least Mean Square Algorithm. CoRR abs/1602.08116 (2016) - [i26]Jean-Marc Valin, François Michaud, Jean Rouat:
Robust Localization and Tracking of Simultaneous Moving Sound Sources Using Beamforming and Particle Filtering. CoRR abs/1602.08139 (2016) - [i25]Jean-Marc Valin:
Extension spectrale d'un signal de parole de la bande téléphonique à la bande AM. CoRR abs/1602.08185 (2016) - [i24]Jean-Marc Valin, François Michaud, Jean Rouat, Dominic Létourneau:
Robust Sound Source Localization Using a Microphone Array on a Mobile Robot. CoRR abs/1602.08213 (2016) - [i23]Jean-Marc Valin, Roch Lefebvre:
Bandwidth Extension of Narrowband Speech for Low Bit-Rate Wideband Coding. CoRR abs/1602.08215 (2016) - [i22]Jean-Marc Valin, Iain B. Collings:
A New Robust Frequency Domain Echo Canceller With Closed-Loop Learning Rate Adaptation. CoRR abs/1602.08609 (2016) - [i21]Jean-Marc Valin, François Michaud, Brahim Hadjou, Jean Rouat:
Localization of Simultaneous Moving Sound Sources for Mobile Robot Using a Frequency-Domain Steered Beamformer Approach. CoRR abs/1602.08629 (2016) - [i20]Jean-Marc Valin:
Perceptually-Motivated Nonlinear Channel Decorrelation For Stereo Acoustic Echo Cancellation. CoRR abs/1602.08633 (2016) - [i19]Jean-Marc Valin:
Speex: A Free Codec For Free Speech. CoRR abs/1602.08668 (2016) - [i18]Jean-Marc Valin, Daniel V. Smith, Christopher Montgomery, Timothy B. Terriberry:
Low-Complexity Iterative Sinusoidal Parameter Estimation. CoRR abs/1603.01824 (2016) - [i17]Jean-Marc Valin, Christopher Montgomery:
Improved Noise Weighting in CELP Coding of Speech - Applying the Vorbis Psychoacoustic Model To Speex. CoRR abs/1603.01863 (2016) - [i16]Jean-Marc Valin, Jean Rouat, François Michaud:
Enhanced Robot Audition Based on Microphone Array Source Separation with Post-Filter. CoRR abs/1603.02341 (2016) - [i15]Thomas J. Daede, Nathan E. Egge, Jean-Marc Valin, Guillaume Martres, Timothy B. Terriberry:
Daala: A Perceptually-Driven Next Generation Video Codec. CoRR abs/1603.03129 (2016) - [i14]Jean-Marc Valin, Jean Rouat, François Michaud:
Microphone array post-filter for separation of simultaneous non-stationary sources. CoRR abs/1603.03215 (2016) - [i13]Jean-Marc Valin:
Channel Decorrelation For Stereo Acoustic Echo Cancellation In High-Quality Audio Communication. CoRR abs/1603.03364 (2016) - [i12]Nathan E. Egge, Jean-Marc Valin:
Predicting Chroma from Luma with Frequency Domain Intra Prediction. CoRR abs/1603.03482 (2016) - [i11]Jean-Marc Valin, François Michaud, Jean Rouat:
Robust 3D Localization and Tracking of Sound Sources Using Beamforming and Particle Filtering. CoRR abs/1604.01642 (2016) - [i10]Jean-Marc Valin, Nathan E. Egge, Thomas J. Daede, Timothy B. Terriberry, Christopher Montgomery:
Daala: A Perceptually-Driven Still Picture Codec. CoRR abs/1605.04930 (2016) - [i9]Jean-Marc Valin, Timothy B. Terriberry, Nathan E. Egge, Thomas J. Daede, Yushin Cho, Christopher Montgomery, Michael Bebenita:
Daala: Building A Next-Generation Video Codec From Unconventional Technology. CoRR abs/1608.01947 (2016) - [i8]Yushin Cho, Thomas J. Daede, Nathan E. Egge, Guillaume Martres, Tristan Matthews, Christopher Montgomery, Timothy B. Terriberry, Jean-Marc Valin:
Perceptually-Driven Video Coding with the Daala Video Codec. CoRR abs/1610.02488 (2016) - [i7]Jean-Marc Valin, Cary Bran:
WebRTC Audio Codec and Processing Requirements. RFC 7874: 1-7 (2016) - 2015
- [c32]Nathan E. Egge, Jean-Marc Valin:
Predicting chroma from luma with frequency domain intra prediction. Visual Information Processing and Communication 2015: 941008 - [c31]Jean-Marc Valin, Timothy B. Terriberry:
Perceptual vector quantization for video coding. Visual Information Processing and Communication 2015: 941009 - [i6]Julian Spittka, Koen Vos, Jean-Marc Valin:
RTP Payload Format for the Opus Speech and Audio Codec. RFC 7587: 1-18 (2015) - 2012
- [c30]Maxime Fréchette, Dominic Létourneau, Jean-Marc Valin, François Michaud:
Integration of sound source localization and separation to improve Dialogue Management on a robot. IROS 2012: 2358-2363 - [i5]Colin Perkins, Jean-Marc Valin:
Guidelines for the Use of Variable Bit Rate Audio with Secure RTP. RFC 6562: 1-6 (2012) - [i4]Jean-Marc Valin, Slava Borilin, Koen Vos, Christopher Montgomery, Raymond (Juin-Hwey) Chen:
Guidelines for Development of an Audio Codec within the IETF. RFC 6569: 1-14 (2012) - [i3]Jean-Marc Valin, Koen Vos, Timothy B. Terriberry:
Definition of the Opus Audio Codec. RFC 6716: 1-326 (2012) - 2011
- [i2]Jean-Marc Valin, Koen Vos:
Requirements for an Internet Audio Codec. RFC 6366: 1-17 (2011) - 2010
- [j8]Jean-Marc Valin, Daniel V. Smith, Christopher Montgomery, Timothy B. Terriberry:
An iterative linearised solution to the sinusoidal parameter estimation problem. Comput. Electr. Eng. 36(4): 603-616 (2010) - [j7]Jean-Marc Valin, Timothy B. Terriberry, Christopher Montgomery, Gregory Maxwell:
A High-Quality Speech and Audio Codec With Less Than 10-ms Delay. IEEE Trans. Speech Audio Process. 18(1): 58-67 (2010)
2000 – 2009
- 2009
- [c29]Jean-Marc Valin, Timothy B. Terriberry, Gregory Maxwell:
A full-bandwidth audio codec with low complexity and very low delay. EUSIPCO 2009: 1254-1258 - [c28]Fariza Sabrina, Jean-Marc Valin:
Priority Based Dynamic Rate Control for VoIP Traffic. GLOBECOM 2009: 1-8 - [c27]Daniel J. Ryan, Iain B. Collings, Jean-Marc Valin:
Reflected Simplex Codebooks for Limited Feedback MIMO Beamforming. ICC 2009: 1-5 - [c26]Anthony P. Badali, Jean-Marc Valin, François Michaud, Parham Aarabi:
Evaluating real-time audio localization algorithms for artificial audition in robotics. IROS 2009: 2033-2038 - [i1]Greg Herlein, Jean-Marc Valin, Alfred E. Heggestad, Aymeric Moizard:
RTP Payload Format for the Speex Codec. RFC 5574: 1-14 (2009) - 2008
- [c25]Fariza Sabrina, Jean-Marc Valin:
Adaptive Rate Control for Aggregated VoIP Traffic. GLOBECOM 2008: 1405-1410 - [c24]Simon Brière, Jean-Marc Valin, François Michaud, Dominic Létourneau:
Embedded auditory system for small mobile robots. ICRA 2008: 3463-3468 - 2007
- [j6]François Michaud, Carle Côté, Dominic Létourneau, Yannick Brosseau, Jean-Marc Valin, Eric Beaudry, Clément Raïevsky, Arnaud Ponchon, Pierre Moisan, Pierre Lepage, Yan Morin, Frédéric Gagnon, Patrick Giguère, Marc-André Roux, Serge Caron, Patrick Frenette, Froduald Kabanza:
Spartacus attending the 2005 AAAI conference. Auton. Robots 22(4): 369-383 (2007) - [j5]Jean-Marc Valin, François Michaud, Jean Rouat:
Robust localization and tracking of simultaneous moving sound sources using beamforming and particle filtering. Robotics Auton. Syst. 55(3): 216-228 (2007) - [j4]Jean-Marc Valin, Iain B. Collings:
Interference-Normalized Least Mean Square Algorithm. IEEE Signal Process. Lett. 14(12): 988-991 (2007) - [j3]Jean-Marc Valin:
On Adjusting the Learning Rate in Frequency Domain Echo Cancellation With Double-Talk. IEEE Trans. Speech Audio Process. 15(3): 1030-1034 (2007) - [j2]Jean-Marc Valin, Seiichi Yamamoto, Jean Rouat, François Michaud, Kazuhiro Nakadai, Hiroshi G. Okuno:
Robust Recognition of Simultaneous Speech by a Mobile Robot. IEEE Trans. Robotics 23(4): 742-752 (2007) - [c23]Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Design and implementation of a robot audition system for automatic speech recognition of simultaneous speech. ASRU 2007: 111-116 - [c22]Jean-Marc Valin, Iain B. Collings:
A New Robust Frequency Domain Echo Canceller with Closed-Loop Learning Rate Adaptation. ICASSP (1) 2007: 93-96 - 2006
- [c21]Simon Brière, Dominic Létourneau, Maxime Fréchette, Jean-Marc Valin, François Michaud:
Embedded and Integrated Audition for a Mobile Robot. AAAI Fall Symposium: Aurally Informed Performance 2006: 6-10 - [c20]Jean-Marc Valin, François Michaud, Jean Rouat:
Robust 3D Localization and Tracking of Sound Sources Using Beamforming and Particle Filtering. ICASSP (4) 2006: 841-844 - [c19]Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Ryu Takeda, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Genetic Algorithm-Based Improvement of Robot Hearing Capabilities in Separating and Recognizing Simultaneous Speech Signals. IEA/AIE 2006: 207-217 - [c18]Shun'ichi Yamamoto, Ryu Takeda, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Leak energy based missing feature mask generation for ICA and GSS and its evaluation with simultaneous speech recognition. SAPA@INTERSPEECH 2006: 42-47 - [c17]Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Real-Time Robot Audition System That Recognizes Simultaneous Speech in The Real World. IROS 2006: 5333-5338 - [c16]Shun'ichi Yamamoto, Ryu Takeda, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Recognition of Simultaneous Speech by Estimating Reliability of Separated Signals for Robot Audition. PRICAI 2006: 484-494 - 2005
- [c15]François Michaud, Dominic Létourneau, Pierre Lepage, Yan Morin, Frédéric Gagnon, Patrick Giguère, Eric Beaudry, Yannick Brosseau, Carle Côté, Audrey Duquette, Jean-François Laplante, Marc-Antoine Legault, Pierre Moisan, Arnaud Ponchon, Clément Raïevsky, Marc-André Roux, Tamie Salter, Jean-Marc Valin, Serge Caron, Patrice Masson, Froduald Kabanza, Michel Lauria:
A Brochette of Socially Interactive Robots. AAAI 2005: 1733-1734 - [c14]Shun'ichi Yamamoto, Jean-Marc Valin, Kazuhiro Nakadai, Jean Rouat, François Michaud, Tetsuya Ogata, Hiroshi G. Okuno:
Enhanced Robot Speech Recognition Based on Microphone Array Source Separation and Missing Feature Theory. ICRA 2005: 1477-1482 - [c13]Masamitsu Murase, Shun'ichi Yamamoto, Jean-Marc Valin, Kazuhiro Nakadai, Kentaro Yamada, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Multiple moving speaker tracking by microphone array on mobile robot. INTERSPEECH 2005: 249-252 - [c12]Shun'ichi Yamamoto, Kazuhiro Nakadai, Jean-Marc Valin, Jean Rouat, François Michaud, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno:
Making a robot recognize three simultaneous sentences in real-time. IROS 2005: 4040-4045 - [c11]François Michaud, Yannick Brosseau, Carle Côté, Dominic Létourneau, Pierre Moisan, Arnaud Ponchon, Clément Raïevsky, Jean-Marc Valin, Eric Beaudry, Froduald Kabanza:
Modularity and integration in the design of a socially interactive robot. RO-MAN 2005: 172-177 - 2004
- [j1]Dominic Létourneau, François Michaud, Jean-Marc Valin:
Autonomous Mobile Robot That Can Read. EURASIP J. Adv. Signal Process. 2004(17): 2650-2662 (2004) - [c10]Jean-Marc Valin, Jean Rouat, François Michaud:
Microphone array post-filter for separation of simultaneous non-stationary sources. ICASSP (1) 2004: 221-224 - [c9]Jean-Marc Valin, François Michaud, Brahim Hadjou, Jean Rouat:
Localization of Simultaneous Moving Sound Sources for Mobile Robot Using a Frequency- Domain Steered Beamformer Approach. ICRA 2004: 1033-1038 - [c8]Mathieu Lemay, François Michaud, Dominic Létourneau, Jean-Marc Valin:
Autonomous Initialization of Robot Formations. ICRA 2004: 3018-3023 - [c7]Carle Côté, Dominic Létourneau, François Michaud, Jean-Marc Valin, Yannick Brosseau, Clément Raïevsky, Mathieu Lemay, Wctor Tran:
Code reusability tools for programming mobile robots. IROS 2004: 1820-1825 - [c6]Jean-Marc Valin, Jean Rouat, François Michaud:
Enhanced robot audition based on microphone array source separation with post-filter. IROS 2004: 2123-2128 - 2003
- [c5]Jean-Marc Valin, François Michaud, Jean Rouat, Dominic Létourneau:
Robust sound source localization using a microphone array on a mobile robot. IROS 2003: 1228-1233 - [c4]Dominic Létourneau, François Michaud, Jean-Marc Valin, Catherine Proulx:
Textual message read by a mobile robot. IROS 2003: 2724-2729 - [c3]Dominic Létourneau, François Michaud, Jean-Marc Valin, Catherine Proulx:
Making a mobile robot read textual messages. SMC 2003: 4236-4241 - 2002
- [c2]François Michaud, Dominic Létourneau, Matthieu Guilbert, Jean-Marc Valin:
Dynamic robot formations using directional visual perception. IROS 2002: 2740-2745
1990 – 1999
- 1999
- [c1]Stephen D. Peters, Peter Stubley, Jean-Marc Valin:
On the limits of speech recognition in noise. ICASSP 1999: 365-368
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-09-21 23:42 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint