default search action
Erik Marchi
Person information
SPARQL queries
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c49]Dominik Wagner, Alexander W. Churchill, Siddharth Sigtia, Panayiotis G. Georgiou, Matt Mirsamadi, Aarshee Mishra, Erik Marchi:
A Multimodal Approach to Device-Directed Speech Detection with Large Language Models. ICASSP 2024: 10451-10455 - [i17]Dominik Wagner, Alexander W. Churchill, Siddharth Sigtia, Panayiotis G. Georgiou, Matt Mirsamadi, Aarshee Mishra, Erik Marchi:
A Multimodal Approach to Device-Directed Speech Detection with Large Language Models. CoRR abs/2403.14438 (2024) - 2023
- [c48]Pranay Dighe, Prateeth Nayak, Oggi Rudovic, Erik Marchi, Xiaochuan Niu, Ahmed H. Tewfik:
Audio-to-Intent Using Acoustic-Textual Subword Representations from End-to-End ASR. ICASSP 2023: 1-5 - [c47]Oggi Rudovic, Wonil Chang, Vineet Garg, Pranay Dighe, Pramod Simha, Jack Berkowitz, Ahmed Hussen Abdelaziz, Sachin Kajarekar, Erik Marchi, Saurabh Adya:
Less Is More: A Unified Architecture for Device-Directed Speech Detection with Multiple Invocation Types. ICASSP 2023: 1-5 - [i16]Dominik Wagner, Alexander W. Churchill, Siddharth Sigtia, Panayiotis G. Georgiou, Matt Mirsamadi, Aarshee Mishra, Erik Marchi:
Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models. CoRR abs/2312.03632 (2023) - 2022
- [c46]Vineet Garg, Ognjen Rudovic, Pranay Dighe, Ahmed Hussen Abdelaziz, Erik Marchi, Saurabh Adya, Chandra Dhir, Ahmed H. Tewfik:
Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models. INTERSPEECH 2022: 1258-1262 - [c45]Prateeth Nayak, Takuya Higuchi, Anmol Gupta, Shivesh Ranjan, Stephen Shum, Siddharth Sigtia, Erik Marchi, Varun Lakshminarasimhan, Minsik Cho, Saurabh Adya, Chandra Dhir, Ahmed H. Tewfik:
Improving Voice Trigger Detection with Metric Learning. INTERSPEECH 2022: 1896-1900 - [i15]Vin Sachidananda, Shao-Yen Tseng, Erik Marchi, Sachin Kajarekar, Panayiotis G. Georgiou:
CALM: Contrastive Aligned Audio-Language Multirate and Multimodal Representations. CoRR abs/2202.03587 (2022) - [i14]Vineet Garg, Ognjen Rudovic, Pranay Dighe, Ahmed Hussen Abdelaziz, Erik Marchi, Saurabh Adya, Chandra Dhir, Ahmed H. Tewfik:
Device-Directed Speech Detection: Regularization via Distillation for Weakly-Supervised Models. CoRR abs/2203.15975 (2022) - [i13]Prateeth Nayak, Takuya Higuchi, Anmol Gupta, Shivesh Ranjan, Stephen Shum, Siddharth Sigtia, Erik Marchi, Varun Lakshminarasimhan, Minsik Cho, Saurabh Adya, Chandra Dhir, Ahmed H. Tewfik:
Improving Voice Trigger Detection with Metric Learning. CoRR abs/2204.02455 (2022) - [i12]Pranay Dighe, Prateeth Nayak, Oggi Rudovic, Erik Marchi, Xiaochuan Niu, Ahmed H. Tewfik:
Audio-to-Intent Using Acoustic-Textual Subword Representations from End-to-End ASR. CoRR abs/2210.12134 (2022) - 2021
- [c44]Pranay Dighe, Erik Marchi, Srikanth Vishnubhotla, Sachin Kajarekar, Devang Naik:
Knowledge Transfer for Efficient on-Device False Trigger Mitigation. ICASSP 2021: 6838-6842 - [c43]Siddharth Sigtia, John Bridle, Hywel Richards, Pascal Clark, Erik Marchi, Vineet Garg:
Progressive Voice Trigger Detection: Accuracy vs Latency. ICASSP 2021: 6843-6847 - [c42]Zakaria Aldeneh, Anushree Prasanna Kumar, Barry-John Theobald, Erik Marchi, Sachin Kajarekar, Devang Naik, Ahmed Hussen Abdelaziz:
On The Role of Visual Cues in Audiovisual Speech Enhancement. ICASSP 2021: 8423-8427 - [c41]Qiong Hu, Tobias Bleisch, Petko Petkov, Tuomo Raitio, Erik Marchi, Varun Lakshminarasimhan:
Whispered and Lombard Neural Speech Synthesis. SLT 2021: 454-461 - [e1]Erik Marchi, Sabato Marco Siniscalchi, Sandro Cumani, Valerio Mario Salerno, Haizhou Li:
Increasing Naturalness and Flexibility in Spoken Dialogue Interaction - 10th International Workshop on Spoken Dialogue Systems, IWSDS 2019, Syracuse, Sicily, Italy, 24-26 April 2019. Lecture Notes in Electrical Engineering 714, Springer 2021, ISBN 978-981-15-9322-2 [contents] - [i11]Qiong Hu, Tobias Bleisch, Petko Petkov, Tuomo Raitio, Erik Marchi, Varun Lakshminarasimhan:
Whispered and Lombard Neural Speech Synthesis. CoRR abs/2101.05313 (2021) - 2020
- [c40]Siddharth Sigtia, Erik Marchi, Sachin Kajarekar, Devang Naik, John Bridle:
Multi-Task Learning for Speaker Verification and Voice Trigger Detection. ICASSP 2020: 6844-6848 - [c39]Vasudha Kowtha, Vikramjit Mitra, Chris Bartels, Erik Marchi, Sue Booker, William Caruso, Sachin Kajarekar, Devang Naik:
Detecting Emotion Primitives from Speech and Their Use in Discerning Categorical Emotions. ICASSP 2020: 7164-7168 - [c38]Soumi Maiti, Erik Marchi, Alistair Conkie:
Generating Multilingual Voices Using Speaker Space Translation Based on Bilingual Speaker Data. ICASSP 2020: 7624-7628 - [i10]Siddharth Sigtia, Erik Marchi, Sachin Kajarekar, Devang Naik, John Bridle:
Multi-task Learning for Speaker Verification and Voice Trigger Detection. CoRR abs/2001.10816 (2020) - [i9]Vasudha Kowtha, Vikramjit Mitra, Chris Bartels, Erik Marchi, Sue Booker, William Caruso, Sachin Kajarekar, Devang Naik:
Detecting Emotion Primitives from Speech and their use in discerning Categorical Emotions. CoRR abs/2002.01323 (2020) - [i8]Soumi Maiti, Erik Marchi, Alistair Conkie:
Generating Multilingual Voices Using Speaker Space Translation Based on Bilingual Speaker Data. CoRR abs/2004.04972 (2020) - [i7]Zakaria Aldeneh, Anushree Prasanna Kumar, Barry-John Theobald, Erik Marchi, Sachin Kajarekar, Devang Naik, Ahmed Hussen Abdelaziz:
Self-supervised Learning of Visual Speech Features with Audiovisual Speech Enhancement. CoRR abs/2004.12031 (2020) - [i6]Pranay Dighe, Erik Marchi, Srikanth Vishnubhotla, Sachin Kajarekar, Devang Naik:
Knowledge Transfer for Efficient On-device False Trigger Mitigation. CoRR abs/2010.10591 (2020) - [i5]Siddharth Sigtia, John Bridle, Hywel Richards, Pascal Clark, Erik Marchi, Vineet Garg:
Progressive Voice Trigger Detection: Accuracy vs Latency. CoRR abs/2010.15446 (2020)
2010 – 2019
- 2019
- [b1]Erik Marchi:
Automatic Emotion Recognition in the Voice of Children with Autism Spectrum Conditions. Technical University of Munich, Germany, Verlag Dr. Hut 2019, ISBN 978-3-8439-4283-6, pp. 1-142 - [j4]Björn W. Schuller, Felix Weninger, Yue Zhang, Fabien Ringeval, Anton Batliner, Stefan Steidl, Florian Eyben, Erik Marchi, Alessandro Vinciarelli, Klaus R. Scherer, Mohamed Chetouani, Marcello Mortillaro:
Affective and behavioural computing: Lessons learnt from the First Computational Paralinguistics Challenge. Comput. Speech Lang. 53: 156-180 (2019) - [j3]Erik Marchi, Tadas Baltrusaitis, Andra Adams, Marwa Mahmoud, Ofer Golan, Shimrit Fridenson-Hayo, Shahar Tal, Shai Newman, Noga Meir-Goren, Antonio Camurri, Stefano Piana, Björn W. Schuller, Sven Bölte, T. Metin Sezgin, Nese Alyüz, Agnieszka Rynkiewicz, Aurelie Baranger, Alice Baird, Simon Baron-Cohen, Amandine Lassalle, Helen O'Reilly, Delia Pigat, Peter Robinson, Ian Davies:
The ASC-Inclusion Perceptual Serious Gaming Platform for Autistic Children. IEEE Trans. Games 11(4): 328-339 (2019) - [c37]Vikramjit Mitra, Sue Booker, Erik Marchi, David Scott Farrar, Ute Dorothea Peitz, Bridget Cheng, Ermine Teves, Anuj Mehta, Devang Naik:
Leveraging Acoustic Cues and Paralinguistic Embeddings to Detect Expression from Voice. INTERSPEECH 2019: 1651-1655 - [c36]Qiong Hu, Erik Marchi, David Winarsky, Yannis Stylianou, Devang Naik, Sachin Kajarekar:
Neural Text-to-Speech Adaptation from Low Quality Public Recordings. SSW 2019: 24-28 - [i4]Vikramjit Mitra, Sue Booker, Erik Marchi, David Scott Farrar, Ute Dorothea Peitz, Bridget Cheng, Ermine Teves, Anuj Mehta, Devang Naik:
Leveraging Acoustic Cues and Paralinguistic Embeddings to Detect Expression from Voice. CoRR abs/1907.00112 (2019) - 2018
- [c35]Erik Marchi, Stephen Shum, Kvuveon Hwang, Sachin Kajarekar, Siddharth Sigtia, Hywel Richards, Rob Haynes, Yoon Kim, John Bridle:
Generalised Discriminative Transform via Curriculum Learning for Speaker Recognition. ICASSP 2018: 5324-5328 - [c34]Siddharth Sigtia, Rob Haynes, Hywel Richards, Erik Marchi, John Bridle:
Efficient Voice Trigger Detection for Low Resource Hardware. INTERSPEECH 2018: 2092-2096 - 2017
- [j2]Erik Marchi, Fabio Vesperini, Stefano Squartini, Björn W. Schuller:
Deep Recurrent Neural Network-Based Autoencoders for Acoustic Novelty Detection. Comput. Intell. Neurosci. 2017: 4694860:1-4694860:14 (2017) - [c33]Gil Keren, Tobias Kirschstein, Erik Marchi, Fabien Ringeval, Björn W. Schuller:
End-to-end learning for dimensional emotion recognition from physiological signals. ICME 2017: 985-990 - 2016
- [j1]Sascha Frühholz, Erik Marchi, Björn W. Schuller:
The Effect of Narrow-Band Transmission on Recognition of Paralinguistic Information From Human Vocalizations. IEEE Access 4: 6059-6072 (2016) - [c32]Maximilian Schmitt, Erik Marchi, Fabien Ringeval, Björn W. Schuller:
Towards Cross-lingual Automatic Diagnosis of Autism Spectrum Condition in Children's Voices. ITG Symposium on Speech Communication 2016: 1-5 - [c31]Erik Marchi, Dario Tonelli, Xinzhou Xu, Fabien Ringeval, Jun Deng, Stefano Squartini, Björn W. Schuller:
Pairwise Decomposition with Deep Neural Networks and Multiscale Kernel Subspace Learning for Acoustic Scene Classification. DCASE 2016: 65-69 - [c30]Zixing Zhang, Fabien Ringeval, Bin Dong, Eduardo Coutinho, Erik Marchi, Björn W. Schuller:
Enhanced semi-supervised learning for multimodal emotion recognition. ICASSP 2016: 5185-5189 - [c29]George Trigeorgis, Fabien Ringeval, Raymond Brueckner, Erik Marchi, Mihalis A. Nicolaou, Björn W. Schuller, Stefanos Zafeiriou:
Adieu features? End-to-end speech emotion recognition using a deep convolutional recurrent network. ICASSP 2016: 5200-5204 - [c28]Irman Abdic, Lex Fridman, Daniel E. Brown, William Angell, Bryan Reimer, Erik Marchi, Björn W. Schuller:
Detecting road surface wetness from audio: A deep learning approach. ICPR 2016: 3458-3463 - [c27]Irman Abdic, Lex Fridman, Daniel McDuff, Erik Marchi, Bryan Reimer, Björn W. Schuller:
Driver Frustration Detection from Audio and Video in the Wild. IJCAI 2016: 1354-1360 - [c26]Felix Weninger, Fabien Ringeval, Erik Marchi, Björn W. Schuller:
Discriminatively Trained Recurrent Neural Networks for Continuous Dimensional Emotion Recognition from Audio. IJCAI 2016: 2196-2202 - [c25]Erik Marchi, Florian Eyben, Gerhard Hagerer, Björn W. Schuller:
Real-Time Tracking of Speakers' Emotions, States, and Traits on Mobile Platforms. INTERSPEECH 2016: 1182-1183 - [c24]Fabien Ringeval, Erik Marchi, Charline Grossard, Jean Xavier, Mohamed Chetouani, David Cohen, Björn W. Schuller:
Automatic Analysis of Typical and Atypical Encoding of Spontaneous Emotion in the Voice of Children. INTERSPEECH 2016: 1210-1214 - [c23]Shahin Amiriparian, Jouni Pohjalainen, Erik Marchi, Sergey Pugachevskiy, Björn W. Schuller:
Is Deception Emotional? An Emotion-Driven Predictive Approach. INTERSPEECH 2016: 2011-2015 - [c22]Hesam Sagha, Pavel Matejka, Maryna Gavryukova, Filip Povolný, Erik Marchi, Björn W. Schuller:
Enhancing Multilingual Recognition of Emotion in Speech by Language Identification. INTERSPEECH 2016: 2949-2953 - [c21]Zixing Zhang, Fabien Ringeval, Jing Han, Jun Deng, Erik Marchi, Björn W. Schuller:
Facing Realism in Spontaneous Emotion Recognition from Speech: Feature Enhancement by Autoencoder with LSTM Neural Networks. INTERSPEECH 2016: 3593-3597 - [c20]Simone Hantke, Erik Marchi, Björn W. Schuller:
Introducing the Weighted Trustability Evaluator for Crowdsourcing Exemplified by Speaker Likability Classification. LREC 2016 - 2015
- [c19]Nicolas Sabouret, Björn W. Schuller, Lucas Paletta, Erik Marchi, Hazaël Jones, Atef Ben Youssef:
Intelligent user interfaces in digital games for empowerment and inclusion. Advances in Computer Entertainment 2015: 8:1-8:8 - [c18]Florian Eyben, Bernd Huber, Erik Marchi, Dagmar Schuller, Björn W. Schuller:
Real-time robust recognition of speakers' emotions and characteristics on mobile platforms. ACII 2015: 778-780 - [c17]Erik Marchi, Fabio Vesperini, Florian Eyben, Stefano Squartini, Björn W. Schuller:
A novel approach for automatic acoustic novelty detection using a denoising autoencoder with bidirectional LSTM neural networks. ICASSP 2015: 1996-2000 - [c16]Erik Marchi, Fabio Vesperini, Felix Weninger, Florian Eyben, Stefano Squartini, Björn W. Schuller:
Non-linear prediction with LSTM recurrent neural networks for acoustic novelty detection. IJCNN 2015: 1-7 - [c15]Erik Marchi, Björn W. Schuller, Simon Baron-Cohen, Ofer Golan, Sven Bölte, Prerna Arora, Reinhold Häb-Umbach:
Typicality and emotion in the voice of children with autism spectrum condition: evidence across three languages. INTERSPEECH 2015: 115-119 - [c14]Fabien Ringeval, Erik Marchi, Marc Mehu, Klaus R. Scherer, Björn W. Schuller:
Face reading from speech - predicting facial action units from audio cues. INTERSPEECH 2015: 1977-1981 - [c13]George Trigeorgis, Eduardo Coutinho, Fabien Ringeval, Erik Marchi, Stefanos Zafeiriou, Björn W. Schuller:
The ICL-TUM-PASSAU Approach for the MediaEval 2015 "Affective Impact of Movies" Task. MediaEval 2015 - [c12]Fabien Ringeval, Björn W. Schuller, Michel F. Valstar, Shashank Jaiswal, Erik Marchi, Denis Lalanne, Roddy Cowie, Maja Pantic:
AV+EC 2015: The First Affect Recognition Challenge Bridging Across Audio, Video, and Physiological Data. AVEC@ACM Multimedia 2015: 3-8 - [i3]Amr El-Desoky Mousa, Erik Marchi, Björn W. Schuller:
The ICSTM+TUM+UP Approach to the 3rd CHIME Challenge: Single-Channel LSTM Speech Enhancement with Multi-Channel Correlation Shaping Dereverberation and LSTM Language Models. CoRR abs/1510.00268 (2015) - [i2]Irman Abdic, Lex Fridman, Erik Marchi, Daniel E. Brown, William Angell, Bryan Reimer:
Detecting Road Surface Wetness from Audio: A Deep Learning Approach. CoRR abs/1511.07035 (2015) - 2014
- [c11]Erik Marchi, Giacomo Ferroni, Florian Eyben, Leonardo Gabrielli, Stefano Squartini, Björn W. Schuller:
Multi-resolution linear prediction based features for audio onset detection with bidirectional LSTM neural networks. ICASSP 2014: 2164-2168 - [c10]Erik Marchi, Giacomo Ferroni, Florian Eyben, Stefano Squartini, Björn W. Schuller:
Audio onset detection: A wavelet packet based approach with recurrent neural networks. IJCNN 2014: 3585-3591 - [c9]Björn W. Schuller, Stefan Steidl, Anton Batliner, Julien Epps, Florian Eyben, Fabien Ringeval, Erik Marchi, Yue Zhang:
The INTERSPEECH 2014 computational paralinguistics challenge: cognitive & physical load. INTERSPEECH 2014: 427-431 - [i1]Björn W. Schuller, Erik Marchi, Simon Baron-Cohen, Helen O'Reilly, Delia Pigat, Peter Robinson, Ian Davies:
The state of play of ASC-Inclusion: An Integrated Internet-Based Environment for Social Inclusion of Children with Autism Spectrum Conditions. CoRR abs/1403.5912 (2014) - 2013
- [c8]Jun Deng, Zixing Zhang, Erik Marchi, Björn W. Schuller:
Sparse Autoencoder-Based Feature Transfer Learning for Speech Emotion Recognition. ACII 2013: 511-516 - [c7]Björn W. Schuller, Stefan Steidl, Anton Batliner, Alessandro Vinciarelli, Klaus R. Scherer, Fabien Ringeval, Mohamed Chetouani, Felix Weninger, Florian Eyben, Erik Marchi, Marcello Mortillaro, Hugues Salamin, Anna Polychroniou, Fabio Valente, Samuel Kim:
The INTERSPEECH 2013 computational paralinguistics challenge: social signals, conflict, emotion, autism. INTERSPEECH 2013: 148-152 - [c6]Zixing Zhang, Jun Deng, Erik Marchi, Björn W. Schuller:
Active learning by label uncertainty for acoustic emotion recognition. INTERSPEECH 2013: 2856-2860 - [c5]Florian Eyben, Felix Weninger, Erik Marchi, Björn W. Schuller:
Likability of human voices: A feature analysis and a neural network regression approach to automatic likability estimation. WIAMIS 2013: 1-4 - 2012
- [c4]Felix Weninger, Erik Marchi, Björn W. Schuller:
Improving Recognition of Speaker States and Traits by Cumulative Evidence: Intoxication, Sleepiness, Age and Gender. INTERSPEECH 2012: 1159-1162 - [c3]Erik Marchi, Anton Batliner, Björn W. Schuller, Shimrit Fridenzon, Shahar Tal, Ofer Golan:
Speech, Emotion, Age, Language, Task, and Typicality: Trying to Disentangle Performance and Feature Relevance. SocialCom/PASSAT 2012: 961-968 - [c2]Erik Marchi, Björn W. Schuller, Anton Batliner, Shimrit Fridenzon, Shahar Tal, Ofer Golan:
Emotion in the speech of children with autism spectrum conditions: prosody and everything else. WOCCI 2012: 17-24 - 2011
- [c1]Martin Wöllmer, Erik Marchi, Stefano Squartini, Björn W. Schuller:
Robust Multi-stream Keyword and Non-linguistic Vocalization Detection for Computationally Intelligent Virtual Agents. ISNN (2) 2011: 496-505
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-08-29 20:59 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint