default search action
Shiva Sundaram
Person information
Other persons with a similar name
SPARQL queries
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2023
- [c41]Georgios Paraskevopoulos, Chandrashekhar Lavania, Lovish Chum, Shiva Sundaram:
Multi-Scale Compositional Constraints for Representation Learning on Videos. ICASSP 2023: 1-5 - 2022
- [c40]Chandrashekhar Lavania, Shiva Sundaram, Sundararajan Srinivasan, Katrin Kirchhoff:
Enhancing Contrastive Learning with Temporal Cognizance for Audio-Visual Representation Generation. ICASSP 2022: 4728-4732 - [c39]Raghuveer Peri, Srinivas Parthasarathy, Shiva Sundaram:
Scene Representation Learning from Videos Using Self-Supervised and Weakly-Supervised Techniques. ICIP 2022: 1671-1675 - 2021
- [c38]Karel Mundnich, Alexandra Fenster, Aparna Khare, Shiva Sundaram:
Audiovisual Highlight Detection in Videos. ICASSP 2021: 4155-4159 - [c37]Raghuveer Peri, Srinivas Parthasarathy, Charles Bradshaw, Shiva Sundaram:
Disentanglement for Audio-Visual Emotion Recognition Using Multitask Setup. ICASSP 2021: 6344-6348 - [c36]Aparna Khare, Srinivas Parthasarathy, Shiva Sundaram:
Self-Supervised Learning with Cross-Modal Transformers for Emotion Recognition. SLT 2021: 381-388 - [c35]Srinivas Parthasarathy, Shiva Sundaram:
Detecting Expressions with Multimodal Transformers. SLT 2021: 636-643 - [i13]Karel Mundnich, Alexandra Fenster, Aparna Khare, Shiva Sundaram:
Audiovisual Highlight Detection in Videos. CoRR abs/2102.05811 (2021) - [i12]Raghuveer Peri, Srinivas Parthasarathy, Charles Bradshaw, Shiva Sundaram:
Disentanglement for audio-visual emotion recognition using multitask setup. CoRR abs/2102.06269 (2021) - 2020
- [c34]Georgios Paraskevopoulos, Srinivas Parthasarathy, Aparna Khare, Shiva Sundaram:
Multimodal and Multiresolution Speech Recognition with Transformers. ACL 2020: 2381-2387 - [c33]Taejin Park, Ken'ichi Kumatani, Minhua Wu, Shiva Sundaram:
Robust Multi-Channel Speech Recognition Using Frequency Aligned Network. ICASSP 2020: 6859-6863 - [c32]Sanna Wager, Aparna Khare, Minhua Wu, Ken'ichi Kumatani, Shiva Sundaram:
Fully Learnable Front-End for Multi-Channel Acoustic Modeling Using Semi-Supervised Learning. ICASSP 2020: 6864-6868 - [c31]Srinivas Parthasarathy, Shiva Sundaram:
Training Strategies to Handle Missing Modalities for Audio-Visual Expression Recognition. ICMI Companion 2020: 400-404 - [c30]Aparna Khare, Srinivas Parthasarathy, Shiva Sundaram:
Multi-Modal Embeddings Using Multi-Task Learning for Emotion Recognition. INTERSPEECH 2020: 384-388 - [i11]Aparna Khare, Shiva Sundaram, Minhua Wu:
Multi-channel Acoustic Modeling using Mixed Bitrate OPUS Compression. CoRR abs/2002.00122 (2020) - [i10]Sanna Wager, Aparna Khare, Minhua Wu, Ken'ichi Kumatani, Shiva Sundaram:
Fully Learnable Front-End for Multi-Channel Acoustic Modeling using Semi-Supervised Learning. CoRR abs/2002.00125 (2020) - [i9]Taejin Park, Ken'ichi Kumatani, Minhua Wu, Shiva Sundaram:
Robust Multi-channel Speech Recognition using Frequency Aligned Network. CoRR abs/2002.02520 (2020) - [i8]Georgios Paraskevopoulos, Srinivas Parthasarathy, Aparna Khare, Shiva Sundaram:
Multiresolution and Multimodal Speech Recognition with Transformers. CoRR abs/2004.14840 (2020) - [i7]Aparna Khare, Srinivas Parthasarathy, Shiva Sundaram:
Multi-modal embeddings using multi-task learning for emotion recognition. CoRR abs/2009.05019 (2020) - [i6]Srinivas Parthasarathy, Shiva Sundaram:
Training Strategies to Handle Missing Modalities for Audio-Visual Expression Recognition. CoRR abs/2010.00734 (2020) - [i5]Aparna Khare, Srinivas Parthasarathy, Shiva Sundaram:
Self-Supervised learning with cross-modal transformers for emotion recognition. CoRR abs/2011.10652 (2020) - [i4]Srinivas Parthasarathy, Shiva Sundaram:
Detecting expressions with multimodal transformers. CoRR abs/2012.00063 (2020)
2010 – 2019
- 2019
- [c29]Ladislav Mosner, Minhua Wu, Anirudh Raju, Sree Hari Krishnan Parthasarathi, Ken'ichi Kumatani, Shiva Sundaram, Roland Maas, Björn Hoffmeister:
Improving Noise Robustness of Automatic Speech Recognition via Parallel Data and Teacher-student Learning. ICASSP 2019: 6475-6479 - [c28]Ken'ichi Kumatani, Minhua Wu, Shiva Sundaram, Nikko Ström, Björn Hoffmeister:
Multi-geometry Spatial Acoustic Modeling for Distant Speech Recognition. ICASSP 2019: 6635-6639 - [c27]Minhua Wu, Ken'ichi Kumatani, Shiva Sundaram, Nikko Ström, Björn Hoffmeister:
Frequency Domain Multi-channel Acoustic Modeling for Distant Speech Recognition. ICASSP 2019: 6640-6644 - [i3]Ladislav Mosner, Minhua Wu, Anirudh Raju, Sree Hari Krishnan Parthasarathi, Ken'ichi Kumatani, Shiva Sundaram, Roland Maas, Björn Hoffmeister:
Improving noise robustness of automatic speech recognition via parallel data and teacher-student learning. CoRR abs/1901.02348 (2019) - [i2]Minhua Wu, Ken'ichi Kumatani, Shiva Sundaram, Nikko Strom, Björn Hoffmeister:
Frequency Domain Multi-channel Acoustic Modeling for Distant Speech Recognition. CoRR abs/1903.05299 (2019) - [i1]Ken'ichi Kumatani, Minhua Wu, Shiva Sundaram, Nikko Strom, Björn Hoffmeister:
Multi-Geometry Spatial Acoustic Modeling for Distant Speech Recognition. CoRR abs/1903.06539 (2019) - 2018
- [c26]Constantinos Papayiannis, Justice Amoh, Viktor Rozgic, Shiva Sundaram, Chao Wang:
Detecting Media Sound Presence in Acoustic Scenes. INTERSPEECH 2018: 1363-1367 - 2013
- [j2]Gaël Richard, Shiva Sundaram, Shrikanth S. Narayanan:
An Overview on Perceptually Motivated Audio Indexing and Classification. Proc. IEEE 101(9): 1939-1954 (2013) - [c25]Nikos Malandrakis, Shiva Sundaram, Alexandros Potamianos:
Affective classification of generic audio clips using regression models. INTERSPEECH 2013: 2832-2836 - 2012
- [j1]Tara N. Sainath, Bhuvana Ramabhadran, David Nahamoo, Dimitri Kanevsky, Dirk Van Compernolle, Kris Demuynck, Jort F. Gemmeke, Jerome R. Bellegarda, Shiva Sundaram:
Exemplar-Based Processing for Speech Recognition: An Overview. IEEE Signal Process. Mag. 29(6): 98-113 (2012) - [c24]Shiva Sundaram, Jerome R. Bellegarda:
Latent perceptual mapping with data-driven variable-length acoustic units for template-based speech recognition. ICASSP 2012: 4125-4128 - 2011
- [c23]Shiva Sundaram, Robert Schleicher, Nathalie Diehl:
Experiments in context-independent recognition of non-lexical 'yes' or 'no' responses. ICASSP 2011: 5696-5699 - [c22]Shiva Sundaram, Vladan Velisavljevic, Yujie Qin:
Hotflashes: Thumbnailing videos of social gatherings by detecting camera flash illuminated frames. ICME 2011: 1-4 - [c21]Henrik von Coler, Shiva Sundaram, Robert Schleicher, Gabriel Curio:
Towards the influence of vibration on evaluation of speech utterances in mobile devices. WASPAA 2011: 297-300 - 2010
- [c20]Shiva Sundaram, Robert Schleicher, Julia Seebode:
Clustering audio clips by context-free description and affective ratings. EUSIPCO 2010: 472-476 - [c19]Samuel Kim, Shiva Sundaram, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
Acoustic stopwords for unstructured audio information retrieval. EUSIPCO 2010: 1277-1280 - [c18]Samuel Kim, Panayiotis G. Georgiou, Shrikanth S. Narayanan, Shiva Sundaram:
Using naïve text queries for robust audio information retrieval. ICASSP 2010: 2406-2409 - [c17]Shiva Sundaram, Robert Schleicher:
Towards evaluation of example-based audio retrieval system using affective dimensions. ICME 2010: 573-577 - [c16]Shiva Sundaram, Jerome R. Bellegarda:
Latent perceptual mapping: a new acoustic modeling framework for speech recognition. INTERSPEECH 2010: 881-884 - [c15]Samuel Kim, Shiva Sundaram, Panayiotis G. Georgiou, Shrikanth S. Narayanan:
An N-gram model for unstructured audio signals toward information retrieval. MMSP 2010: 477-480 - [c14]Shiva Sundaram, Robert Schleicher, Nathalie Diehl:
A demonstration of automatic recognition of 'yes' or 'no' non-lexical verbal responses for speech-based interaction. SLT 2010: 167-168
2000 – 2009
- 2009
- [c13]Shiva Sundaram, Shrikanth S. Narayanan:
A divide-and-conquer approach to Latent Perceptual Indexing of audio for large Web 2.0 applications. ICME 2009: 466-469 - [c12]Tim Polzehl, Shiva Sundaram, Hamed Ketabdar, Michael Wagner, Florian Metze:
Emotion classification in children's speech using fusion of acoustic and linguistic features. INTERSPEECH 2009: 340-343 - [c11]Ozlem Kalinli, Shiva Sundaram, Shrikanth S. Narayanan:
Saliency-driven unstructured acoustic scene classification using latent perceptual indexing. MMSP 2009: 1-6 - [c10]Samuel Kim, Shrikanth S. Narayanan, Shiva Sundaram:
Acoustic topic model for audio information retrieval. WASPAA 2009: 37-40 - 2008
- [c9]Shiva Sundaram, Shrikanth S. Narayanan:
Audio retrieval by latent perceptual indexing. ICASSP 2008: 49-52 - [c8]Shiva Sundaram, Shrikanth S. Narayanan:
Classification of sound clips by two schemes: Using onomatopoeia and semantic labels. ICME 2008: 1341-1344 - 2007
- [c7]Shiva Sundaram, Shrikanth S. Narayanan:
Discriminating Two Types of Noise Sources using Cortical Representation and Dimension Reduction Technique. ICASSP (1) 2007: 213-216 - [c6]Shiva Sundaram, Shrikanth S. Narayanan:
Analysis of Audio Clustering using Word Descriptions. ICASSP (2) 2007: 769-772 - [c5]Shiva Sundaram, Shrikanth S. Narayanan:
Experiments in Automatic Genre Classification of Full-length Music Tracks using Audio Activity Rate. MMSP 2007: 98-102 - 2006
- [c4]Shiva Sundaram, Shrikanth S. Narayanan:
Vector-based Representation and Clustering of Audio Using Onomatopoeia Words. AAAI Fall Symposium: Aurally Informed Performance 2006: 55- - [c3]Shrikanth S. Narayanan, Panayiotis G. Georgiou, Abhinav Sethy, Dagen Wang, Murtaza Bulut, Shiva Sundaram, Emil Ettelaie, Sankaranarayanan Ananthakrishnan, Horacio Franco, Kristin Precoda, Dimitra Vergyri, Jing Zheng, Wen Wang, Venkata Ramana Rao Gadde, Martin Graciarena, Victor Abrash, Michael W. Frandsen, Colleen Richey:
Speech Recognition Engineering Issues in Speech to Speech Translation System Design for Low Resource Languages and Domains. ICASSP (5) 2006: 1209-1212 - [c2]Shiva Sundaram, Shrikanth S. Narayanan:
An attribute-based approach to audio description applied to segmenting vocal sections in popular music songs. MMSP 2006: 103-107 - 2003
- [c1]Shiva Sundaram, Shrikanth S. Narayanan:
An empirical text transformation method for spontaneous speech synthesizers. INTERSPEECH 2003: 1221-1224
Coauthor Index
aka: Shrikanth S. Narayanan
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-09-10 23:43 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint