default search action
Gregory Sell
Person information
SPARQL queries
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2021
- [c39]Ruizhi Li, Gregory Sell, Hynek Hermansky:
Two-Stage Augmentation and Adaptive CTC Fusion for Improved Robustness of Multi-Stream end-to-end ASR. SLT 2021: 229-235 - [i6]Ruizhi Li, Gregory Sell, Hynek Hermansky:
Two-Stage Augmentation and Adaptive CTC Fusion for Improved Robustness of Multi-Stream End-to-End ASR. CoRR abs/2102.03055 (2021) - 2020
- [j3]Jesús Villalba, Nanxin Chen, David Snyder, Daniel Garcia-Romero, Alan McCree, Gregory Sell, Jonas Borgstrom, Leibny Paola García-Perera, Fred Richardson, Réda Dehak, Pedro A. Torres-Carrasquillo, Najim Dehak:
State-of-the-art speaker recognition with neural network embeddings in NIST SRE18 and Speakers in the Wild evaluations. Comput. Speech Lang. 60 (2020) - [c38]Ruizhi Li, Gregory Sell, Xiaofei Wang, Shinji Watanabe, Hynek Hermansky:
A Practical Two-Stage Training Strategy for Multi-Stream End-to-End Speech Recognition. ICASSP 2020: 7014-7018 - [c37]Daniel Garcia-Romero, Alan McCree, David Snyder, Gregory Sell:
Jhu-HLTCOE System for the Voxsrc Speaker Recognition Challenge. ICASSP 2020: 7559-7563 - [c36]Daniel Garcia-Romero, Gregory Sell, Alan McCree:
MagNetO: X-vector Magnitude Estimation Network plus Offset for Improved Speaker Recognition. Odyssey 2020: 1-8 - [c35]Jesús Antonio Villalba López, Daniel Garcia-Romero, Nanxin Chen, Gregory Sell, Jonas Borgstrom, Alan McCree, Leibny Paola García-Perera, Saurabh Kataria, Phani Sankar Nidadavolu, Pedro Torres-Carrasquiilo, Najim Dehak:
Advances in Speaker Recognition for Telephone and Audio-Visual Data: the JHU-MIT Submission for NIST SRE19. Odyssey 2020: 273-280
2010 – 2019
- 2019
- [c34]Sandeep Kothinti, Keisuke Imoto, Debmalya Chakrabarty, Gregory Sell, Shinji Watanabe, Mounya Elhilali:
Joint Acoustic and Class Inference for Weakly Supervised Sound Event Detection. ICASSP 2019: 36-40 - [c33]Lucas Ondel, Ruizhi Li, Gregory Sell, Hynek Hermansky:
Deriving Spectro-temporal Properties of Hearing from Speech Data. ICASSP 2019: 411-415 - [c32]David Snyder, Daniel Garcia-Romero, Gregory Sell, Alan McCree, Daniel Povey, Sanjeev Khudanpur:
Speaker Recognition for Multi-speaker Conversations Using X-vectors. ICASSP 2019: 5796-5800 - [c31]David Etter, Stephen Rawls, Cameron Carpenter, Gregory Sell:
A Synthetic Recipe for OCR. ICDAR 2019: 864-869 - [c30]Gregory Sell, David Etter, Daniel Garcia-Romero, Alan McCree:
Script Identification using Across- and Within-Image Distribution Estimation. ICDAR 2019: 1084-1089 - [c29]Alan McCree, Gregory Sell, Daniel Garcia-Romero:
Speaker Diarization Using Leave-One-Out Gaussian PLDA Clustering of DNN Embeddings. INTERSPEECH 2019: 381-385 - [c28]Jesús Villalba, Nanxin Chen, David Snyder, Daniel Garcia-Romero, Alan McCree, Gregory Sell, Jonas Borgstrom, Fred Richardson, Suwon Shon, François Grondin, Réda Dehak, Leibny Paola García-Perera, Daniel Povey, Pedro A. Torres-Carrasquillo, Sanjeev Khudanpur, Najim Dehak:
State-of-the-Art Speaker Recognition for Telephone and Video Speech: The JHU-MIT Submission for NIST SRE18. INTERSPEECH 2019: 1488-1492 - [c27]Daniel Garcia-Romero, David Snyder, Gregory Sell, Alan McCree, Daniel Povey, Sanjeev Khudanpur:
x-Vector DNN Refinement with Full-Length Recordings for Speaker Recognition. INTERSPEECH 2019: 1493-1496 - [c26]Daniel Garcia-Romero, David Snyder, Shinji Watanabe, Gregory Sell, Alan McCree, Daniel Povey, Sanjeev Khudanpur:
Speaker Recognition Benchmark Using the CHiME-5 Corpus. INTERSPEECH 2019: 1506-1510 - [c25]Ruizhi Li, Gregory Sell, Hynek Hermansky:
Performance Monitoring for End-to-End Speech Recognition. INTERSPEECH 2019: 2245-2249 - [c24]David Snyder, Jesús Villalba, Nanxin Chen, Daniel Povey, Gregory Sell, Najim Dehak, Sanjeev Khudanpur:
The JHU Speaker Recognition System for the VOiCES 2019 Challenge. INTERSPEECH 2019: 2468-2472 - [c23]Matthew Maciejewski, Gregory Sell, Yusuke Fujita, Leibny Paola García-Perera, Shinji Watanabe, Sanjeev Khudanpur:
Analysis of Robustness of Deep Single-Channel Speech Separation Using Corpora Constructed From Multiple Domains. WASPAA 2019: 165-169 - [i5]Ruizhi Li, Gregory Sell, Hynek Hermansky:
Performance Monitoring for End-to-End Speech Recognition. CoRR abs/1904.04896 (2019) - [i4]Ruizhi Li, Gregory Sell, Xiaofei Wang, Shinji Watanabe, Hynek Hermansky:
A practical two-stage training strategy for multi-stream end-to-end speech recognition. CoRR abs/1910.10671 (2019) - 2018
- [c22]Gregory Sell, Kevin Duh, David Snyder, Dave Etter, Daniel Garcia-Romero:
Audio-Visual Person Recognition in Multimedia Data From the Iarpa Janus Program. ICASSP 2018: 3031-3035 - [c21]David Snyder, Daniel Garcia-Romero, Gregory Sell, Daniel Povey, Sanjeev Khudanpur:
X-Vectors: Robust DNN Embeddings for Speaker Recognition. ICASSP 2018: 5329-5333 - [c20]Gregory Sell, David Snyder, Alan McCree, Daniel Garcia-Romero, Jesús Villalba, Matthew Maciejewski, Vimal Manohar, Najim Dehak, Daniel Povey, Shinji Watanabe, Sanjeev Khudanpur:
Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge. INTERSPEECH 2018: 2808-2812 - [c19]Alan McCree, David Snyder, Gregory Sell, Daniel Garcia-Romero:
Language Recognition for Telephone and Video Speech: The JHU HLTCOE Submission for NIST LRE17. Odyssey 2018: 68-73 - [c18]David Snyder, Daniel Garcia-Romero, Alan McCree, Gregory Sell, Daniel Povey, Sanjeev Khudanpur:
Spoken Language Recognition using X-vectors. Odyssey 2018: 105-111 - [i3]Matthew Maciejewski, Gregory Sell, Leibny Paola García-Perera, Shinji Watanabe, Sanjeev Khudanpur:
Building Corpora for Single-Channel Speech Separation Across Multiple Domains. CoRR abs/1811.02641 (2018) - [i2]Sandeep Kothinti, Keisuke Imoto, Debmalya Chakrabarty, Gregory Sell, Shinji Watanabe, Mounya Elhilali:
Joint Acoustic and Class Inference for Weakly Supervised Sound Event Detection. CoRR abs/1811.04048 (2018) - 2017
- [j2]Aren Jansen, Gregory Sell, Vince Lyzinski:
Scalable out-of-sample extension of graph embeddings using deep neural networks. Pattern Recognit. Lett. 94: 1-6 (2017) - [c17]Ning Gao, Gregory Sell, Douglas W. Oard, Mark Dredze:
Leveraging side information for speaker identification with the Enron conversational telephone speech collection. ASRU 2017: 577-583 - [c16]Daniel Garcia-Romero, David Snyder, Gregory Sell, Daniel Povey, Alan McCree:
Speaker diarization using deep neural network embeddings. ICASSP 2017: 4930-4934 - [c15]Gregory Sell, Alan McCree:
Multi-speaker conversations, cross-talk, and diarization for speaker recognition. ICASSP 2017: 5425-5429 - [c14]Alan McCree, Gregory Sell, Daniel Garcia-Romero:
Extended Variability Modeling and Unsupervised Adaptation for PLDA Speaker Recognition. INTERSPEECH 2017: 1552-1556 - 2016
- [c13]Gregory Sell, Alan McCree, Daniel Garcia-Romero:
Priors for Speaker Counting and Diarization with AHC. INTERSPEECH 2016: 2194-2198 - [c12]Alan McCree, Gregory Sell, Daniel Garcia-Romero:
Augmented Data Training of Joint Acoustic/Phonotactic DNN i-vectors for NIST LRE15. Odyssey 2016: 204-209 - 2015
- [c11]Gregory Sell, Daniel Garcia-Romero:
Diarization resegmentation in the factor analysis subspace. ICASSP 2015: 4794-4798 - [c10]Jonathan Wintrode, Gregory Sell, Aren Jansen, Michelle Fox, Daniel Garcia-Romero, Alan McCree:
Content-based recommender systems for spoken documents. ICASSP 2015: 5201-5205 - [c9]Gregory Sell, Daniel Garcia-Romero, Alan McCree:
Speaker diarization with i-vectors from DNN senone posteriors. INTERSPEECH 2015: 3096-3099 - [c8]Vince Lyzinski, Gregory Sell, Aren Jansen:
An evaluation of graph clustering methods for unsupervised term discovery. INTERSPEECH 2015: 3209-3213 - [i1]Aren Jansen, Gregory Sell, Vince Lyzinski:
Scalable Out-of-Sample Extension of Graph Embeddings Using Deep Neural Networks. CoRR abs/1508.04422 (2015) - 2014
- [c7]Gregory Sell:
Automatic carrier pitch estimation for coherent demodulation. ICASSP 2014: 2119-2123 - [c6]Gregory Sell, Pascal Clark:
Music tonality features for speech/music discrimination. ICASSP 2014: 2489-2493 - [c5]Gregory Sell, Daniel Garcia-Romero:
Speaker diarization with plda i-vector scoring and unsupervised calibration. SLT 2014: 413-417 - 2013
- [c4]Gregory Sell:
Optimizing coherent demodulation for improved separation of overlapping sources. ICASSP 2013: 901-904 - 2011
- [c3]Geoffrey Zweig, Patrick Nguyen, Dirk Van Compernolle, Kris Demuynck, Les E. Atlas, Pascal Clark, Gregory Sell, Meihong Wang, Fei Sha, Hynek Hermansky, Damianos G. Karakos, Aren Jansen, Samuel Thomas, Sivaram G. S. V. S., Samuel R. Bowman, Justine T. Kao:
Speech recognitionwith segmental conditional random fields: A summary of the JHU CLSP 2010 Summer Workshop. ICASSP 2011: 5044-5047 - [c2]Pascal Clark, Gregory Sell, Les E. Atlas:
A novel approach using modulation features for multiphone-based speech recognition. ICASSP 2011: 5264-5267 - 2010
- [j1]Gregory Sell, Malcolm Slaney:
Solving Demodulation as an Optimization Problem. IEEE Trans. Speech Audio Process. 18(8): 2051-2066 (2010) - [c1]Gregory Sell, Malcolm Slaney:
The information content of demodulated speech. ICASSP 2010: 5470-5473
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-08-05 21:15 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint