https://dblp.org/pid/159/1894.rss
dblp: Kelvin Xu
https://dblp.org/pid/159/1894.html
dblp person page RSS feed
Tue, 22 Oct 2024 20:16:14 +0200
en-US
daily
1
released under the CC0 1.0 license
dblp@dagstuhl.de (dblp team)
Computers/Computer_Science/Publications/Bibliographies
http://www.rssboard.org/rss-specification
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models.
https://openreview.net/forum?id=lNAyUngGFK
Avi Singh
,
John D. Co-Reyes
,
Rishabh Agarwal
,
Ankesh Anand
,
Piyush Patil
,
Xavier Garcia
,
Peter J. Liu
,
James Harrison
,
Jaehoon Lee
,
Kelvin Xu
,
Aaron T. Parisi
,
Abhishek Kumar
,
Alexander A. Alemi
,
Alex Rizkowsky
,
Azade Nova
,
Ben Adlam
,
Bernd Bohnet
,
Gamaleldin Fathy Elsayed
,
Hanie Sedghi
,
Igor Mordatch
,
Isabelle Simpson
,
Izzeddin Gur
,
Jasper Snoek
,
Jeffrey Pennington
,
Jiri Hron
,
Kathleen Kenealy
,
Kevin Swersky
,
Kshiteej Mahajan
,
Laura Culp
,
Lechao Xiao
,
Maxwell L. Bileschi
,
Noah Constant
,
Roman Novak
,
Rosanne Liu
,
Tris Warkentin
,
Yundi Qian
,
Yamini Bansal
,
Ethan Dyer
,
Behnam Neyshabur
,
Jascha Sohl-Dickstein
,
Noah Fiedel
:
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models.
Trans. Mach. Learn. Res.
2024
(
2024
)
https://dblp.org/rec/journals/tmlr/SinghCAAPGLH0XP24
Mon, 01 Jan 2024 00:00:00 +0100
ContMulti-objective Optimization Model for Momentum Change Based on Genetic Algorithm.
https://doi.org/10.1007/978-981-97-5578-3_11
Shuo Zhang
,
Ziqi Kong
,
Kelvin Xu
,
Guangxiao Shi
,
Zixiao Kong
,
Xia Li
,
Jinjin Zan
:
ContMulti-objective Optimization Model for Momentum Change Based on Genetic Algorithm.
ICIC (1)
2024
:
134-145
https://dblp.org/rec/conf/icic/ZhangKXSKLZ24
Mon, 01 Jan 2024 00:00:00 +0100
Small-scale proxies for large-scale Transformer training instabilities.
https://openreview.net/forum?id=d8w0pmvXbZ
Mitchell Wortsman
,
Peter J. Liu
,
Lechao Xiao
,
Katie E. Everett
,
Alexander A. Alemi
,
Ben Adlam
,
John D. Co-Reyes
,
Izzeddin Gur
,
Abhishek Kumar
,
Roman Novak
,
Jeffrey Pennington
,
Jascha Sohl-Dickstein
,
Kelvin Xu
,
Jaehoon Lee
,
Justin Gilmer
,
Simon Kornblith
:
Small-scale proxies for large-scale Transformer training instabilities.
ICLR
2024
https://dblp.org/rec/conf/iclr/WortsmanLXEAACG24
Mon, 01 Jan 2024 00:00:00 +0100
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters.
https://doi.org/10.48550/arXiv.2408.03314
Charlie Snell
,
Jaehoon Lee
,
Kelvin Xu
,
Aviral Kumar
:
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters.
CoRR
abs/2408.03314
(
2024
)
https://dblp.org/rec/journals/corr/abs-2408-03314
Mon, 01 Jan 2024 00:00:00 +0100
Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability.
https://doi.org/10.48550/arXiv.2408.07852
Jiri Hron
,
Laura Culp
,
Gamaleldin F. Elsayed
,
Rosanne Liu
,
Ben Adlam
,
Maxwell L. Bileschi
,
Bernd Bohnet
,
JD Co-Reyes
,
Noah Fiedel
,
C. Daniel Freeman
,
Izzeddin Gur
,
Kathleen Kenealy
,
Jaehoon Lee
,
Peter J. Liu
,
Gaurav Mishra
,
Igor Mordatch
,
Azade Nova
,
Roman Novak
,
Aaron Parisi
,
Jeffrey Pennington
,
Alex Rizkowsky
,
Isabelle Simpson
,
Hanie Sedghi
,
Jascha Sohl-Dickstein
,
Kevin Swersky
,
Sharad Vikram
,
Tris Warkentin
,
Lechao Xiao
,
Kelvin Xu
,
Jasper Snoek
,
Simon Kornblith
:
Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability.
CoRR
abs/2408.07852
(
2024
)
https://dblp.org/rec/journals/corr/abs-2408-07852
Mon, 01 Jan 2024 00:00:00 +0100
Michelangelo: Long Context Evaluations Beyond Haystacks via Latent Structure Queries.
https://doi.org/10.48550/arXiv.2409.12640
Kiran Vodrahalli
,
Santiago Ontanon
,
Nilesh Tripuraneni
,
Kelvin Xu
,
Sanil Jain
,
Rakesh Shivanna
,
Jeffrey Hui
,
Nishanth Dikkala
,
Mehran Kazemi
,
Bahare Fatemi
,
Rohan Anil
,
Ethan Dyer
,
Siamak Shakeri
,
Roopali Vij
,
Harsh Mehta
,
Vinay V. Ramasesh
,
Quoc Le
,
Ed H. Chi
,
Yifeng Lu
,
Orhan Firat
,
Angeliki Lazaridou
,
Jean-Baptiste Lespiau
,
Nithya Attaluri
,
Kate Olszewska
:
Michelangelo: Long Context Evaluations Beyond Haystacks via Latent Structure Queries.
CoRR
abs/2409.12640
(
2024
)
https://dblp.org/rec/journals/corr/abs-2409-12640
Mon, 01 Jan 2024 00:00:00 +0100
Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance.
https://doi.org/10.1109/ICRA48891.2023.10161493
Kelvin Xu
,
Zheyuan Hu
,
Ria Doshi
,
Aaron Rovinsky
,
Vikash Kumar
,
Abhishek Gupta
,
Sergey Levine
:
Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance.
ICRA
2023
:
5938-5945
https://dblp.org/rec/conf/icra/XuHDRKGL23
Sun, 01 Jan 2023 00:00:00 +0100
Small-scale proxies for large-scale Transformer training instabilities.
https://doi.org/10.48550/arXiv.2309.14322
Mitchell Wortsman
,
Peter J. Liu
,
Lechao Xiao
,
Katie Everett
,
Alex Alemi
,
Ben Adlam
,
John D. Co-Reyes
,
Izzeddin Gur
,
Abhishek Kumar
,
Roman Novak
,
Jeffrey Pennington
,
Jascha Sohl-Dickstein
,
Kelvin Xu
,
Jaehoon Lee
,
Justin Gilmer
,
Simon Kornblith
:
Small-scale proxies for large-scale Transformer training instabilities.
CoRR
abs/2309.14322
(
2023
)
https://dblp.org/rec/journals/corr/abs-2309-14322
Sun, 01 Jan 2023 00:00:00 +0100
Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5?"
https://doi.org/10.48550/arXiv.2311.07587
C. Daniel Freeman
,
Laura Culp
,
Aaron Parisi
,
Maxwell L. Bileschi
,
Gamaleldin F. Elsayed
,
Alex Rizkowsky
,
Isabelle Simpson
,
Alex Alemi
,
Azade Nova
,
Ben Adlam
,
Bernd Bohnet
,
Gaurav Mishra
,
Hanie Sedghi
,
Igor Mordatch
,
Izzeddin Gur
,
Jaehoon Lee
,
John D. Co-Reyes
,
Jeffrey Pennington
,
Kelvin Xu
,
Kevin Swersky
,
Kshiteej Mahajan
,
Lechao Xiao
,
Rosanne Liu
,
Simon Kornblith
,
Noah Constant
,
Peter J. Liu
,
Roman Novak
,
Yundi Qian
,
Noah Fiedel
,
Jascha Sohl-Dickstein
:
Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5?"
CoRR
abs/2311.07587
(
2023
)
https://dblp.org/rec/journals/corr/abs-2311-07587
Sun, 01 Jan 2023 00:00:00 +0100
LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models.
https://doi.org/10.48550/arXiv.2311.18232
Marwa Abdulhai
,
Isadora White
,
Charlie Snell
,
Charles Sun
,
Joey Hong
,
Yuexiang Zhai
,
Kelvin Xu
,
Sergey Levine
:
LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models.
CoRR
abs/2311.18232
(
2023
)
https://dblp.org/rec/journals/corr/abs-2311-18232
Sun, 01 Jan 2023 00:00:00 +0100
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models.
https://doi.org/10.48550/arXiv.2312.06585
Avi Singh
,
John D. Co-Reyes
,
Rishabh Agarwal
,
Ankesh Anand
,
Piyush Patil
,
Xavier Garcia
,
Peter J. Liu
,
James Harrison
,
Jaehoon Lee
,
Kelvin Xu
,
Aaron Parisi
,
Abhishek Kumar
,
Alex Alemi
,
Alex Rizkowsky
,
Azade Nova
,
Ben Adlam
,
Bernd Bohnet
,
Gamaleldin F. Elsayed
,
Hanie Sedghi
,
Igor Mordatch
,
Isabelle Simpson
,
Izzeddin Gur
,
Jasper Snoek
,
Jeffrey Pennington
,
Jiri Hron
,
Kathleen Kenealy
,
Kevin Swersky
,
Kshiteej Mahajan
,
Laura Culp
,
Lechao Xiao
,
Maxwell L. Bileschi
,
Noah Constant
,
Roman Novak
,
Rosanne Liu
,
Tris Warkentin
,
Yundi Qian
,
Yamini Bansal
,
Ethan Dyer
,
Behnam Neyshabur
,
Jascha Sohl-Dickstein
,
Noah Fiedel
:
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models.
CoRR
abs/2312.06585
(
2023
)
https://dblp.org/rec/journals/corr/abs-2312-06585
Sun, 01 Jan 2023 00:00:00 +0100
Towards Adaptive, Continual Embodied Agents.
https://www.escholarship.org/uc/item/3tk9g0b7
Kelvin Xu
:
Towards Adaptive, Continual Embodied Agents.
University of California, Berkeley, USA,
2022
https://dblp.org/rec/phd/us/Xu22f
Sat, 01 Jan 2022 00:00:00 +0100
Autonomous Reinforcement Learning: Formalism and Benchmarking.
https://openreview.net/forum?id=nkaba3ND7B5
Archit Sharma
,
Kelvin Xu
,
Nikhil Sardana
,
Abhishek Gupta
,
Karol Hausman
,
Sergey Levine
,
Chelsea Finn
:
Autonomous Reinforcement Learning: Formalism and Benchmarking.
ICLR
2022
https://dblp.org/rec/conf/iclr/SharmaXS0HLF22
Sat, 01 Jan 2022 00:00:00 +0100
Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance.
https://doi.org/10.48550/arXiv.2212.09902
Kelvin Xu
,
Zheyuan Hu
,
Ria Doshi
,
Aaron Rovinsky
,
Vikash Kumar
,
Abhishek Gupta
,
Sergey Levine
:
Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance.
CoRR
abs/2212.09902
(
2022
)
https://dblp.org/rec/journals/corr/abs-2212-09902
Sat, 01 Jan 2022 00:00:00 +0100
Reset-Free Reinforcement Learning via Multi-Task Learning: Learning Dexterous Manipulation Behaviors without Human Intervention.
https://doi.org/10.1109/ICRA48506.2021.9561384
Abhishek Gupta
,
Justin Yu
,
Tony Z. Zhao
,
Vikash Kumar
,
Aaron Rovinsky
,
Kelvin Xu
,
Thomas Devlin
,
Sergey Levine
:
Reset-Free Reinforcement Learning via Multi-Task Learning: Learning Dexterous Manipulation Behaviors without Human Intervention.
ICRA
2021
:
6664-6671
https://dblp.org/rec/conf/icra/0004YZKRXDL21
Fri, 01 Jan 2021 00:00:00 +0100
Reset-Free Reinforcement Learning via Multi-Task Learning: Learning Dexterous Manipulation Behaviors without Human Intervention.
https://arxiv.org/abs/2104.11203
Abhishek Gupta
,
Justin Yu
,
Tony Z. Zhao
,
Vikash Kumar
,
Aaron Rovinsky
,
Kelvin Xu
,
Thomas Devlin
,
Sergey Levine
:
Reset-Free Reinforcement Learning via Multi-Task Learning: Learning Dexterous Manipulation Behaviors without Human Intervention.
CoRR
abs/2104.11203
(
2021
)
https://dblp.org/rec/journals/corr/abs-2104-11203
Fri, 01 Jan 2021 00:00:00 +0100
Autonomous Reinforcement Learning: Formalism and Benchmarking.
https://arxiv.org/abs/2112.09605
Archit Sharma
,
Kelvin Xu
,
Nikhil Sardana
,
Abhishek Gupta
,
Karol Hausman
,
Sergey Levine
,
Chelsea Finn
:
Autonomous Reinforcement Learning: Formalism and Benchmarking.
CoRR
abs/2112.09605
(
2021
)
https://dblp.org/rec/journals/corr/abs-2112-09605
Fri, 01 Jan 2021 00:00:00 +0100
Meta-Dataset: A Dataset of Datasets for Learning to Learn from Few Examples.
https://openreview.net/forum?id=rkgAGAVKPr
Eleni Triantafillou
,
Tyler Zhu
,
Vincent Dumoulin
,
Pascal Lamblin
,
Utku Evci
,
Kelvin Xu
,
Ross Goroshin
,
Carles Gelada
,
Kevin Swersky
,
Pierre-Antoine Manzagol
,
Hugo Larochelle
:
Meta-Dataset: A Dataset of Datasets for Learning to Learn from Few Examples.
ICLR
2020
https://dblp.org/rec/conf/iclr/TriantafillouZD20
Wed, 01 Jan 2020 00:00:00 +0100
Continual Learning of Control Primitives: Skill Discovery via Reset-Games.
https://proceedings.neurips.cc/paper/2020/hash/3472ab80b6dff70c54758fd6dfc800c2-Abstract.html
Kelvin Xu
,
Siddharth Verma
,
Chelsea Finn
,
Sergey Levine
:
Continual Learning of Control Primitives: Skill Discovery via Reset-Games.
NeurIPS
2020
https://dblp.org/rec/conf/nips/XuVFL20
Wed, 01 Jan 2020 00:00:00 +0100
Continual Learning of Control Primitives: Skill Discovery via Reset-Games.
https://arxiv.org/abs/2011.05286
Kelvin Xu
,
Siddharth Verma
,
Chelsea Finn
,
Sergey Levine
:
Continual Learning of Control Primitives: Skill Discovery via Reset-Games.
CoRR
abs/2011.05286
(
2020
)
https://dblp.org/rec/journals/corr/abs-2011-05286
Wed, 01 Jan 2020 00:00:00 +0100
Learning a Prior over Intent via Meta-Inverse Reinforcement Learning.
http://proceedings.mlr.press/v97/xu19d.html
Kelvin Xu
,
Ellis Ratner
,
Anca D. Dragan
,
Sergey Levine
,
Chelsea Finn
:
Learning a Prior over Intent via Meta-Inverse Reinforcement Learning.
ICML
2019
:
6952-6962
https://dblp.org/rec/conf/icml/XuRDLF19
Tue, 01 Jan 2019 00:00:00 +0100
Privacy-Preserving Fall Detection with Deep Learning on mmWave Radar Signal.
https://doi.org/10.1109/VCIP47243.2019.8965661
Yangfan Sun
,
Renlong Hang
,
Zhu Li
,
Mouqing Jin
,
Kelvin Xu
:
Privacy-Preserving Fall Detection with Deep Learning on mmWave Radar Signal.
VCIP
2019
:
1-4
https://dblp.org/rec/conf/vcip/SunH0JX19
Tue, 01 Jan 2019 00:00:00 +0100
Meta-Dataset: A Dataset of Datasets for Learning to Learn from Few Examples.
http://arxiv.org/abs/1903.03096
Eleni Triantafillou
,
Tyler Zhu
,
Vincent Dumoulin
,
Pascal Lamblin
,
Kelvin Xu
,
Ross Goroshin
,
Carles Gelada
,
Kevin Swersky
,
Pierre-Antoine Manzagol
,
Hugo Larochelle
:
Meta-Dataset: A Dataset of Datasets for Learning to Learn from Few Examples.
CoRR
abs/1903.03096
(
2019
)
https://dblp.org/rec/journals/corr/abs-1903-03096
Tue, 01 Jan 2019 00:00:00 +0100
Trust-PCL: An Off-Policy Trust Region Method for Continuous Control.
https://openreview.net/forum?id=HyrCWeWCb
Ofir Nachum
,
Mohammad Norouzi
,
Kelvin Xu
,
Dale Schuurmans
:
Trust-PCL: An Off-Policy Trust Region Method for Continuous Control.
ICLR (Poster)
2018
https://dblp.org/rec/conf/iclr/Nachum0XS18
Mon, 01 Jan 2018 00:00:00 +0100
Probabilistic Model-Agnostic Meta-Learning.
https://proceedings.neurips.cc/paper/2018/hash/8e2c381d4dd04f1c55093f22c59c3a08-Abstract.html
Chelsea Finn
,
Kelvin Xu
,
Sergey Levine
:
Probabilistic Model-Agnostic Meta-Learning.
NeurIPS
2018
:
9537-9548
https://dblp.org/rec/conf/nips/FinnXL18
Mon, 01 Jan 2018 00:00:00 +0100
Learning a Prior over Intent via Meta-Inverse Reinforcement Learning.
http://arxiv.org/abs/1805.12573
Kelvin Xu
,
Ellis Ratner
,
Anca D. Dragan
,
Sergey Levine
,
Chelsea Finn
:
Learning a Prior over Intent via Meta-Inverse Reinforcement Learning.
CoRR
abs/1805.12573
(
2018
)
https://dblp.org/rec/journals/corr/abs-1805-12573
Mon, 01 Jan 2018 00:00:00 +0100
Probabilistic Model-Agnostic Meta-Learning.
http://arxiv.org/abs/1806.02817
Chelsea Finn
,
Kelvin Xu
,
Sergey Levine
:
Probabilistic Model-Agnostic Meta-Learning.
CoRR
abs/1806.02817
(
2018
)
https://dblp.org/rec/journals/corr/abs-1806-02817
Mon, 01 Jan 2018 00:00:00 +0100
On integrating a language model into neural machine translation.
https://doi.org/10.1016/j.csl.2017.01.014
Çaglar Gülçehre
,
Orhan Firat
,
Kelvin Xu
,
Kyunghyun Cho
,
Yoshua Bengio
:
On integrating a language model into neural machine translation.
Comput. Speech Lang.
45
:
137-148
(
2017
)
https://dblp.org/rec/journals/csl/GulcehreFXCB17
Sun, 01 Jan 2017 00:00:00 +0100
An Actor-Critic Algorithm for Sequence Prediction.
https://openreview.net/forum?id=SJDaqqveg
Dzmitry Bahdanau
,
Philemon Brakel
,
Kelvin Xu
,
Anirudh Goyal
,
Ryan Lowe
,
Joelle Pineau
,
Aaron C. Courville
,
Yoshua Bengio
:
An Actor-Critic Algorithm for Sequence Prediction.
ICLR (Poster)
2017
https://dblp.org/rec/conf/iclr/BahdanauBXGLPCB17
Sun, 01 Jan 2017 00:00:00 +0100
Unsupervised Perceptual Rewards for Imitation Learning.
https://openreview.net/forum?id=Byf3mmNFl
Pierre Sermanet
,
Kelvin Xu
,
Sergey Levine
:
Unsupervised Perceptual Rewards for Imitation Learning.
ICLR (Workshop)
2017
https://dblp.org/rec/conf/iclr/SermanetXL17
Sun, 01 Jan 2017 00:00:00 +0100
Bridging the Gap Between Value and Policy Based Reinforcement Learning.
https://proceedings.neurips.cc/paper/2017/hash/facf9f743b083008a894eee7baa16469-Abstract.html
Ofir Nachum
,
Mohammad Norouzi
,
Kelvin Xu
,
Dale Schuurmans
:
Bridging the Gap Between Value and Policy Based Reinforcement Learning.
NIPS
2017
:
2775-2785
https://dblp.org/rec/conf/nips/NachumNXS17
Sun, 01 Jan 2017 00:00:00 +0100
Unsupervised Perceptual Rewards for Imitation Learning.
http://www.roboticsproceedings.org/rss13/p50.html
Pierre Sermanet
,
Kelvin Xu
,
Sergey Levine
:
Unsupervised Perceptual Rewards for Imitation Learning.
Robotics: Science and Systems
2017
https://dblp.org/rec/conf/rss/SermanetXL17
Sun, 01 Jan 2017 00:00:00 +0100
Bridging the Gap Between Value and Policy Based Reinforcement Learning.
http://arxiv.org/abs/1702.08892
Ofir Nachum
,
Mohammad Norouzi
,
Kelvin Xu
,
Dale Schuurmans
:
Bridging the Gap Between Value and Policy Based Reinforcement Learning.
CoRR
abs/1702.08892
(
2017
)
https://dblp.org/rec/journals/corr/NachumNXS17
Sun, 01 Jan 2017 00:00:00 +0100
Trust-PCL: An Off-Policy Trust Region Method for Continuous Control.
http://arxiv.org/abs/1707.01891
Ofir Nachum
,
Mohammad Norouzi
,
Kelvin Xu
,
Dale Schuurmans
:
Trust-PCL: An Off-Policy Trust Region Method for Continuous Control.
CoRR
abs/1707.01891
(
2017
)
https://dblp.org/rec/journals/corr/NachumNXS17aa
Sun, 01 Jan 2017 00:00:00 +0100
Theano: A Python framework for fast computation of mathematical expressions.
http://arxiv.org/abs/1605.02688
Rami Al-Rfou
,
Guillaume Alain
,
Amjad Almahairi
,
Christof Angermüller
,
Dzmitry Bahdanau
,
Nicolas Ballas
,
Frédéric Bastien
,
Justin Bayer
,
Anatoly Belikov
,
Alexander Belopolsky
,
Yoshua Bengio
,
Arnaud Bergeron
,
James Bergstra
,
Valentin Bisson
,
Josh Bleecher Snyder
,
Nicolas Bouchard
,
Nicolas Boulanger-Lewandowski
,
Xavier Bouthillier
,
Alexandre de Brébisson
,
Olivier Breuleux
,
Pierre Luc Carrier
,
Kyunghyun Cho
,
Jan Chorowski
,
Paul F. Christiano
,
Tim Cooijmans
,
Marc-Alexandre Côté
,
Myriam Côté
,
Aaron C. Courville
,
Yann N. Dauphin
,
Olivier Delalleau
,
Julien Demouth
,
Guillaume Desjardins
,
Sander Dieleman
,
Laurent Dinh
,
Melanie Ducoffe
,
Vincent Dumoulin
,
Samira Ebrahimi Kahou
,
Dumitru Erhan
,
Ziye Fan
,
Orhan Firat
,
Mathieu Germain
,
Xavier Glorot
,
Ian J. Goodfellow
,
Matthew Graham
,
Çaglar Gülçehre
,
Philippe Hamel
,
Iban Harlouchet
,
Jean-Philippe Heng
,
Balázs Hidasi
,
Sina Honari
,
Arjun Jain
,
Sébastien Jean
,
Kai Jia
,
Mikhail Korobov
,
Vivek Kulkarni
,
Alex Lamb
,
Pascal Lamblin
,
Eric Larsen
,
César Laurent
,
Sean Lee
,
Simon Lefrançois
,
Simon Lemieux
,
Nicholas Léonard
,
Zhouhan Lin
,
Jesse A. Livezey
,
Cory Lorenz
,
Jeremiah Lowin
,
Qianli Ma
,
Pierre-Antoine Manzagol
,
Olivier Mastropietro
,
Robert McGibbon
,
Roland Memisevic
,
Bart van Merriënboer
,
Vincent Michalski
,
Mehdi Mirza
,
Alberto Orlandi
,
Christopher Joseph Pal
,
Razvan Pascanu
,
Mohammad Pezeshki
,
Colin Raffel
,
Daniel Renshaw
,
Matthew Rocklin
,
Adriana Romero
,
Markus Roth
,
Peter Sadowski
,
John Salvatier
,
François Savard
,
Jan Schlüter
,
John Schulman
,
Gabriel Schwartz
,
Iulian Vlad Serban
,
Dmitriy Serdyuk
,
Samira Shabanian
,
Étienne Simon
,
Sigurd Spieckermann
,
S. Ramana Subramanyam
,
Jakub Sygnowski
,
Jérémie Tanguay
,
Gijs van Tulder
,
Joseph P. Turian
,
Sebastian Urban
,
Pascal Vincent
,
Francesco Visin
,
Harm de Vries
,
David Warde-Farley
,
Dustin J. Webb
,
Matthew Willson
,
Kelvin Xu
,
Lijun Xue
,
Li Yao
,
Saizheng Zhang
,
Ying Zhang
:
Theano: A Python framework for fast computation of mathematical expressions.
CoRR
abs/1605.02688
(
2016
)
https://dblp.org/rec/journals/corr/Al-RfouAAa16
Fri, 01 Jan 2016 00:00:00 +0100
An Actor-Critic Algorithm for Sequence Prediction.
http://arxiv.org/abs/1607.07086
Dzmitry Bahdanau
,
Philemon Brakel
,
Kelvin Xu
,
Anirudh Goyal
,
Ryan Lowe
,
Joelle Pineau
,
Aaron C. Courville
,
Yoshua Bengio
:
An Actor-Critic Algorithm for Sequence Prediction.
CoRR
abs/1607.07086
(
2016
)
https://dblp.org/rec/journals/corr/BahdanauBXGLPCB16
Fri, 01 Jan 2016 00:00:00 +0100
Unsupervised Perceptual Rewards for Imitation Learning.
http://arxiv.org/abs/1612.06699
Pierre Sermanet
,
Kelvin Xu
,
Sergey Levine
:
Unsupervised Perceptual Rewards for Imitation Learning.
CoRR
abs/1612.06699
(
2016
)
https://dblp.org/rec/journals/corr/SermanetXL16
Fri, 01 Jan 2016 00:00:00 +0100
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention.
http://proceedings.mlr.press/v37/xuc15.html
Kelvin Xu
,
Jimmy Ba
,
Ryan Kiros
,
Kyunghyun Cho
,
Aaron C. Courville
,
Ruslan Salakhutdinov
,
Richard S. Zemel
,
Yoshua Bengio
:
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention.
ICML
2015
:
2048-2057
https://dblp.org/rec/conf/icml/XuBKCCSZB15
Thu, 01 Jan 2015 00:00:00 +0100
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention.
http://arxiv.org/abs/1502.03044
Kelvin Xu
,
Jimmy Ba
,
Ryan Kiros
,
Kyunghyun Cho
,
Aaron C. Courville
,
Ruslan Salakhutdinov
,
Richard S. Zemel
,
Yoshua Bengio
:
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention.
CoRR
abs/1502.03044
(
2015
)
https://dblp.org/rec/journals/corr/XuBKCCSZB15
Thu, 01 Jan 2015 00:00:00 +0100
On Using Monolingual Corpora in Neural Machine Translation.
http://arxiv.org/abs/1503.03535
Çaglar Gülçehre
,
Orhan Firat
,
Kelvin Xu
,
Kyunghyun Cho
,
Loïc Barrault
,
Huei-Chi Lin
,
Fethi Bougares
,
Holger Schwenk
,
Yoshua Bengio
:
On Using Monolingual Corpora in Neural Machine Translation.
CoRR
abs/1503.03535
(
2015
)
https://dblp.org/rec/journals/corr/GulcehreFXCBLBS15
Thu, 01 Jan 2015 00:00:00 +0100
A Controller Recognizer Framework: How necessary is recognition for control?
http://arxiv.org/abs/1511.06428
Marcin Moczulski
,
Kelvin Xu
,
Aaron C. Courville
,
Kyunghyun Cho
:
A Controller Recognizer Framework: How necessary is recognition for control?
CoRR
abs/1511.06428
(
2015
)
https://dblp.org/rec/journals/corr/MoczulskiXCC15
Thu, 01 Jan 2015 00:00:00 +0100