Link to original content: https://dblp.org/pid/159/1894.rss

dblp: Kelvin Xu
https://dblp.org/pid/159/1894.html
dblp person page RSS feed, Tue, 22 Oct 2024 20:16:14 +0200. Released under the CC0 1.0 license. Contact: dblp@dagstuhl.de (dblp team).

Avi Singh, John D. Co-Reyes, Rishabh Agarwal, Ankesh Anand, Piyush Patil, Xavier Garcia, Peter J. Liu, James Harrison, Jaehoon Lee, Kelvin Xu, Aaron T. Parisi, Abhishek Kumar, Alexander A. Alemi, Alex Rizkowsky, Azade Nova, Ben Adlam, Bernd Bohnet, Gamaleldin Fathy Elsayed, Hanie Sedghi, Igor Mordatch, Isabelle Simpson, Izzeddin Gur, Jasper Snoek, Jeffrey Pennington, Jiri Hron, Kathleen Kenealy, Kevin Swersky, Kshiteej Mahajan, Laura Culp, Lechao Xiao, Maxwell L. Bileschi, Noah Constant, Roman Novak, Rosanne Liu, Tris Warkentin, Yundi Qian, Yamini Bansal, Ethan Dyer, Behnam Neyshabur, Jascha Sohl-Dickstein, Noah Fiedel:
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models. Trans. Mach. Learn. Res. 2024 (2024)
Link: https://openreview.net/forum?id=lNAyUngGFK
dblp: https://dblp.org/rec/journals/tmlr/SinghCAAPGLH0XP24

Shuo Zhang, Ziqi Kong, Kelvin Xu, Guangxiao Shi, Zixiao Kong, Xia Li, Jinjin Zan:
ContMulti-objective Optimization Model for Momentum Change Based on Genetic Algorithm. ICIC (1) 2024: 134-145
Link: https://doi.org/10.1007/978-981-97-5578-3_11
dblp: https://dblp.org/rec/conf/icic/ZhangKXSKLZ24

Mitchell Wortsman, Peter J. Liu, Lechao Xiao, Katie E. Everett, Alexander A. Alemi, Ben Adlam, John D. Co-Reyes, Izzeddin Gur, Abhishek Kumar, Roman Novak, Jeffrey Pennington, Jascha Sohl-Dickstein, Kelvin Xu, Jaehoon Lee, Justin Gilmer, Simon Kornblith:
Small-scale proxies for large-scale Transformer training instabilities. ICLR 2024
Link: https://openreview.net/forum?id=d8w0pmvXbZ
dblp: https://dblp.org/rec/conf/iclr/WortsmanLXEAACG24

Charlie Snell, Jaehoon Lee, Kelvin Xu, Aviral Kumar:
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters. CoRR abs/2408.03314 (2024)
Link: https://doi.org/10.48550/arXiv.2408.03314
dblp: https://dblp.org/rec/journals/corr/abs-2408-03314

Jiri Hron, Laura Culp, Gamaleldin F. Elsayed, Rosanne Liu, Ben Adlam, Maxwell L. Bileschi, Bernd Bohnet, JD Co-Reyes, Noah Fiedel, C. Daniel Freeman, Izzeddin Gur, Kathleen Kenealy, Jaehoon Lee, Peter J. Liu, Gaurav Mishra, Igor Mordatch, Azade Nova, Roman Novak, Aaron Parisi, Jeffrey Pennington, Alex Rizkowsky, Isabelle Simpson, Hanie Sedghi, Jascha Sohl-Dickstein, Kevin Swersky, Sharad Vikram, Tris Warkentin, Lechao Xiao, Kelvin Xu, Jasper Snoek, Simon Kornblith:
Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability. CoRR abs/2408.07852 (2024)
Link: https://doi.org/10.48550/arXiv.2408.07852
dblp: https://dblp.org/rec/journals/corr/abs-2408-07852

Kiran Vodrahalli, Santiago Ontanon, Nilesh Tripuraneni, Kelvin Xu, Sanil Jain, Rakesh Shivanna, Jeffrey Hui, Nishanth Dikkala, Mehran Kazemi, Bahare Fatemi, Rohan Anil, Ethan Dyer, Siamak Shakeri, Roopali Vij, Harsh Mehta, Vinay V. Ramasesh, Quoc Le, Ed H. Chi, Yifeng Lu, Orhan Firat, Angeliki Lazaridou, Jean-Baptiste Lespiau, Nithya Attaluri, Kate Olszewska:
Michelangelo: Long Context Evaluations Beyond Haystacks via Latent Structure Queries. CoRR abs/2409.12640 (2024)
Link: https://doi.org/10.48550/arXiv.2409.12640
dblp: https://dblp.org/rec/journals/corr/abs-2409-12640

Kelvin Xu, Zheyuan Hu, Ria Doshi, Aaron Rovinsky, Vikash Kumar, Abhishek Gupta, Sergey Levine:
Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance. ICRA 2023: 5938-5945
Link: https://doi.org/10.1109/ICRA48891.2023.10161493
dblp: https://dblp.org/rec/conf/icra/XuHDRKGL23

Mitchell Wortsman, Peter J. Liu, Lechao Xiao, Katie Everett, Alex Alemi, Ben Adlam, John D. Co-Reyes, Izzeddin Gur, Abhishek Kumar, Roman Novak, Jeffrey Pennington, Jascha Sohl-Dickstein, Kelvin Xu, Jaehoon Lee, Justin Gilmer, Simon Kornblith:
Small-scale proxies for large-scale Transformer training instabilities. CoRR abs/2309.14322 (2023)
Link: https://doi.org/10.48550/arXiv.2309.14322
dblp: https://dblp.org/rec/journals/corr/abs-2309-14322

C. Daniel Freeman, Laura Culp, Aaron Parisi, Maxwell L. Bileschi, Gamaleldin F. Elsayed, Alex Rizkowsky, Isabelle Simpson, Alex Alemi, Azade Nova, Ben Adlam, Bernd Bohnet, Gaurav Mishra, Hanie Sedghi, Igor Mordatch, Izzeddin Gur, Jaehoon Lee, John D. Co-Reyes, Jeffrey Pennington, Kelvin Xu, Kevin Swersky, Kshiteej Mahajan, Lechao Xiao, Rosanne Liu, Simon Kornblith, Noah Constant, Peter J. Liu, Roman Novak, Yundi Qian, Noah Fiedel, Jascha Sohl-Dickstein:
Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5? CoRR abs/2311.07587 (2023)
Link: https://doi.org/10.48550/arXiv.2311.07587
dblp: https://dblp.org/rec/journals/corr/abs-2311-07587

Marwa Abdulhai, Isadora White, Charlie Snell, Charles Sun, Joey Hong, Yuexiang Zhai, Kelvin Xu, Sergey Levine:
LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models. CoRR abs/2311.18232 (2023)
Link: https://doi.org/10.48550/arXiv.2311.18232
dblp: https://dblp.org/rec/journals/corr/abs-2311-18232

Avi Singh, John D. Co-Reyes, Rishabh Agarwal, Ankesh Anand, Piyush Patil, Xavier Garcia, Peter J. Liu, James Harrison, Jaehoon Lee, Kelvin Xu, Aaron Parisi, Abhishek Kumar, Alex Alemi, Alex Rizkowsky, Azade Nova, Ben Adlam, Bernd Bohnet, Gamaleldin F. Elsayed, Hanie Sedghi, Igor Mordatch, Isabelle Simpson, Izzeddin Gur, Jasper Snoek, Jeffrey Pennington, Jiri Hron, Kathleen Kenealy, Kevin Swersky, Kshiteej Mahajan, Laura Culp, Lechao Xiao, Maxwell L. Bileschi, Noah Constant, Roman Novak, Rosanne Liu, Tris Warkentin, Yundi Qian, Yamini Bansal, Ethan Dyer, Behnam Neyshabur, Jascha Sohl-Dickstein, Noah Fiedel:
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models. CoRR abs/2312.06585 (2023)
Link: https://doi.org/10.48550/arXiv.2312.06585
dblp: https://dblp.org/rec/journals/corr/abs-2312-06585

Kelvin Xu:
Towards Adaptive, Continual Embodied Agents. University of California, Berkeley, USA, 2022
Link: https://www.escholarship.org/uc/item/3tk9g0b7
dblp: https://dblp.org/rec/phd/us/Xu22f

Archit Sharma, Kelvin Xu, Nikhil Sardana, Abhishek Gupta, Karol Hausman, Sergey Levine, Chelsea Finn:
Autonomous Reinforcement Learning: Formalism and Benchmarking. ICLR 2022
Link: https://openreview.net/forum?id=nkaba3ND7B5
dblp: https://dblp.org/rec/conf/iclr/SharmaXS0HLF22

Kelvin Xu, Zheyuan Hu, Ria Doshi, Aaron Rovinsky, Vikash Kumar, Abhishek Gupta, Sergey Levine:
Dexterous Manipulation from Images: Autonomous Real-World RL via Substep Guidance. CoRR abs/2212.09902 (2022)
Link: https://doi.org/10.48550/arXiv.2212.09902
dblp: https://dblp.org/rec/journals/corr/abs-2212-09902

Abhishek Gupta, Justin Yu, Tony Z. Zhao, Vikash Kumar, Aaron Rovinsky, Kelvin Xu, Thomas Devlin, Sergey Levine:
Reset-Free Reinforcement Learning via Multi-Task Learning: Learning Dexterous Manipulation Behaviors without Human Intervention. ICRA 2021: 6664-6671
Link: https://doi.org/10.1109/ICRA48506.2021.9561384
dblp: https://dblp.org/rec/conf/icra/0004YZKRXDL21

Abhishek Gupta, Justin Yu, Tony Z. Zhao, Vikash Kumar, Aaron Rovinsky, Kelvin Xu, Thomas Devlin, Sergey Levine:
Reset-Free Reinforcement Learning via Multi-Task Learning: Learning Dexterous Manipulation Behaviors without Human Intervention. CoRR abs/2104.11203 (2021)
Link: https://arxiv.org/abs/2104.11203
dblp: https://dblp.org/rec/journals/corr/abs-2104-11203

Archit Sharma, Kelvin Xu, Nikhil Sardana, Abhishek Gupta, Karol Hausman, Sergey Levine, Chelsea Finn:
Autonomous Reinforcement Learning: Formalism and Benchmarking. CoRR abs/2112.09605 (2021)
Link: https://arxiv.org/abs/2112.09605
dblp: https://dblp.org/rec/journals/corr/abs-2112-09605

Eleni Triantafillou, Tyler Zhu, Vincent Dumoulin, Pascal Lamblin, Utku Evci, Kelvin Xu, Ross Goroshin, Carles Gelada, Kevin Swersky, Pierre-Antoine Manzagol, Hugo Larochelle:
Meta-Dataset: A Dataset of Datasets for Learning to Learn from Few Examples. ICLR 2020
Link: https://openreview.net/forum?id=rkgAGAVKPr
dblp: https://dblp.org/rec/conf/iclr/TriantafillouZD20

Kelvin Xu, Siddharth Verma, Chelsea Finn, Sergey Levine:
Continual Learning of Control Primitives : Skill Discovery via Reset-Games. NeurIPS 2020
Link: https://proceedings.neurips.cc/paper/2020/hash/3472ab80b6dff70c54758fd6dfc800c2-Abstract.html
dblp: https://dblp.org/rec/conf/nips/XuVFL20

Kelvin Xu, Siddharth Verma, Chelsea Finn, Sergey Levine:
Continual Learning of Control Primitives: Skill Discovery via Reset-Games. CoRR abs/2011.05286 (2020)
Link: https://arxiv.org/abs/2011.05286
dblp: https://dblp.org/rec/journals/corr/abs-2011-05286

Kelvin Xu, Ellis Ratner, Anca D. Dragan, Sergey Levine, Chelsea Finn:
Learning a Prior over Intent via Meta-Inverse Reinforcement Learning. ICML 2019: 6952-6962
Link: http://proceedings.mlr.press/v97/xu19d.html
dblp: https://dblp.org/rec/conf/icml/XuRDLF19

Yangfan Sun, Renlong Hang, Zhu Li, Mouqing Jin, Kelvin Xu:
Privacy-Preserving Fall Detection with Deep Learning on mmWave Radar Signal. VCIP 2019: 1-4
Link: https://doi.org/10.1109/VCIP47243.2019.8965661
dblp: https://dblp.org/rec/conf/vcip/SunH0JX19

Eleni Triantafillou, Tyler Zhu, Vincent Dumoulin, Pascal Lamblin, Kelvin Xu, Ross Goroshin, Carles Gelada, Kevin Swersky, Pierre-Antoine Manzagol, Hugo Larochelle:
Meta-Dataset: A Dataset of Datasets for Learning to Learn from Few Examples. CoRR abs/1903.03096 (2019)
Link: http://arxiv.org/abs/1903.03096
dblp: https://dblp.org/rec/journals/corr/abs-1903-03096

Ofir Nachum, Mohammad Norouzi, Kelvin Xu, Dale Schuurmans:
Trust-PCL: An Off-Policy Trust Region Method for Continuous Control. ICLR (Poster) 2018
Link: https://openreview.net/forum?id=HyrCWeWCb
dblp: https://dblp.org/rec/conf/iclr/Nachum0XS18

Chelsea Finn, Kelvin Xu, Sergey Levine:
Probabilistic Model-Agnostic Meta-Learning. NeurIPS 2018: 9537-9548
Link: https://proceedings.neurips.cc/paper/2018/hash/8e2c381d4dd04f1c55093f22c59c3a08-Abstract.html
dblp: https://dblp.org/rec/conf/nips/FinnXL18

Kelvin Xu, Ellis Ratner, Anca D. Dragan, Sergey Levine, Chelsea Finn:
Learning a Prior over Intent via Meta-Inverse Reinforcement Learning. CoRR abs/1805.12573 (2018)
Link: http://arxiv.org/abs/1805.12573
dblp: https://dblp.org/rec/journals/corr/abs-1805-12573

Chelsea Finn, Kelvin Xu, Sergey Levine:
Probabilistic Model-Agnostic Meta-Learning. CoRR abs/1806.02817 (2018)
Link: http://arxiv.org/abs/1806.02817
dblp: https://dblp.org/rec/journals/corr/abs-1806-02817

Çaglar Gülçehre, Orhan Firat, Kelvin Xu, Kyunghyun Cho, Yoshua Bengio:
On integrating a language model into neural machine translation. Comput. Speech Lang. 45: 137-148 (2017)
Link: https://doi.org/10.1016/j.csl.2017.01.014
dblp: https://dblp.org/rec/journals/csl/GulcehreFXCB17

Dzmitry Bahdanau, Philemon Brakel, Kelvin Xu, Anirudh Goyal, Ryan Lowe, Joelle Pineau, Aaron C. Courville, Yoshua Bengio:
An Actor-Critic Algorithm for Sequence Prediction. ICLR (Poster) 2017
Link: https://openreview.net/forum?id=SJDaqqveg
dblp: https://dblp.org/rec/conf/iclr/BahdanauBXGLPCB17

Pierre Sermanet, Kelvin Xu, Sergey Levine:
Unsupervised Perceptual Rewards for Imitation Learning. ICLR (Workshop) 2017
Link: https://openreview.net/forum?id=Byf3mmNFl
dblp: https://dblp.org/rec/conf/iclr/SermanetXL17

Ofir Nachum, Mohammad Norouzi, Kelvin Xu, Dale Schuurmans:
Bridging the Gap Between Value and Policy Based Reinforcement Learning. NIPS 2017: 2775-2785
Link: https://proceedings.neurips.cc/paper/2017/hash/facf9f743b083008a894eee7baa16469-Abstract.html
dblp: https://dblp.org/rec/conf/nips/NachumNXS17

Pierre Sermanet, Kelvin Xu, Sergey Levine:
Unsupervised Perceptual Rewards for Imitation Learning. Robotics: Science and Systems 2017
Link: http://www.roboticsproceedings.org/rss13/p50.html
dblp: https://dblp.org/rec/conf/rss/SermanetXL17

Ofir Nachum, Mohammad Norouzi, Kelvin Xu, Dale Schuurmans:
Bridging the Gap Between Value and Policy Based Reinforcement Learning. CoRR abs/1702.08892 (2017)
Link: http://arxiv.org/abs/1702.08892
dblp: https://dblp.org/rec/journals/corr/NachumNXS17

Ofir Nachum, Mohammad Norouzi, Kelvin Xu, Dale Schuurmans:
Trust-PCL: An Off-Policy Trust Region Method for Continuous Control. CoRR abs/1707.01891 (2017)
Link: http://arxiv.org/abs/1707.01891
dblp: https://dblp.org/rec/journals/corr/NachumNXS17aa

Rami Al-Rfou, Guillaume Alain, Amjad Almahairi, Christof Angermüller, Dzmitry Bahdanau, Nicolas Ballas, Frédéric Bastien, Justin Bayer, Anatoly Belikov, Alexander Belopolsky, Yoshua Bengio, Arnaud Bergeron, James Bergstra, Valentin Bisson, Josh Bleecher Snyder, Nicolas Bouchard, Nicolas Boulanger-Lewandowski, Xavier Bouthillier, Alexandre de Brébisson, Olivier Breuleux, Pierre Luc Carrier, Kyunghyun Cho, Jan Chorowski, Paul F. Christiano, Tim Cooijmans, Marc-Alexandre Côté, Myriam Côté, Aaron C. Courville, Yann N. Dauphin, Olivier Delalleau, Julien Demouth, Guillaume Desjardins, Sander Dieleman, Laurent Dinh, Melanie Ducoffe, Vincent Dumoulin, Samira Ebrahimi Kahou, Dumitru Erhan, Ziye Fan, Orhan Firat, Mathieu Germain, Xavier Glorot, Ian J. Goodfellow, Matthew Graham, Çaglar Gülçehre, Philippe Hamel, Iban Harlouchet, Jean-Philippe Heng, Balázs Hidasi, Sina Honari, Arjun Jain, Sébastien Jean, Kai Jia, Mikhail Korobov, Vivek Kulkarni, Alex Lamb, Pascal Lamblin, Eric Larsen, César Laurent, Sean Lee, Simon Lefrançois, Simon Lemieux, Nicholas Léonard, Zhouhan Lin, Jesse A. Livezey, Cory Lorenz, Jeremiah Lowin, Qianli Ma, Pierre-Antoine Manzagol, Olivier Mastropietro, Robert McGibbon, Roland Memisevic, Bart van Merriënboer, Vincent Michalski, Mehdi Mirza, Alberto Orlandi, Christopher Joseph Pal, Razvan Pascanu, Mohammad Pezeshki, Colin Raffel, Daniel Renshaw, Matthew Rocklin, Adriana Romero, Markus Roth, Peter Sadowski, John Salvatier, François Savard, Jan Schlüter, John Schulman, Gabriel Schwartz, Iulian Vlad Serban, Dmitriy Serdyuk, Samira Shabanian, Étienne Simon, Sigurd Spieckermann, S. Ramana Subramanyam, Jakub Sygnowski, Jérémie Tanguay, Gijs van Tulder, Joseph P. Turian, Sebastian Urban, Pascal Vincent, Francesco Visin, Harm de Vries, David Warde-Farley, Dustin J. Webb, Matthew Willson, Kelvin Xu, Lijun Xue, Li Yao, Saizheng Zhang, Ying Zhang:
Theano: A Python framework for fast computation of mathematical expressions. CoRR abs/1605.02688 (2016)
Link: http://arxiv.org/abs/1605.02688
dblp: https://dblp.org/rec/journals/corr/Al-RfouAAa16

Dzmitry Bahdanau, Philemon Brakel, Kelvin Xu, Anirudh Goyal, Ryan Lowe, Joelle Pineau, Aaron C. Courville, Yoshua Bengio:
An Actor-Critic Algorithm for Sequence Prediction. CoRR abs/1607.07086 (2016)
Link: http://arxiv.org/abs/1607.07086
dblp: https://dblp.org/rec/journals/corr/BahdanauBXGLPCB16

Pierre Sermanet, Kelvin Xu, Sergey Levine:
Unsupervised Perceptual Rewards for Imitation Learning. CoRR abs/1612.06699 (2016)
Link: http://arxiv.org/abs/1612.06699
dblp: https://dblp.org/rec/journals/corr/SermanetXL16

Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron C. Courville, Ruslan Salakhutdinov, Richard S. Zemel, Yoshua Bengio:
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention. ICML 2015: 2048-2057
Link: http://proceedings.mlr.press/v37/xuc15.html
dblp: https://dblp.org/rec/conf/icml/XuBKCCSZB15

Kelvin Xu, Jimmy Ba, Ryan Kiros, Kyunghyun Cho, Aaron C. Courville, Ruslan Salakhutdinov, Richard S. Zemel, Yoshua Bengio:
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention. CoRR abs/1502.03044 (2015)
Link: http://arxiv.org/abs/1502.03044
dblp: https://dblp.org/rec/journals/corr/XuBKCCSZB15

Çaglar Gülçehre, Orhan Firat, Kelvin Xu, Kyunghyun Cho, Loïc Barrault, Huei-Chi Lin, Fethi Bougares, Holger Schwenk, Yoshua Bengio:
On Using Monolingual Corpora in Neural Machine Translation. CoRR abs/1503.03535 (2015)
Link: http://arxiv.org/abs/1503.03535
dblp: https://dblp.org/rec/journals/corr/GulcehreFXCBLBS15

Marcin Moczulski, Kelvin Xu, Aaron C. Courville, KyungHyun Cho:
A Controller Recognizer Framework: How necessary is recognition for control? CoRR abs/1511.06428 (2015)
Link: http://arxiv.org/abs/1511.06428
dblp: https://dblp.org/rec/journals/corr/MoczulskiXCC15