Simon S. Du
Person information
- unicode name: 杜少雷
- affiliation: University of Washington, USA
- affiliation (former): Carnegie Mellon University, Machine Learning Department
2020 – today
- 2024
- [c103] Runlong Zhou, Simon S. Du, Beibin Li: Reflect-RL: Two-Player Online RL Fine-Tuning for LMs. ACL (1) 2024: 995-1015
- [c102] Gantavya Bhatt, Yifang Chen, Arnav Mohanty Das, Jifan Zhang, Sang T. Truong, Stephen Mussmann, Yinglun Zhu, Jeff A. Bilmes, Simon S. Du, Kevin G. Jamieson, Jordan T. Ash, Robert D. Nowak: An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models. ACL (Findings) 2024: 6549-6560
- [c101] Yan Dai, Qiwen Cui, Simon S. Du: Refined Sample Complexity for Markov Games with Independent Linear Function Approximation (Extended Abstract). COLT 2024: 1260-1261
- [c100] Zihan Zhang, Yuxin Chen, Jason D. Lee, Simon S. Du: Settling the sample complexity of online reinforcement learning. COLT 2024: 5213-5219
- [c99] Zihan Zhang, Wenhao Zhan, Yuxin Chen, Simon S. Du, Jason D. Lee: Optimal Multi-Distribution Learning. COLT 2024: 5220-5223
- [c98] Haozhe Jiang, Qiwen Cui, Zhihan Xiong, Maryam Fazel, Simon Shaolei Du: A Black-box Approach for Non-stationary Multi-agent Reinforcement Learning. ICLR 2024
- [c97] Kaifeng Lyu, Jikai Jin, Zhiyuan Li, Simon Shaolei Du, Jason D. Lee, Wei Hu: Dichotomy of Early and Late Phase Implicit Biases Can Provably Induce Grokking. ICLR 2024
- [c96] Ruizhe Shi, Yuyao Liu, Yanjie Ze, Simon Shaolei Du, Huazhe Xu: Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning. ICLR 2024
- [c95] Yuandong Tian, Yiping Wang, Zhenyu Zhang, Beidi Chen, Simon Shaolei Du: JoMA: Demystifying Multilayer Transformers via Joint Dynamics of MLP and Attention. ICLR 2024
- [c94] Nuoya Xiong, Lijun Ding, Simon Shaolei Du: How Over-Parameterization Slows Down Gradient Descent in Matrix Sensing: The Curses of Symmetry and Initialization. ICLR 2024
- [c93] Zihan Zhang, Jason D. Lee, Yuxin Chen, Simon Shaolei Du: Horizon-Free Regret for Linear Markov Decision Processes. ICLR 2024
- [c92] Zhaoyi Zhou, Chuning Zhu, Runlong Zhou, Qiwen Cui, Abhishek Gupta, Simon Shaolei Du: Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning. ICLR 2024
- [c91] Chenhao Lu, Ruizhe Shi, Yuyao Liu, Kaizhe Hu, Simon Shaolei Du, Huazhe Xu: Rethinking Transformers in Solving POMDPs. ICML 2024
- [i128] Gantavya Bhatt, Yifang Chen, Arnav Mohanty Das, Jifan Zhang, Sang T. Truong, Stephen Mussmann, Yinglun Zhu, Jeffrey A. Bilmes, Simon S. Du, Kevin G. Jamieson, Jordan T. Ash, Robert D. Nowak: An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models. CoRR abs/2401.06692 (2024)
- [i127] Yiping Wang, Yifang Chen, Wendan Yan, Kevin G. Jamieson, Simon Shaolei Du: Variance Alignment Score: A Simple But Tough-to-Beat Data Selection Method for Multimodal Contrastive Learning. CoRR abs/2402.02055 (2024)
- [i126] Yan Dai, Qiwen Cui, Simon S. Du: Refined Sample Complexity for Markov Games with Independent Linear Function Approximation. CoRR abs/2402.07082 (2024)
- [i125] Qiwen Cui, Maryam Fazel, Simon S. Du: Learning Optimal Tax Design in Nonatomic Congestion Games. CoRR abs/2402.07437 (2024)
- [i124] Avinandan Bose, Simon Shaolei Du, Maryam Fazel: Offline Multi-task Transfer RL with Representational Penalization. CoRR abs/2402.12570 (2024)
- [i123] Runlong Zhou, Simon S. Du, Beibin Li: Reflect-RL: Two-Player Online RL Fine-Tuning for LMs. CoRR abs/2402.12621 (2024)
- [i122] Chuning Zhu, Xinqi Wang, Tyler Han, Simon S. Du, Abhishek Gupta: Transferable Reinforcement Learning via Generalized Occupancy Models. CoRR abs/2403.06328 (2024)
- [i121] Zihan Zhang, Jason D. Lee, Yuxin Chen, Simon S. Du: Horizon-Free Regret for Linear Markov Decision Processes. CoRR abs/2403.10738 (2024)
- [i120] Chenhao Lu, Ruizhe Shi, Yuyao Liu, Kaizhe Hu, Simon S. Du, Huazhe Xu: Rethinking Transformers in Solving POMDPs. CoRR abs/2405.17358 (2024)
- [i119] Yiping Wang, Yifang Chen, Wendan Yan, Alex Fang, Wenjing Zhou, Kevin Jamieson, Simon Shaolei Du: CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning. CoRR abs/2405.19547 (2024)
- [i118] Ruizhe Shi, Yifang Chen, Yushi Hu, Alisa Liu, Hannaneh Hajishirzi, Noah A. Smith, Simon S. Du: Decoding-Time Language Model Alignment with Multiple Objectives. CoRR abs/2406.18853 (2024)
- [i117] Weihang Xu, Maryam Fazel, Simon S. Du: Toward Global Convergence of Gradient EM for Over-Parameterized Gaussian Mixture Models. CoRR abs/2407.00490 (2024)
- [i116] Yifang Chen, Shuohang Wang, Ziyi Yang, Hiteshi Sharma, Nikos Karampatziakis, Donghan Yu, Kevin G. Jamieson, Simon Shaolei Du, Yelong Shen: Cost-Effective Proxy Reward Model Construction with On-Policy and Active Learning. CoRR abs/2407.02119 (2024)
- [i115] Divyansh Pareek, Simon S. Du, Sewoong Oh: Understanding the Gains from Repeated Self-Distillation. CoRR abs/2407.04600 (2024)
- [i114] Natalia Zhang, Xinqi Wang, Qiwen Cui, Runlong Zhou, Sham M. Kakade, Simon S. Du: Multi-Agent Reinforcement Learning from Human Feedback: Data Coverage and Algorithmic Techniques. CoRR abs/2409.00717 (2024)
- [i113] Ruizhe Shi, Runlong Zhou, Simon S. Du: The Crucial Role of Samplers in Online Direct Preference Optimization. CoRR abs/2409.19605 (2024)
- [i112] Xiyu Zhai, Runlong Zhou, Liao Zhang, Simon Shaolei Du: Transformers are Efficient Compilers, Provably. CoRR abs/2410.14706 (2024)
- 2023
- [j6] Wenqing Zheng, Hao (Frank) Yang, Jiarui Cai, Peihao Wang, Xuan Jiang, Simon Shaolei Du, Yinhai Wang, Zhangyang Wang: Integrating the traffic science with representation learning for city-wide network congestion prediction. Inf. Fusion 99: 101837 (2023)
- [j5] Shusheng Xu, Yancheng Liang, Yunfei Li, Simon Shaolei Du, Yi Wu: Beyond Information Gain: An Empirical Benchmark for Low-Switching-Cost Reinforcement Learning. Trans. Mach. Learn. Res. 2023 (2023)
- [j4] Runlong Zhou, Zelin He, Yuandong Tian, Yi Wu, Simon Shaolei Du: Understanding Curriculum Learning in Policy Optimization for Online Combinatorial Optimization. Trans. Mach. Learn. Res. 2023 (2023)
- [c90] Yulai Zhao, Jianshu Chen, Simon S. Du: Blessing of Class Diversity in Pre-training. AISTATS 2023: 283-305
- [c89] Weihang Xu, Simon S. Du: Over-Parameterization Exponentially Slows Down Gradient Descent for Learning a Single Neuron. COLT 2023: 1155-1198
- [c88] Qiwen Cui, Kaiqing Zhang, Simon S. Du: Breaking the Curse of Multiagents in a Large State Space: RL in Markov Games with Independent Linear Function Approximation. COLT 2023: 2651-2652
- [c87] Yan Dai, Ruosong Wang, Simon Shaolei Du: Variance-Aware Sparse Linear Bandits. ICLR 2023
- [c86] Shicong Cen, Yuejie Chi, Simon Shaolei Du, Lin Xiao: Faster Last-iterate Convergence of Policy Optimization in Zero-Sum Markov Games. ICLR 2023
- [c85] Haozhe Jiang, Qiwen Cui, Zhihan Xiong, Maryam Fazel, Simon Shaolei Du: Offline Congestion Games: How Feedback Type Affects Data Coverage Requirement. ICLR 2023
- [c84] Rui Yuan, Simon Shaolei Du, Robert M. Gower, Alessandro Lazaric, Lin Xiao: Linear Convergence of Natural Policy Gradient Methods with Log-Linear Policies. ICLR 2023
- [c83] Jikai Jin, Zhiyuan Li, Kaifeng Lyu, Simon Shaolei Du, Jason D. Lee: Understanding Incremental Learning of Gradient Descent: A Fine-grained Analysis of Matrix Sensing. ICML 2023: 15200-15238
- [c82] Yiping Wang, Yifang Chen, Kevin Jamieson, Simon Shaolei Du: Improved Active Multi-Task Representation Learning via Lasso. ICML 2023: 35548-35578
- [c81] Haotian Ye, Xiaoyu Chen, Liwei Wang, Simon Shaolei Du: On the Power of Pre-training for Generalization in RL: Provable Benefits and Hardness. ICML 2023: 39770-39800
- [c80] Runlong Zhou, Ruosong Wang, Simon Shaolei Du: Horizon-Free and Variance-Dependent Reinforcement Learning for Latent Markov Decision Processes. ICML 2023: 42698-42723
- [c79] Runlong Zhou, Zihan Zhang, Simon Shaolei Du: Sharp Variance-Dependent Bounds in Reinforcement Learning: Best of Both Worlds in Stochastic and Deterministic Environments. ICML 2023: 42878-42914
- [c78] Yifang Chen, Yingbing Huang, Simon S. Du, Kevin G. Jamieson, Guanya Shi: Active representation learning for general task space with applications in robotics. NeurIPS 2023
- [c77] Yuandong Tian, Yiping Wang, Beidi Chen, Simon S. Du: Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer. NeurIPS 2023
- [c76] Yunchang Yang, Han Zhong, Tianhao Wu, Bin Liu, Liwei Wang, Simon S. Du: A Reduction-based Framework for Sequential Decision Making with Delayed Feedback. NeurIPS 2023
- [c75] Angela Yuan, Chris Junchi Li, Gauthier Gidel, Michael I. Jordan, Quanquan Gu, Simon S. Du: Optimal Extragradient-Based Algorithms for Stochastic Variational Inequalities with Separable Structure. NeurIPS 2023
- [i111] Jikai Jin, Zhiyuan Li, Kaifeng Lyu, Simon S. Du, Jason D. Lee: Understanding Incremental Learning of Gradient Descent: A Fine-grained Analysis of Matrix Sensing. CoRR abs/2301.11500 (2023)
- [i110] Runlong Zhou, Zihan Zhang, Simon S. Du: Sharp Variance-Dependent Bounds in Reinforcement Learning: Best of Both Worlds in Stochastic and Deterministic Environments. CoRR abs/2301.13446 (2023)
- [i109] Yunchang Yang, Han Zhong, Tianhao Wu, Bin Liu, Liwei Wang, Simon S. Du: A Reduction-based Framework for Sequential Decision Making with Delayed Feedback. CoRR abs/2302.01477 (2023)
- [i108] Qiwen Cui, Kaiqing Zhang, Simon S. Du: Breaking the Curse of Multiagents in a Large State Space: RL in Markov Games with Independent Linear Function Approximation. CoRR abs/2302.03673 (2023)
- [i107] Weihang Xu, Simon S. Du: Over-Parameterization Exponentially Slows Down Gradient Descent for Learning a Single Neuron. CoRR abs/2302.10034 (2023)
- [i106] Yuandong Tian, Yiping Wang, Beidi Chen, Simon S. Du: Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer. CoRR abs/2305.16380 (2023)
- [i105] Yiping Wang, Yifang Chen, Kevin G. Jamieson, Simon S. Du: Improved Active Multi-Task Representation Learning via Lasso. CoRR abs/2306.02556 (2023)
- [i104] Haozhe Jiang, Qiwen Cui, Zhihan Xiong, Maryam Fazel, Simon S. Du: A Black-box Approach for Non-stationary Multi-agent Reinforcement Learning. CoRR abs/2306.07465 (2023)
- [i103] Yifang Chen, Yingbing Huang, Simon S. Du, Kevin G. Jamieson, Guanya Shi: Active Representation Learning for General Task Space with Applications in Robotics. CoRR abs/2306.08942 (2023)
- [i102] Jifan Zhang, Yifang Chen, Gregory Canal, Stephen Mussmann, Yinglun Zhu, Simon Shaolei Du, Kevin G. Jamieson, Robert D. Nowak: LabelBench: A Comprehensive Framework for Benchmarking Label-Efficient Learning. CoRR abs/2306.09910 (2023)
- [i101] Zihan Zhang, Yuxin Chen, Jason D. Lee, Simon S. Du: Settling the Sample Complexity of Online Reinforcement Learning. CoRR abs/2307.13586 (2023)
- [i100] Yuandong Tian, Yiping Wang, Zhenyu Zhang, Beidi Chen, Simon S. Du: JoMA: Demystifying Multilayer Transformers via JOint Dynamics of MLP and Attention. CoRR abs/2310.00535 (2023)
- [i99] Nuoya Xiong, Lijun Ding, Simon S. Du: How Over-Parameterization Slows Down Gradient Descent in Matrix Sensing: The Curses of Symmetry and Initialization. CoRR abs/2310.01769 (2023)
- [i98] Zhaoyi Zhou, Chuning Zhu, Runlong Zhou, Qiwen Cui, Abhishek Gupta, Simon Shaolei Du: Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning. CoRR abs/2310.19308 (2023)
- [i97] Ruizhe Shi, Yuyao Liu, Yanjie Ze, Simon S. Du, Huazhe Xu: Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning. CoRR abs/2310.20587 (2023)
- [i96] Kaifeng Lyu, Jikai Jin, Zhiyuan Li, Simon S. Du, Jason D. Lee, Wei Hu: Dichotomy of Early and Late Phase Implicit Biases Can Provably Induce Grokking. CoRR abs/2311.18817 (2023)
- [i95] Zihan Zhang, Wenhao Zhan, Yuxin Chen, Simon S. Du, Jason D. Lee: Optimal Multi-Distribution Learning. CoRR abs/2312.05134 (2023)
- 2022
- [j3] Bin Shi, Simon S. Du, Michael I. Jordan, Weijie J. Su: Understanding the acceleration phenomenon via high-resolution differential equations. Math. Program. 195(1): 79-148 (2022)
- [c74] Xiaoxia Wu, Yuege Xie, Simon Shaolei Du, Rachel A. Ward: AdaLoss: A Computationally-Efficient and Provably Convergent Adaptive Gradient Method. AAAI 2022: 8691-8699
- [c73] Zehao Dou, Zhuoran Yang, Zhaoran Wang, Simon S. Du: Gap-Dependent Bounds for Two-Player Markov Games. AISTATS 2022: 432-455
- [c72] Yulai Zhao, Yuandong Tian, Jason D. Lee, Simon S. Du: Provably Efficient Policy Optimization for Two-Player Zero-Sum Markov Games. AISTATS 2022: 2736-2761
- [c71] Zihan Zhang, Xiangyang Ji, Simon S. Du: Horizon-Free Reinforcement Learning in Polynomial Time: the Power of Stationary Policies. COLT 2022: 3858-3904
- [c70] Zhili Feng, Shaobo Han, Simon Shaolei Du: Provable Adaptation across Multiway Domains via Representation Learning. ICLR 2022
- [c69] Yunchang Yang, Tianhao Wu, Han Zhong, Evrard Garcelon, Matteo Pirotta, Alessandro Lazaric, Liwei Wang, Simon Shaolei Du: A Reduction-Based Framework for Conservative Bandits and Reinforcement Learning. ICLR 2022
- [c68] Haoyuan Cai, Tengyu Ma, Simon S. Du: Near-Optimal Algorithms for Autonomous Exploration and Multi-Goal Stochastic Shortest Path. ICML 2022: 2434-2456
- [c67] Yifang Chen, Kevin G. Jamieson, Simon S. Du: Active Multi-Task Representation Learning. ICML 2022: 3271-3298
- [c66] Andrew J. Wagenmaker, Yifang Chen, Max Simchowitz, Simon S. Du, Kevin G. Jamieson: First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach. ICML 2022: 22384-22429
- [c65] Andrew J. Wagenmaker, Yifang Chen, Max Simchowitz, Simon S. Du, Kevin G. Jamieson: Reward-Free RL is No Harder Than Reward-Aware RL in Linear Markov Decision Processes. ICML 2022: 22430-22456
- [c64] Tongzhou Wang, Simon S. Du, Antonio Torralba, Phillip Isola, Amy Zhang, Yuandong Tian: Denoised MDPs: Learning World Models Better Than the World Itself. ICML 2022: 22591-22612
- [c63] Tianhao Wu, Yunchang Yang, Han Zhong, Liwei Wang, Simon S. Du, Jiantao Jiao: Nearly Optimal Policy Optimization with Stable at Any Time Guarantee. ICML 2022: 24243-24265
- [c62] Qiwen Cui, Simon S. Du: Provably Efficient Offline Multi-agent Reinforcement Learning via Strategy-wise Bonus. NeurIPS 2022
- [c61] Qiwen Cui, Simon S. Du: When are Offline Two-Player Zero-Sum Markov Games Solvable? NeurIPS 2022
- [c60] Qiwen Cui, Zhihan Xiong, Maryam Fazel, Simon S. Du: Learning in Congestion Games with Bandit Feedback. NeurIPS 2022
- [c59] Rui Lu, Andrew Zhao, Simon S. Du, Gao Huang: Provable General Function Class Representation Learning in Multitask Bandits and MDP. NeurIPS 2022
- [c58] Xinqi Wang, Qiwen Cui, Simon S. Du: On Gap-dependent Bounds for Offline Reinforcement Learning. NeurIPS 2022
- [c57] Zhihan Xiong, Ruoqi Shen, Qiwen Cui, Maryam Fazel, Simon S. Du: Near-Optimal Randomized Exploration for Tabular Markov Decision Processes. NeurIPS 2022
- [i94] Qiwen Cui, Simon S. Du: When is Offline Two-Player Zero-Sum Markov Game Solvable? CoRR abs/2201.03522 (2022)
- [i93] Andrew Wagenmaker, Yifang Chen, Max Simchowitz, Simon S. Du, Kevin Jamieson: Reward-Free RL is No Harder Than Reward-Aware RL in Linear Markov Decision Processes. CoRR abs/2201.11206 (2022)
- [i92] Yifang Chen, Simon S. Du, Kevin Jamieson: Active Multi-Task Representation Learning. CoRR abs/2202.00911 (2022)
- [i91] Meixin Zhu, Simon S. Du, Xuesong Wang, Hao (Frank) Yang, Ziyuan Pu, Yinhai Wang: TransFollower: Long-Sequence Car-Following Trajectory Prediction through Transformer. CoRR abs/2202.03183 (2022)
- [i90] Runlong Zhou, Yuandong Tian, Yi Wu, Simon S. Du: Understanding Curriculum Learning in Policy Optimization for Solving Combinatorial Optimization Problems. CoRR abs/2202.05423 (2022)
- [i89] Zihan Zhang, Xiangyang Ji, Simon S. Du: Horizon-Free Reinforcement Learning in Polynomial Time: the Power of Stationary Policies. CoRR abs/2203.12922 (2022)
- [i88] Jiaqi Yang, Qi Lei, Jason D. Lee, Simon S. Du: Nearly Minimax Algorithms for Linear Bandits with Shared Representation. CoRR abs/2203.15664 (2022)
- [i87] Haoyuan Cai, Tengyu Ma, Simon S. Du: Near-Optimal Algorithms for Autonomous Exploration and Multi-Goal Stochastic Shortest Path. CoRR abs/2205.10729 (2022)
- [i86] Yan Dai, Ruosong Wang, Simon S. Du: Variance-Aware Sparse Linear Bandits. CoRR abs/2205.13450 (2022)
- [i85] Rui Lu, Andrew Zhao, Simon S. Du, Gao Huang: Provable General Function Class Representation Learning in Multitask Bandits and MDPs. CoRR abs/2205.15701 (2022)
- [i84] Qiwen Cui, Simon S. Du: Provably Efficient Offline Multi-agent Reinforcement Learning via Strategy-wise Bonus. CoRR abs/2206.00159 (2022)
- [i83] Xinqi Wang, Qiwen Cui, Simon S. Du: On Gap-dependent Bounds for Offline Reinforcement Learning. CoRR abs/2206.00177 (2022)
- [i82] Qiwen Cui, Zhihan Xiong, Maryam Fazel, Simon S. Du: Learning in Congestion Games with Bandit Feedback. CoRR abs/2206.01880 (2022)
- [i81] Simon S. Du, Gauthier Gidel, Michael I. Jordan, Chris Junchi Li: Optimal Extragradient-Based Bilinearly-Coupled Saddle-Point Optimization. CoRR abs/2206.08573 (2022)
- [i80] Tongzhou Wang, Simon S. Du, Antonio Torralba, Phillip Isola, Amy Zhang, Yuandong Tian: Denoised MDPs: Learning World Models Better Than the World Itself. CoRR abs/2206.15477 (2022)
- [i79] Yulai Zhao, Jianshu Chen, Simon S. Du: Blessing of Class Diversity in Pre-training. CoRR abs/2209.03447 (2022)
- [i78] Shicong Cen, Yuejie Chi, Simon S. Du, Lin Xiao: Faster Last-iterate Convergence of Policy Optimization in Zero-Sum Markov Games. CoRR abs/2210.01050 (2022)
- [i77] Rui Yuan, Simon S. Du, Robert M. Gower, Alessandro Lazaric, Lin Xiao: Linear Convergence of Natural Policy Gradient Methods with Log-Linear Policies. CoRR abs/2210.01400 (2022)
- [i76] Haotian Ye, Xiaoyu Chen, Liwei Wang, Simon S. Du: On the Power of Pre-training for Generalization in RL: Provable Benefits and Hardness. CoRR abs/2210.10464 (2022)
- [i75] Runlong Zhou, Ruosong Wang, Simon S. Du: Horizon-Free Reinforcement Learning for Latent Markov Decision Processes. CoRR abs/2210.11604 (2022)
- [i74] Haozhe Jiang, Qiwen Cui, Zhihan Xiong, Maryam Fazel, Simon S. Du: Offline congestion games: How feedback type affects data coverage requirement. CoRR abs/2210.13396 (2022)
- 2021
- [j2] Yining Wang, Yi Wu, Simon S. Du: Near-Linear Time Local Polynomial Nonparametric Estimation with Box Kernels. INFORMS J. Comput. 33(4): 1339-1353 (2021)
- [c56] Kunhe Yang, Lin F. Yang, Simon S. Du: Q-learning with Logarithmic Regret. AISTATS 2021: 1576-1584
- [c55] Haike Xu, Tengyu Ma, Simon S. Du: Fine-Grained Gap-Dependent Bounds for Tabular MDPs via Adaptive Multi-Step Bootstrap. COLT 2021: 4438-4472
- [c54] Zihan Zhang, Xiangyang Ji, Simon S. Du: Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon. COLT 2021: 4528-4531
- [c53] Yining Wang, Ruosong Wang, Simon Shaolei Du, Akshay Krishnamurthy: Optimism in Reinforcement Learning with Generalized Linear Function Approximation. ICLR 2021
- [c52] Simon Shaolei Du, Wei Hu, Sham M. Kakade, Jason D. Lee, Qi Lei: Few-Shot Learning via Learning the Representation, Provably. ICLR 2021
- [c51] Zhenggang Tang, Chao Yu, Boyuan Chen, Huazhe Xu, Xiaolong Wang, Fei Fang, Simon Shaolei Du, Yu Wang, Yi Wu: Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization. ICLR 2021
- [c50] Keyulu Xu, Mozhi Zhang, Jingling Li, Simon Shaolei Du, Ken-ichi Kawarabayashi, Stefanie Jegelka: How Neural Networks Extrapolate: From Feedforward to Graph Neural Networks. ICLR 2021
- [c49] Jiaqi Yang, Wei Hu, Jason D. Lee, Simon Shaolei Du: Impact of Representation Learning in Linear Bandits. ICLR 2021
- [c48] Yifang Chen, Simon S. Du, Kevin Jamieson: Improved Corruption Robust Algorithms for Episodic Reinforcement Learning. ICML 2021: 1561-1570
- [c47] Simon S. Du, Sham M. Kakade, Jason D. Lee, Shachar Lovett, Gaurav Mahajan, Wen Sun, Ruosong Wang: Bilinear Classes: A Structural Framework for Provable Generalization in RL. ICML 2021: 2826-2836
- [c46] Tianhao Wu, Yunchang Yang, Simon S. Du, Liwei Wang: On Reinforcement Learning with Adversarial Corruption and Its Application to Block MDP. ICML 2021: 11296-11306
- [c45] Zihan Zhang, Simon S. Du, Xiangyang Ji: Near Optimal Reward-Free Reinforcement Learning. ICML 2021: 12402-12412
- [c44] Tian Ye, Simon S. Du: Global Convergence of Gradient Descent for Asymmetric Low-Rank Matrix Factorization. NeurIPS 2021: 1429-1439
- [c43] Zihan Zhang, Jiaqi Yang, Xiangyang Ji, Simon S. Du: Improved Variance-Aware Confidence Sets for Linear Bandits and Linear Mixture MDP. NeurIPS 2021: 4342-4355
- [c42] Jean Tarbouriech, Runlong Zhou, Simon S. Du, Matteo Pirotta, Michal Valko, Alessandro Lazaric: Stochastic Shortest Path: Minimax, Parameter-Free and Towards Horizon-Free Regret. NeurIPS 2021: 6843-6855
- [c41] Tongzheng Ren, Jialian Li, Bo Dai, Simon S. Du, Sujay Sanghavi: Nearly Horizon-Free Offline Reinforcement Learning. NeurIPS 2021: 15621-15634
- [c40] Yifang Chen, Simon S. Du, Kevin G. Jamieson: Corruption Robust Active Learning. NeurIPS 2021: 29643-29654
- [c39] Simon S. Du, Wei Hu, Zhiyuan Li, Ruoqi Shen, Zhao Song, Jiajun Wu: When is particle filtering efficient for planning in partially observed linear dynamical systems? UAI 2021: 728-737
- [i73] Minbo Gao, Tianle Xie, Simon S. Du, Lin F. Yang: A Provably Efficient Algorithm for Linear Markov Decision Process with Low Switching Cost. CoRR abs/2101.00494 (2021)
- [i72] Zihan Zhang, Jiaqi Yang, Xiangyang Ji, Simon S. Du: Variance-Aware Confidence Set: Variance-Dependent Bound for Linear Bandits and Horizon-Free Bound for Linear Mixture MDP. CoRR abs/2101.12745 (2021)
- [i71] Haike Xu, Tengyu Ma, Simon S. Du: Fine-Grained Gap-Dependent Bounds for Tabular MDPs via Adaptive Multi-Step Bootstrap. CoRR abs/2102.04692 (2021)
- [i70] Yifang Chen, Simon S. Du, Kevin Jamieson: Improved Corruption Robust Algorithms for Episodic Reinforcement Learning. CoRR abs/2102.06875 (2021)
- [i69] Yulai Zhao, Yuandong Tian, Jason D. Lee, Simon S. Du: Provably Efficient Policy Gradient Methods for Two-Player Zero-Sum Markov Games. CoRR abs/2102.08903 (2021)
- [i68] Zhihan Xiong, Ruoqi Shen, Simon S. Du: Randomized Exploration is Near-Optimal for Tabular MDP. CoRR abs/2102.09703 (2021)
- [i67] Zhenggang Tang, Chao Yu, Boyuan Chen, Huazhe Xu, Xiaolong Wang, Fei Fang, Simon S. Du, Yu Wang, Yi Wu: Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization. CoRR abs/2103.04564 (2021)
- [i66] Simon S. Du, Sham M. Kakade, Jason D. Lee, Shachar Lovett, Gaurav Mahajan, Wen Sun, Ruosong Wang: Bilinear Classes: A Structural Framework for Provable Generalization in RL. CoRR abs/2103.10897 (2021)
- [i65] Tongzheng Ren, Jialian Li, Bo Dai, Simon S. Du, Sujay Sanghavi: Nearly Horizon-Free Offline Reinforcement Learning. CoRR abs/2103.14077 (2021)
- [i64] Jean Tarbouriech, Runlong Zhou, Simon S. Du, Matteo Pirotta, Michal Valko, Alessandro Lazaric: Stochastic Shortest Path: Minimax, Parameter-Free and Towards Horizon-Free Regret. CoRR abs/2104.11186 (2021)
- [i63] Zhili Feng, Shaobo Han, Simon S. Du: Provable Adaptation across Multiway Domains via Representation Learning. CoRR abs/2106.06657 (2021)
- [i62] Rui Lu, Gao Huang, Simon S. Du: On the Power of Multitask Representation Learning in Linear MDP. CoRR abs/2106.08053 (2021)
- [i61] Yifang Chen, Simon S. Du, Kevin Jamieson: Corruption Robust Active Learning. CoRR abs/2106.11220 (2021)
- [i60] Yunchang Yang, Tianhao Wu, Han Zhong, Evrard Garcelon, Matteo Pirotta, Alessandro Lazaric, Liwei Wang, Simon S. Du: A Unified Framework for Conservative Exploration. CoRR abs/2106.11692 (2021)
- [i59] Tian Ye, Simon S. Du: Global Convergence of Gradient Descent for Asymmetric Low-Rank Matrix Factorization. CoRR abs/2106.14289 (2021)
- [i58] Zehao Dou, Zhuoran Yang, Zhaoran Wang, Simon S. Du: Gap-Dependent Bounds for Two-Player Markov Games. CoRR abs/2107.00685 (2021)
- [i57] Xiaoxia Wu, Yuege Xie, Simon S. Du, Rachel A. Ward: AdaLoss: A computationally-efficient and provably convergent adaptive gradient method. CoRR abs/2109.08282 (2021)
- [i56] Xiang Wang, Xinlei Chen, Simon S. Du, Yuandong Tian: Towards Demystifying Representation Learning with Non-contrastive Self-supervision. CoRR abs/2110.04947 (2021)
- [i55] Andrew Wagenmaker, Yifang Chen, Max Simchowitz, Simon S. Du, Kevin Jamieson: First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach. CoRR abs/2112.03432 (2021)
- [i54] Shusheng Xu, Yancheng Liang, Yunfei Li, Simon Shaolei Du, Yi Wu: A Benchmark for Low-Switching-Cost Reinforcement Learning. CoRR abs/2112.06424 (2021)
- [i53] Tianhao Wu, Yunchang Yang, Han Zhong, Liwei Wang, Simon S. Du, Jiantao Jiao: Nearly Optimal Policy Optimization with Stable at Any Time Guarantee. CoRR abs/2112.10935 (2021)
- 2020
- [j1] Xi Chen, Simon S. Du, Xin T. Tong: On Stationary-Point Hitting Time and Ergodicity of Stochastic Gradient Langevin Dynamics. J. Mach. Learn. Res. 21: 68:1-68:41 (2020)
- [c38] Sanjeev Arora, Simon S. Du, Zhiyuan Li, Ruslan Salakhutdinov, Ruosong Wang, Dingli Yu: Harnessing the Power of Infinitely Wide Deep Nets on Small-data Tasks. ICLR 2020
- [c37] Simon S. Du, Sham M. Kakade, Ruosong Wang, Lin F. Yang: Is a Good Representation Sufficient for Sample Efficient Reinforcement Learning? ICLR 2020
- [c36] Keyulu Xu, Jingling Li, Mozhi Zhang, Simon S. Du, Ken-ichi Kawarabayashi, Stefanie Jegelka: What Can Neural Networks Reason About? ICLR 2020
- [c35] Sanjeev Arora, Simon S. Du, Sham M. Kakade, Yuping Luo, Nikunj Saunshi: Provable Representation Learning for Imitation Learning via Bi-level Optimization. ICML 2020: 367-376
- [c34] Yunbo Wang, Bo Liu, Jiajun Wu, Yuke Zhu, Simon S. Du, Li Fei-Fei, Joshua B. Tenenbaum: DualSMC: Tunneling Differentiable Filtering and Planning under Continuous POMDPs. IJCAI 2020: 4190-4198
- [c33] Simon S. Du, Jason D. Lee, Gaurav Mahajan, Ruosong Wang: Agnostic $Q$-learning with Function Approximation in Deterministic Systems: Near-Optimal Bounds on Approximation Error and Sample Complexity. NeurIPS 2020
- [c32] Fei Feng, Ruosong Wang, Wotao Yin, Simon S. Du, Lin F. Yang: Provably Efficient Exploration for Reinforcement Learning Using Unsupervised Learning. NeurIPS 2020
- [c31] Ruosong Wang, Simon S. Du, Lin F. Yang, Sham M. Kakade: Is Long Horizon RL More Difficult Than Short Horizon RL? NeurIPS 2020
- [c30] Ruosong Wang, Simon S. Du, Lin F. Yang, Ruslan Salakhutdinov: On Reward-Free Reinforcement Learning with Linear Function Approximation. NeurIPS 2020
- [c29] Ruosong Wang, Peilin Zhong, Simon S. Du, Ruslan Salakhutdinov, Lin F. Yang: Planning with General Objective Functions: Going Beyond Total Rewards. NeurIPS 2020
- [c28] Yi Zhang, Orestis Plevrakis, Simon S. Du, Xingguo Li, Zhao Song, Sanjeev Arora: Over-parameterized Adversarial Training: An Analysis Overcoming the Curse of Dimensionality. NeurIPS 2020
- [i52] Yi Zhang, Orestis Plevrakis, Simon S. Du, Xingguo Li, Zhao Song, Sanjeev Arora: Over-parameterized Adversarial Training: An Analysis Overcoming the Curse of Dimensionality. CoRR abs/2002.06668 (2020)
- [i51] Simon S. Du, Jason D. Lee, Gaurav Mahajan, Ruosong Wang: Agnostic Q-learning with Function Approximation in Deterministic Systems: Tight Bounds on Approximation Error and Sample Complexity. CoRR abs/2002.07125 (2020)
- [i50] Simon S. Du, Wei Hu, Sham M. Kakade, Jason D. Lee, Qi Lei: Few-Shot Learning via Learning the Representation, Provably. CoRR abs/2002.09434 (2020)
- [i49] Sanjeev Arora, Simon S. Du, Sham M. Kakade, Yuping Luo, Nikunj Saunshi: Provable Representation Learning for Imitation Learning via Bi-level Optimization. CoRR abs/2002.10544 (2020)
- [i48] Fei Feng, Ruosong Wang, Wotao Yin, Simon S. Du, Lin F. Yang: Provably Efficient Exploration for RL with Unsupervised Learning. CoRR abs/2003.06898 (2020)
- [i47] Ruosong Wang, Simon S. Du, Lin F. Yang, Sham M. Kakade: Is Long Horizon Reinforcement Learning More Difficult Than Short Horizon Reinforcement Learning? CoRR abs/2005.00527 (2020)
- [i46] Simon S. Du, Wei Hu, Zhiyuan Li, Ruoqi Shen, Zhao Song, Jiajun Wu: When is Particle Filtering Efficient for POMDP Sequential Planning? CoRR abs/2006.05975 (2020)
- [i45] Kunhe Yang, Lin F. Yang, Simon S. Du: Q-learning with Logarithmic Regret. CoRR abs/2006.09118 (2020)
- [i44] Ruosong Wang, Simon S. Du, Lin F. Yang, Ruslan Salakhutdinov: On Reward-Free Reinforcement Learning with Linear Function Approximation. CoRR abs/2006.11274 (2020)
- [i43] Keyulu Xu, Jingling Li, Mozhi Zhang, Simon S. Du, Ken-ichi Kawarabayashi, Stefanie Jegelka: How Neural Networks Extrapolate: From Feedforward to Graph Neural Networks. CoRR abs/2009.11848 (2020)
- [i42] Zihan Zhang, Xiangyang Ji, Simon S. Du: Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon. CoRR abs/2009.13503 (2020)
- [i41] Zihan Zhang, Simon S. Du, Xiangyang Ji: Nearly Minimax Optimal Reward-free Reinforcement Learning. CoRR abs/2010.05901 (2020)
- [i40] Jiaqi Yang, Wei Hu, Jason D. Lee, Simon S. Du: Provable Benefits of Representation Learning in Linear Bandits. CoRR abs/2010.06531 (2020)
2010 – 2019
- 2019
- [b1] Simon S. Du: Gradient Descent for Non-convex Problems in Modern Machine Learning. Carnegie Mellon University, USA, 2019
- [c27] Simon S. Du, Wei Hu: Linear Convergence of the Primal-Dual Gradient Method for Convex-Concave Saddle Point Problems without Strong Convexity. AISTATS 2019: 196-205
- [c26] Simon S. Du, Xiyu Zhai, Barnabás Póczos, Aarti Singh: Gradient Descent Provably Optimizes Over-parameterized Neural Networks. ICLR (Poster) 2019
- [c25] Sanjeev Arora, Simon S. Du, Wei Hu, Zhiyuan Li, Ruosong Wang: Fine-Grained Analysis of Optimization and Generalization for Overparameterized Two-Layer Neural Networks. ICML 2019: 322-332
- [c24] Simon S. Du, Wei Hu: Width Provably Matters in Optimization for Deep Linear Neural Networks. ICML 2019: 1655-1664
- [c23] Simon S. Du, Akshay Krishnamurthy, Nan Jiang, Alekh Agarwal, Miroslav Dudík, John Langford: Provably efficient RL with Rich Observations via Latent State Decoding. ICML 2019: 1665-1674
- [c22] Simon S. Du, Jason D. Lee, Haochuan Li, Liwei Wang, Xiyu Zhai: Gradient Descent Finds Global Minima of Deep Neural Networks. ICML 2019: 1675-1685
- [c21] Simon S. Du, Kangcheng Hou, Ruslan Salakhutdinov, Barnabás Póczos, Ruosong Wang, Keyulu Xu: Graph Neural Tangent Kernel: Fusing Graph Neural Networks with Graph Kernels. NeurIPS 2019: 5724-5734
- [c20] Bin Shi, Simon S. Du, Weijie J. Su, Michael I. Jordan: Acceleration via Symplectic Discretization of High-Resolution Differential Equations. NeurIPS 2019: 5745-5753
- [c19] Tianyi Liu, Minshuo Chen, Mo Zhou, Simon S. Du, Enlu Zhou, Tuo Zhao: Towards Understanding the Importance of Shortcut Connections in Residual Networks. NeurIPS 2019: 7890-7900
- [c18] Simon S. Du, Yuping Luo, Ruosong Wang, Hanrui Zhang: Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle. NeurIPS 2019: 8058-8068
- [c17] Sanjeev Arora, Simon S. Du, Wei Hu, Zhiyuan Li, Ruslan Salakhutdinov, Ruosong Wang: On Exact Computation with an Infinitely Wide Neural Net. NeurIPS 2019: 8139-8148
- [i39] Simon S. Du, Wei Hu: Width Provably Matters in Optimization for Deep Linear Neural Networks. CoRR abs/1901.08572 (2019)
- [i38] Sanjeev Arora, Simon S. Du, Wei Hu, Zhiyuan Li, Ruosong Wang: Fine-Grained Analysis of Optimization and Generalization for Overparameterized Two-Layer Neural Networks. CoRR abs/1901.08584 (2019)
- [i37] Simon S. Du, Akshay Krishnamurthy, Nan Jiang, Alekh Agarwal, Miroslav Dudík, John Langford: Provably efficient RL with Rich Observations via Latent State Decoding. CoRR abs/1901.09018 (2019)
- [i36] Bin Shi, Simon S. Du, Weijie J. Su, Michael I. Jordan: Acceleration via Symplectic Discretization of High-Resolution Differential Equations. CoRR abs/1902.03694 (2019)
- [i35] Xiaoxia Wu, Simon S. Du, Rachel A. Ward: Global Convergence of Adaptive Gradient Methods for An Over-parameterized Neural Network. CoRR abs/1902.07111 (2019)
- [i34] Sanjeev Arora, Simon S. Du, Wei Hu, Zhiyuan Li, Ruslan Salakhutdinov, Ruosong Wang: On Exact Computation with an Infinitely Wide Neural Net. CoRR abs/1904.11955 (2019)
- [i33] Xi Chen, Simon S. Du, Xin T. Tong: Hitting Time of Stochastic Gradient Langevin Dynamics to Stationary Points: A Direct Analysis. CoRR abs/1904.13016 (2019)
- [i32] Simon S. Du, Kangcheng Hou, Barnabás Póczos, Ruslan Salakhutdinov, Ruosong Wang, Keyulu Xu: Graph Neural Tangent Kernel: Fusing Graph Neural Networks with Graph Kernels. CoRR abs/1905.13192 (2019)
- [i31] Keyulu Xu, Jingling Li, Mozhi Zhang, Simon S. Du, Ken-ichi Kawarabayashi, Stefanie Jegelka: What Can Neural Networks Reason About? CoRR abs/1905.13211 (2019)
- [i30] Simon S. Du, Yuping Luo, Ruosong Wang, Hanrui Zhang: Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle. CoRR abs/1906.06321 (2019)
- [i29] Tianyi Liu, Minshuo Chen, Mo Zhou, Simon S. Du, Enlu Zhou, Tuo Zhao: Towards Understanding the Importance of Shortcut Connections in Residual Networks. CoRR abs/1909.04653 (2019)
- [i28] Yunbo Wang, Bo Liu, Jiajun Wu, Yuke Zhu, Simon S. Du, Li Fei-Fei, Joshua B. Tenenbaum: Dual Sequential Monte Carlo: Tunneling Filtering and Planning in Continuous POMDPs. CoRR abs/1909.13003 (2019)
- [i27] Sanjeev Arora, Simon S. Du, Zhiyuan Li, Ruslan Salakhutdinov, Ruosong Wang, Dingli Yu: Harnessing the Power of Infinitely Wide Deep Nets on Small-data Tasks. CoRR abs/1910.01663 (2019)
- [i26] Simon S. Du, Sham M. Kakade, Ruosong Wang, Lin F. Yang: Is a Good Representation Sufficient for Sample Efficient Reinforcement Learning? CoRR abs/1910.03016 (2019)
- [i25] Simon S. Du, Ruosong Wang, Mengdi Wang, Lin F. Yang: Continuous Control with Contexts, Provably. CoRR abs/1910.13614 (2019)
- [i24] Zhiyuan Li, Ruosong Wang, Dingli Yu, Simon S. Du, Wei Hu, Ruslan Salakhutdinov, Sanjeev Arora: Enhanced Convolutional Neural Tangent Kernels. CoRR abs/1911.00809 (2019)
- [i23] Yining Wang, Ruosong Wang, Simon S. Du, Akshay Krishnamurthy: Optimism in Reinforcement Learning with Generalized Linear Function Approximation. CoRR abs/1912.04136 (2019)
- 2018
- [c16] Yining Wang, Simon S. Du, Sivaraman Balakrishnan, Aarti Singh: Stochastic Zeroth-order Optimization in High Dimensions. AISTATS 2018: 1356-1365
- [c15] Simon S. Du, Jason D. Lee, Yuandong Tian: When is a Convolutional Filter Easy to Learn? ICLR (Poster) 2018
- [c14] Simon S. Du, Jason D. Lee: On the Power of Over-parametrization in Neural Networks with Quadratic Activation. ICML 2018: 1328-1337
- [c13] Simon S. Du, Jason D. Lee, Yuandong Tian, Aarti Singh, Barnabás Póczos: Gradient Descent Learns One-hidden-layer CNN: Don't be Afraid of Spurious Local Minima. ICML 2018: 1338-1347
- [c12] Yi Wu, Siddharth Srivastava, Nicholas Hay, Simon S. Du, Stuart Russell: Discrete-Continuous Mixtures in Probabilistic Programming: Generalized Semantics and Inference Algorithms. ICML 2018: 5339-5348
- [c11] Xiao Zhang, Simon S. Du, Quanquan Gu: Fast and Sample Efficient Inductive Matrix Completion via Multi-Phase Procrustes Flow. ICML 2018: 5751-5760
- [c10] Simon S. Du, Yining Wang, Xiyu Zhai, Sivaraman Balakrishnan, Ruslan Salakhutdinov, Aarti Singh: How Many Samples are Needed to Estimate a Convolutional Neural Network? NeurIPS 2018: 371-381
- [c9] Simon S. Du, Wei Hu, Jason D. Lee: Algorithmic Regularization in Learning Deep Homogeneous Models: Layers are Automatically Balanced. NeurIPS 2018: 382-393
- [i22] Simon S. Du, Wei Hu: Linear Convergence of the Primal-Dual Gradient Method for Convex-Concave Saddle Point Problems without Strong Convexity. CoRR abs/1802.01504 (2018)
- [i21] Yining Wang, Yi Wu, Simon S. Du: Near-Linear Time Local Polynomial Nonparametric Estimation. CoRR abs/1802.09578 (2018)
- [i20] Simon S. Du, Jason D. Lee: On the Power of Over-parametrization in Neural Networks with Quadratic Activation. CoRR abs/1803.01206 (2018)
- [i19] Xiao Zhang, Simon S. Du, Quanquan Gu: Fast and Sample Efficient Inductive Matrix Completion via Multi-Phase Procrustes Flow. CoRR abs/1803.01233 (2018)
- [i18] Simon S. Du, Surbhi Goel: Improved Learning of One-hidden-layer Convolutional Neural Networks with Overlaps. CoRR abs/1805.07798 (2018)
- [i17] Simon S. Du, Yining Wang, Xiyu Zhai, Sivaraman Balakrishnan, Ruslan Salakhutdinov, Aarti Singh: How Many Samples are Needed to Learn a Convolutional Neural Network? CoRR abs/1805.07883 (2018)
- [i16] Simon S. Du, Yining Wang, Sivaraman Balakrishnan, Pradeep Ravikumar, Aarti Singh: Robust Nonparametric Regression under Huber's ε-contamination Model. CoRR abs/1805.10406 (2018)
- [i15] Simon S. Du, Wei Hu, Jason D. Lee: Algorithmic Regularization in Learning Deep Homogeneous Models: Layers are Automatically Balanced. CoRR abs/1806.00900 (2018)
- [i14] Yi Wu, Siddharth Srivastava, Nicholas Hay, Simon S. Du, Stuart Russell: Discrete-Continuous Mixtures in Probabilistic Programming: Generalized Semantics and Inference Algorithms. CoRR abs/1806.02027 (2018)
- [i13] Simon S. Du, Xiyu Zhai, Barnabás Póczos, Aarti Singh: Gradient Descent Provably Optimizes Over-parameterized Neural Networks. CoRR abs/1810.02054 (2018)
- [i12] Bin Shi, Simon S. Du, Michael I. Jordan, Weijie J. Su: Understanding the Acceleration Phenomenon via High-Resolution Differential Equations. CoRR abs/1810.08907 (2018)
- [i11] Simon S. Du, Jason D. Lee, Haochuan Li, Liwei Wang, Xiyu Zhai: Gradient Descent Finds Global Minima of Deep Neural Networks. CoRR abs/1811.03804 (2018)
- 2017
- [c8] Sivaraman Balakrishnan, Simon S. Du, Jerry Li, Aarti Singh: Computationally Efficient Robust Sparse Estimation in High Dimensions. COLT 2017: 169-212
- [c7] Srinivasan Vijayarangan, Paloma Sodhi, Prathamesh Kini, James Bourne, Simon S. Du, Hanqi Sun, Barnabás Póczos, Dimitrios Apostolopoulos, David Wettergreen: High-Throughput Robotic Phenotyping of Energy Sorghum Crops. FSR 2017: 99-113
- [c6] Simon S. Du, Jianshu Chen, Lihong Li, Lin Xiao, Dengyong Zhou: Stochastic Variance Reduction Methods for Policy Evaluation. ICML 2017: 1049-1058
- [c5] Simon S. Du, Yining Wang, Aarti Singh: On the Power of Truncated SVD for General High-rank Matrix Estimation Problems. NIPS 2017: 445-455
- [c4] Simon S. Du, Jayanth Koushik, Aarti Singh, Barnabás Póczos: Hypothesis Transfer Learning via Transformation Functions. NIPS 2017: 574-584
- [c3] Simon S. Du, Chi Jin, Jason D. Lee, Michael I. Jordan, Aarti Singh, Barnabás Póczos: Gradient Descent Can Take Exponential Time to Escape Saddle Points. NIPS 2017: 1067-1077
- [i10] Simon S. Du, Yining Wang, Aarti Singh: On the Power of Truncated SVD for General High-rank Matrix Estimation Problems. CoRR abs/1702.06861 (2017)
- [i9] Simon S. Du, Sivaraman Balakrishnan, Aarti Singh: Computationally Efficient Robust Estimation of Sparse Functionals. CoRR abs/1702.07709 (2017)
- [i8] Simon S. Du, Jianshu Chen, Lihong Li, Lin Xiao, Dengyong Zhou: Stochastic Variance Reduction Methods for Policy Evaluation. CoRR abs/1702.07944 (2017)
- [i7] Simon S. Du, Chi Jin, Jason D. Lee, Michael I. Jordan, Barnabás Póczos, Aarti Singh: Gradient Descent Can Take Exponential Time to Escape Saddle Points. CoRR abs/1705.10412 (2017)
- [i6] Simon S. Du, Jason D. Lee, Yuandong Tian: When is a Convolutional Filter Easy To Learn? CoRR abs/1709.06129 (2017)
- [i5] Yining Wang, Simon S. Du, Sivaraman Balakrishnan, Aarti Singh: Stochastic Zeroth-order Optimization in High Dimensions. CoRR abs/1710.10551 (2017)
- [i4] Simon S. Du, Jason D. Lee, Yuandong Tian, Barnabás Póczos, Aarti Singh: Gradient Descent Learns One-hidden-layer CNN: Don't be Afraid of Spurious Local Minima. CoRR abs/1712.00779 (2017)
- 2016
- [c2] Maria-Florina Balcan, Simon Shaolei Du, Yining Wang, Adams Wei Yu: An Improved Gap-Dependency Analysis of the Noisy Power Method. COLT 2016: 284-309
- [c1] Shashank Singh, Simon S. Du, Barnabás Póczos: Efficient Nonparametric Smoothness Estimation. NIPS 2016: 1010-1018
- [i3] Maria-Florina Balcan, Simon S. Du, Yining Wang, Adams Wei Yu: An Improved Gap-Dependency Analysis of the Noisy Power Method. CoRR abs/1602.07046 (2016)
- [i2] Shashank Singh, Simon S. Du, Barnabás Póczos: Efficient Nonparametric Smoothness Estimation. CoRR abs/1605.05785 (2016)
- [i1] Simon Shaolei Du, Jayanth Koushik, Aarti Singh, Barnabás Póczos: Transformation Function Based Methods for Model Shift. CoRR abs/1612.01020 (2016)