Marc LanctotDavid SychrovskyMichal SustrElnaz DavoodiMichael BowlingMarc LanctotMartin SchmidLearning Not to Regret.15202-152102024AAAIhttps://doi.org/10.1609/aaai.v38i14.29443conf/aaai/2024db/conf/aaai/aaai2024.html#SychrovskySDBLS24Ian GempMarc LanctotLuke MarrisYiran MaoEdgar A. Duéñez-GuzmánSarah PerrinAndras GyorgyRomuald ElieGeorgios PiliourasMichael KaisersDaniel HennesKalesha BullardKate LarsonYoram BachrachApproximating the Core via Iterative Coalition Sampling.669-6782024AAMAShttps://dl.acm.org/doi/10.5555/3635637.3662919conf/atal/2024db/conf/atal/aamas2024.html#GempLMMDPGEPKHB24Siqi Liu 0002Luke MarrisMarc LanctotGeorgios PiliourasJoel Z. LeiboNicolas HeessNeural Population Learning beyond Symmetric Zero-Sum Games.1247-12552024AAMAShttps://dl.acm.org/doi/10.5555/3635637.3662982conf/atal/2024db/conf/atal/aamas2024.html#LiuMLPLH24Siqi Liu 0002Luke MarrisMarc LanctotGeorgios PiliourasJoel Z. LeiboNicolas HeessNeural Population Learning beyond Symmetric Zero-sum Games.2024abs/2401.05133CoRRhttps://doi.org/10.48550/arXiv.2401.05133db/journals/corr/corr2401.html#abs-2401-05133Ian GempYoram BachrachMarc LanctotRoma PatelVibhavari DasagiLuke MarrisGeorgios PiliourasSiqi Liu 0002Karl TuylsStates as Strings as Strategies: Steering Language Models with Game-Theoretic Solvers.2024abs/2402.01704CoRRhttps://doi.org/10.48550/arXiv.2402.01704db/journals/corr/corr2402.html#abs-2402-01704Ian GempMarc LanctotLuke MarrisYiran MaoEdgar A. Duéñez-GuzmánSarah PerrinAndras GyorgyRomuald ElieGeorgios PiliourasMichael KaisersDaniel HennesKalesha BullardKate LarsonYoram BachrachApproximating the Core via Iterative Coalition Sampling.2024abs/2402.03928CoRRhttps://doi.org/10.48550/arXiv.2402.03928db/journals/corr/corr2402.html#abs-2402-03928Luca D'Amico-WongHugh ZhangMarc LanctotDavid C. ParkesEasy as ABCs: Unifying Boltzmann Q-Learning and Counterfactual Regret Minimization.2024abs/2402.11835CoRRhttps://doi.org/10.48550/arXiv.2402.11835db/journals/corr/corr2402.html#abs-2402-11835Heymann BenjaminMarc LanctotLearning in Games with progressive hiding.2024abs/2409.03875CoRRhttps://doi.org/10.48550/arXiv.2409.03875db/journals/corr/corr2409.html#abs-2409-03875streams/journals/corrMarc LanctotJohn SchultzNeil BurchMax Olan SmithDaniel HennesThomas Anthony 0001Julien PérolatPopulation-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning.20232023Trans. Mach. Learn. Res.https://openreview.net/forum?id=gQnJ7ODIAxdb/journals/tmlr/tmlr2023.html#LanctotSBSH0P23Zun Li 0002Marc LanctotKevin R. McKeeLuke MarrisIan GempDaniel HennesKate LarsonYoram BachrachMichael P. WellmanPaul MullerSearch-Improved Game-Theoretic Multiagent Reinforcement Learning in General and Negotiation Games.2445-24472023AAMAShttps://dl.acm.org/doi/10.5555/3545946.3598962conf/atal/2023db/conf/atal/aamas2023.html#LiLMMGHLBWM23Stephen Marcus McAleerGabriele FarinaMarc LanctotTuomas SandholmESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret.2023ICLRhttps://openreview.net/forum?id=35QyoZv8cKOconf/iclr/2023db/conf/iclr/iclr2023.html#McAleerFLS23Samuel SokotaRyan D'OrazioJ. Zico KolterNicolas LoizouMarc LanctotIoannis MitliagkasNoam BrownChristian KroerA Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum Games.2023ICLRhttps://openreview.net/forum?id=DpE5UYUQzZHconf/iclr/2023db/conf/iclr/iclr2023.html#SokotaDKLLMBK23Zun Li 0002Marc LanctotKevin R. McKeeLuke MarrisIan GempDaniel HennesPaul MullerKate LarsonYoram BachrachMichael P. WellmanCombining Tree-Search, Generative Models, and Nash Bargaining Concepts in Game-Theoretic Reinforcement Learning.2023abs/2302.00797CoRRhttps://doi.org/10.48550/arXiv.2302.00797db/journals/corr/corr2302.html#abs-2302-00797David SychrovskyMichal SustrElnaz DavoodiMarc LanctotMartin SchmidLearning not to Regret.2023abs/2303.01074CoRRhttps://doi.org/10.48550/arXiv.2303.01074db/journals/corr/corr2303.html#abs-2303-01074Marc LanctotJohn SchultzNeil BurchMax Olan SmithDaniel HennesThomas W. Anthony 0001Julien PérolatPopulation-based Evaluation in Repeated Rock-Paper-Scissors as a Benchmark for Multiagent Reinforcement Learning.2023abs/2303.03196CoRRhttps://doi.org/10.48550/arXiv.2303.03196db/journals/corr/corr2303.html#abs-2303-03196Marc LanctotKate LarsonYoram BachrachLuke MarrisZun Li 0002Avishkar BhoopchandThomas W. Anthony 0001Brian TannerAnna KoopEvaluating Agents using Social Choice Theory.2023abs/2312.03121CoRRhttps://doi.org/10.48550/arXiv.2312.03121db/journals/corr/corr2312.html#abs-2312-03121Ian GempThomas W. Anthony 0001Yoram BachrachAvishkar BhoopchandKalesha BullardJerome T. ConnorVibhavari DasagiBart De VylderEdgar A. Duéñez-GuzmánRomuald ElieRichard Everett 0001Daniel HennesEdward Hughes 0001Mina KhanMarc LanctotKate LarsonGuy LeverSiqi Liu 0002Luke MarrisKevin R. McKeePaul MullerJulien PérolatFlorian StrubAndrea TacchettiEugene TarassovZhe WangKarl TuylsDeveloping, evaluating and scaling learning agents in multi-agent environments.271-284202235AI Commun.4https://doi.org/10.3233/AIC-220113db/journals/aicom/aicom35.html#GempABBBCDVDEEH22Ian GempRahul SavaniMarc LanctotYoram BachrachThomas W. Anthony 0001Richard Everett 0001Andrea TacchettiTom EcclesJános KramárSample-based Approximation of Nash in Large Many-Player Games via Gradient Descent.507-5152022AAMAShttps://www.ifaamas.org/Proceedings/aamas2022/pdfs/p507.pdfhttps://dl.acm.org/doi/10.5555/3535850.3535908conf/atal/2022db/conf/atal/aamas2022.html#GempSLBA0TEK22Siqi Liu 0002Marc LanctotLuke MarrisNicolas HeessSimplex Neural Population Learning: Any-Mixture Bayes-Optimality in Symmetric Zero-sum Games.13793-138062022ICMLhttps://proceedings.mlr.press/v162/liu22h.htmlconf/icml/2022db/conf/icml/icml2022.html#LiuLMH22Finbarr TimbersNolan BardEdward LockhartMarc LanctotMartin SchmidNeil BurchJulian SchrittwieserThomas HubertMichael BowlingApproximate Exploitability: Learning a Best Response.3487-34932022IJCAIhttps://doi.org/10.24963/ijcai.2022/484conf/ijcai/2022db/conf/ijcai/ijcai2022.html#TimbersBLLSBSHB22Julien PérolatBart De VylderDaniel HennesEugene TarassovFlorian StrubVincent de BoerPaul MullerJerome T. ConnorNeil BurchThomas Anthony 0001Stephen McAleerRomuald ElieSarah H. CenZhe WangAudrunas GruslysAleksandra MalyshevaMina KhanSherjil OzairFinbarr TimbersToby PohlenTom EcclesMark Rowland 0001Marc LanctotJean-Baptiste LespiauBilal PiotShayegan OmidshafieiEdward LockhartLaurent SifreNathalie BeauguerlangeRémi MunosDavid SilverSatinder Singh 0001Demis HassabisKarl TuylsFigure Data for the paper "Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning".2022Octoberhttps://doi.org/10.5281/zenodo.7118519Zenodostreams/repo/zenodoStephen McAleerKevin Wang 0003John B. LanierMarc LanctotPierre BaldiTuomas SandholmRoy FoxAnytime PSRO for Two-Player Zero-Sum Games.2022abs/2201.07700CoRRhttps://arxiv.org/abs/2201.07700db/journals/corr/corr2201.html#abs-2201-07700Siqi Liu 0002Marc LanctotLuke MarrisNicolas HeessSimplex Neural Population Learning: Any-Mixture Bayes-Optimality in Symmetric Zero-sum Games.2022abs/2205.15879CoRRhttps://doi.org/10.48550/arXiv.2205.15879db/journals/corr/corr2205.html#abs-2205-15879Stephen McAleerGabriele FarinaMarc LanctotTuomas SandholmESCHER: Eschewing Importance Sampling in Games by Computing a History Value Function to Estimate Regret.2022abs/2206.04122CoRRhttps://doi.org/10.48550/arXiv.2206.04122db/journals/corr/corr2206.html#abs-2206-04122Samuel SokotaRyan D'OrazioJ. Zico KolterNicolas LoizouMarc LanctotIoannis MitliagkasNoam BrownChristian KroerA Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum Games.2022abs/2206.05825CoRRhttps://doi.org/10.48550/arXiv.2206.05825db/journals/corr/corr2206.html#abs-2206-05825Julien PérolatBart De VylderDaniel HennesEugene TarassovFlorian StrubVincent de BoerPaul MullerJerome T. ConnorNeil BurchThomas W. Anthony 0001Stephen McAleerRomuald ElieSarah H. CenZhe WangAudrunas GruslysAleksandra MalyshevaMina KhanSherjil OzairFinbarr TimbersToby PohlenTom EcclesMark Rowland 0001Marc LanctotJean-Baptiste LespiauBilal PiotShayegan OmidshafieiEdward LockhartLaurent SifreNathalie BeauguerlangeRémi MunosDavid SilverSatinder Singh 0001Demis HassabisKarl TuylsMastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning.2022abs/2206.15378CoRRhttps://doi.org/10.48550/arXiv.2206.15378db/journals/corr/corr2206.html#abs-2206-15378Ian GempThomas W. Anthony 0001Yoram BachrachAvishkar BhoopchandKalesha BullardJerome T. ConnorVibhavari DasagiBart De VylderEdgar A. Duéñez-GuzmánRomuald ElieRichard Everett 0001Daniel HennesEdward Hughes 0001Mina KhanMarc LanctotKate LarsonGuy LeverSiqi Liu 0002Luke MarrisKevin R. McKeePaul MullerJulien PérolatFlorian StrubAndrea TacchettiEugene TarassovZhe WangKarl TuylsDeveloping, Evaluating and Scaling Learning Agents in Multi-Agent Environments.2022abs/2209.10958CoRRhttps://doi.org/10.48550/arXiv.2209.10958db/journals/corr/corr2209.html#abs-2209-10958Luke MarrisMarc LanctotIan GempShayegan OmidshafieiStephen McAleerJerome T. ConnorKarl TuylsThore GraepelGame Theoretic Rating in N-player general-sum games with Equilibria.2022abs/2210.02205CoRRhttps://doi.org/10.48550/arXiv.2210.02205db/journals/corr/corr2210.html#abs-2210-02205Dustin MorrillRyan D'OrazioReca SarfatiMarc LanctotJames R. WrightAmy R. GreenwaldMichael BowlingHindsight and Sequential Rationality of Correlated Play.5584-55942021AAAIhttps://doi.org/10.1609/aaai.v35i6.16702conf/aaai/2021db/conf/aaai/aaai2021.html#MorrillDSLWGB21Samuel SokotaEdward LockhartFinbarr TimbersElnaz DavoodiRyan D'OrazioNeil BurchMartin SchmidMichael BowlingMarc LanctotSolving Common-Payoff Games with Approximate Policy Iteration.9695-97032021AAAIhttps://doi.org/10.1609/aaai.v35i11.17166conf/aaai/2021db/conf/aaai/aaai2021.html#SokotaLTDDBSBL21Michal SustrMartin SchmidMatej MoravcíkNeil BurchMarc LanctotMichael BowlingSound Algorithms in Imperfect Information Games.1674-16762021AAMAShttps://www.ifaamas.org/Proceedings/aamas2021/pdfs/p1674.pdfhttps://dl.acm.org/doi/10.5555/3463952.3464197conf/atal/2021db/conf/atal/aamas2021.html#SustrSMBLB21Luke MarrisPaul MullerMarc LanctotKarl TuylsThore GraepelMulti-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers.7480-74912021ICMLhttp://proceedings.mlr.press/v139/marris21a.htmlconf/icml/2021db/conf/icml/icml2021.html#MarrisMLTG21Dustin MorrillRyan D'OrazioMarc LanctotJames R. WrightMichael BowlingAmy R. GreenwaldEfficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games.7818-78282021ICMLhttp://proceedings.mlr.press/v139/morrill21a.htmlconf/icml/2021db/conf/icml/icml2021.html#MorrillDLWBG21Julien PérolatRémi MunosJean-Baptiste LespiauShayegan OmidshafieiMark Rowland 0001Pedro A. OrtegaNeil BurchThomas W. Anthony 0001David BalduzziBart De VylderGeorgios PiliourasMarc LanctotKarl TuylsFrom Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization.8525-85352021ICMLhttp://proceedings.mlr.press/v139/perolat21a.htmlconf/icml/2021db/conf/icml/icml2021.html#PerolatMLOROBAB21Abhinav Gupta 0002Marc LanctotAngeliki LazaridouDynamic population-based meta-learning for multi-agent communication with natural language.16899-169122021NeurIPShttps://proceedings.neurips.cc/paper/2021/hash/8caa38721906c1a0bb95c80fab33a893-Abstract.htmlconf/nips/2021db/conf/nips/neurips2021.html#GuptaLL21Samuel SokotaEdward LockhartFinbarr TimbersElnaz DavoodiRyan D'OrazioNeil BurchMartin SchmidMichael BowlingMarc LanctotSolving Common-Payoff Games with Approximate Policy Iteration.2021abs/2101.04237CoRRhttps://arxiv.org/abs/2101.04237db/journals/corr/corr2101.html#abs-2101-04237Dustin MorrillRyan D'OrazioMarc LanctotJames R. WrightMichael BowlingAmy GreenwaldEfficient Deviation Types and Learning for Hindsight Rationality in Extensive-Form Games.2021abs/2102.06973CoRRhttps://arxiv.org/abs/2102.06973db/journals/corr/corr2102.html#abs-2102-06973Ian GempRahul SavaniMarc LanctotYoram BachrachThomas W. Anthony 0001Richard Everett 0001Andrea TacchettiTom EcclesJános KramárSample-based Approximation of Nash in Large Many-Player Games via Gradient Descent.2021abs/2106.01285CoRRhttps://arxiv.org/abs/2106.01285db/journals/corr/corr2106.html#abs-2106-01285Luke MarrisPaul MullerMarc LanctotKarl TuylsThore GraepelMulti-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers.2021abs/2106.09435CoRRhttps://arxiv.org/abs/2106.09435db/journals/corr/corr2106.html#abs-2106-09435Abhinav Gupta 0002Marc LanctotAngeliki LazaridouDynamic population-based meta-learning for multi-agent communication with natural language.2021abs/2110.14241CoRRhttps://arxiv.org/abs/2110.14241db/journals/corr/corr2110.html#abs-2110-14241Martin SchmidMatej MoravcikNeil BurchRudolf KadlecJoshua DavidsonKevin WaughNolan BardFinbarr TimbersMarc LanctotG. Zacharias HollandElnaz DavoodiAlden ChristiansonMichael BowlingPlayer of Games.2021abs/2112.03178CoRRhttps://arxiv.org/abs/2112.03178db/journals/corr/corr2112.html#abs-2112-03178Karl TuylsJulien PérolatMarc LanctotEdward Hughes 0001Richard Everett 0001Joel Z. LeiboCsaba SzepesváriThore GraepelBounds and dynamics for empirical game theoretic analysis.7202034Auton. Agents Multi Agent Syst.1https://doi.org/10.1007/s10458-019-09432-ydb/journals/aamas/aamas34.html#TuylsPLHELSG20Nolan BardJakob N. FoersterSarath ChandarNeil BurchMarc LanctotH. Francis SongEmilio ParisottoVincent DumoulinSubhodeep MoitraEdward Hughes 0001Iain DunningShibl MouradHugo LarochelleMarc G. BellemareMichael BowlingThe Hanabi challenge: A new frontier for AI research.1032162020280Artif. Intell.https://doi.org/10.1016/j.artint.2019.103216db/journals/ai/ai280.html#BardFCBLSPDMHDM20Yoram BachrachRichard Everett 0001Edward Hughes 0001Angeliki LazaridouJoel Z. LeiboMarc LanctotMichael JohansonWojciech M. CzarneckiThore GraepelNegotiating team formation using deep reinforcement learning.1033562020288Artif. Intell.https://doi.org/10.1016/j.artint.2020.103356db/journals/ai/ai288.html#BachrachEHLLLJC20Daniel HennesDustin MorrillShayegan OmidshafieiRémi MunosJulien PérolatMarc LanctotAudrunas GruslysJean-Baptiste LespiauPaavo ParmasEdgar A. Duéñez-GuzmánKarl TuylsNeural Replicator Dynamics: Multiagent Learning via Hedging Policy Gradients.492-5012020AAMAShttps://dl.acm.org/doi/10.5555/3398761.3398822https://www.ifaamas.org/Proceedings/aamas2020/pdfs/p492.pdfconf/atal/2020db/conf/atal/aamas2020.html#HennesMOMPLGLPD20Paul MullerShayegan OmidshafieiMark Rowland 0001Karl TuylsJulien PérolatSiqi Liu 0002Daniel HennesLuke MarrisMarc LanctotEdward Hughes 0001Zhe WangGuy LeverNicolas HeessThore GraepelRémi MunosA Generalized Training Approach for Multiagent Learning.2020ICLRhttps://openreview.net/forum?id=Bkl5kxrKDrconf/iclr/2020db/conf/iclr/iclr2020.html#MullerORTPLHMLH20Rémi MunosJulien PérolatJean-Baptiste LespiauMark Rowland 0001Bart De VylderMarc LanctotFinbarr TimbersDaniel HennesShayegan OmidshafieiAudrunas GruslysMohammad Gheshlaghi AzarEdward LockhartKarl TuylsFast computation of Nash Equilibria in Imperfect Information Games.7119-71292020ICMLhttp://proceedings.mlr.press/v119/munos20a.htmlconf/icml/2020db/conf/icml/icml2020.html#MunosPLRVLTHOGA20Thomas W. Anthony 0001Tom EcclesAndrea TacchettiJános KramárIan GempThomas C. HudsonNicolas PorcelMarc LanctotJulien PérolatRichard Everett 0001Satinder Singh 0001Thore GraepelYoram BachrachLearning to Play No-Press Diplomacy with Best Response Policy Iteration.2020NeurIPShttps://proceedings.neurips.cc/paper/2020/hash/d1419302db9c022ab1d48681b13d5f8b-Abstract.htmlconf/nips/2020db/conf/nips/neurips2020.html#AnthonyETKGHPLP20Julien PérolatRémi MunosJean-Baptiste LespiauShayegan OmidshafieiMark Rowland 0001Pedro A. OrtegaNeil BurchThomas W. Anthony 0001David BalduzziBart De VylderGeorgios PiliourasMarc LanctotKarl TuylsFrom Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization.2020abs/2002.08456CoRRhttps://arxiv.org/abs/2002.08456db/journals/corr/corr2002.html#abs-2002-08456Finbarr TimbersEdward LockhartMartin SchmidMarc LanctotMichael BowlingApproximate exploitability: Learning a best response in large games.2020abs/2004.09677CoRRhttps://arxiv.org/abs/2004.09677db/journals/corr/corr2004.html#abs-2004-09677Thomas W. Anthony 0001Tom EcclesAndrea TacchettiJános KramárIan GempThomas C. HudsonNicolas PorcelMarc LanctotJulien PérolatRichard Everett 0001Satinder Singh 0001Thore GraepelYoram BachrachLearning to Play No-Press Diplomacy with Best Response Policy Iteration.2020abs/2006.04635CoRRhttps://arxiv.org/abs/2006.04635db/journals/corr/corr2006.html#abs-2006-04635Michal SustrMartin SchmidMatej MoravcíkNeil BurchMarc LanctotMichael BowlingSound Search in Imperfect Information Games.2020abs/2006.08740CoRRhttps://arxiv.org/abs/2006.08740db/journals/corr/corr2006.html#abs-2006-08740Audrunas GruslysMarc LanctotRémi MunosFinbarr TimbersMartin SchmidJulien PérolatDustin MorrillVinícius Flores ZambaldiJean-Baptiste LespiauJohn SchultzMohammad Gheshlaghi AzarMichael BowlingKarl TuylsThe Advantage Regret-Matching Actor-Critic.2020abs/2008.12234CoRRhttps://arxiv.org/abs/2008.12234db/journals/corr/corr2008.html#abs-2008-12234Yoram BachrachRichard Everett 0001Edward Hughes 0001Angeliki LazaridouJoel Z. LeiboMarc LanctotMichael JohansonWojciech M. CzarneckiThore GraepelNegotiating Team Formation Using Deep Reinforcement Learning.2020abs/2010.10380CoRRhttps://arxiv.org/abs/2010.10380db/journals/corr/corr2010.html#abs-2010-10380Dustin MorrillRyan D'OrazioReca SarfatiMarc LanctotJames R. WrightAmy GreenwaldMichael BowlingHindsight and Sequential Rationality of Correlated Play.2020abs/2012.05874CoRRhttps://arxiv.org/abs/2012.05874db/journals/corr/corr2012.html#abs-2012-05874Guy BarashMauricio Castillo-EffenNiyati ChhayaPeter ClarkHuáscar EspinozaEitan FarchiChristopher W. GeibOdd Erik GundersenSeán Ó hÉigeartaighJosé Hernández-OralloChiori HoriXiaowei Huang 0001Kokil JaidkaPavan KapanipathiSarah KerenSeokhwan KimMarc LanctotDanny LangeJulian J. McAuleyDavid R. MartinezMarwan MattarMausamMartin MichalowskiReuth MirskyRoozbeh MottaghiJoseph C. OsbornJulien PérolatMartin SchmidArash Shaban-NejadOnn ShehoryBiplav SrivastavaWilliam W. StreileinKartik TalamadupulaJulian TogeliusKoichiro YoshinoQuanshi ZhangImed ZitouniReports of the Workshops Held at the 2019 AAAI Conference on Artificial Intelligence.67-78201940AI Mag.3https://doi.org/10.1609/aimag.v40i3.4981db/journals/aim/aim40.html#BarashCCCEFGGhH19Martin SchmidNeil BurchMarc LanctotMatej MoravcikRudolf KadlecMichael BowlingVariance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games Using Baselines.2157-21642019AAAIhttps://doi.org/10.1609/aaai.v33i01.33012157conf/aaai/2019db/conf/aaai/aaai2019.html#SchmidBLMKB19Edward LockhartMarc LanctotJulien PérolatJean-Baptiste LespiauDustin MorrillFinbarr TimbersKarl TuylsComputing Approximate Equilibria in Sequential Adversarial Games by Exploitability Descent.2019IJCAIhttps://doi.org/10.24963/ijcai.2019/66conf/ijcai/2019db/conf/ijcai/ijcai2019.html#LockhartLPLMTT19464-470Nolan BardJakob N. FoersterSarath ChandarNeil BurchMarc LanctotH. Francis SongEmilio ParisottoVincent DumoulinSubhodeep MoitraEdward Hughes 0001Iain DunningShibl MouradHugo LarochelleMarc G. BellemareMichael BowlingThe Hanabi Challenge: A New Frontier for AI Research.2019abs/1902.00506CoRRhttp://arxiv.org/abs/1902.00506db/journals/corr/corr1902.html#abs-1902-00506Joel Z. LeiboEdward Hughes 0001Marc LanctotThore GraepelAutocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research.2019abs/1903.00742CoRRhttp://arxiv.org/abs/1903.00742db/journals/corr/corr1903.html#abs-1903-00742Shayegan OmidshafieiChristos H. PapadimitriouGeorgios PiliourasKarl TuylsMark Rowland 0001Jean-Baptiste LespiauWojciech M. CzarneckiMarc LanctotJulien PérolatRémi Munosα-Rank: Multi-Agent Evaluation by Evolution.2019abs/1903.01373CoRRhttp://arxiv.org/abs/1903.01373db/journals/corr/corr1903.html#abs-1903-01373Edward LockhartMarc LanctotJulien PérolatJean-Baptiste LespiauDustin MorrillFinbarr TimbersKarl TuylsComputing Approximate Equilibria in Sequential Adversarial Games by Exploitability Descent.2019abs/1903.05614CoRRhttp://arxiv.org/abs/1903.05614db/journals/corr/corr1903.html#abs-1903-05614Shayegan OmidshafieiDaniel HennesDustin MorrillRémi MunosJulien PérolatMarc LanctotAudrunas GruslysJean-Baptiste LespiauKarl TuylsNeural Replicator Dynamics.2019abs/1906.00190CoRRhttp://arxiv.org/abs/1906.00190db/journals/corr/corr1906.html#abs-1906-00190Marc LanctotEdward LockhartJean-Baptiste LespiauVinícius Flores ZambaldiSatyaki UpadhyayJulien PérolatSriram Srinivasan 0005Finbarr TimbersKarl TuylsShayegan OmidshafieiDaniel HennesDustin MorrillPaul MullerTimo EwaldsRyan Faulkner 0001János KramárBart De VylderBrennan SaetaJames BradburyDavid DingSebastian BorgeaudMatthew LaiJulian SchrittwieserThomas W. Anthony 0001Edward Hughes 0001Ivo DanihelkaJonah Ryan-DavisOpenSpiel: A Framework for Reinforcement Learning in Games.2019abs/1908.09453CoRRhttp://arxiv.org/abs/1908.09453db/journals/corr/corr1908.html#abs-1908-09453Paul MullerShayegan OmidshafieiMark Rowland 0001Karl TuylsJulien PérolatSiqi Liu 0002Daniel HennesLuke MarrisMarc LanctotEdward Hughes 0001Zhe WangGuy LeverNicolas HeessThore GraepelRémi MunosA Generalized Training Approach for Multiagent Learning.2019abs/1909.12823CoRRhttp://arxiv.org/abs/1909.12823db/journals/corr/corr1909.html#abs-1909-12823Todd HesterMatej VeceríkOlivier PietquinMarc LanctotTom SchaulBilal PiotDan HorganJohn QuanAndrew SendonarisIan OsbandGabriel Dulac-ArnoldJohn P. AgapiouJoel Z. LeiboAudrunas GruslysDeep Q-learning From Demonstrations.2018AAAIhttps://doi.org/10.1609/aaai.v32i1.11757conf/aaai/2018db/conf/aaai/aaai2018.html#HesterVPLSPHQSO183223-3230Karl TuylsJulien PérolatMarc LanctotJoel Z. LeiboThore GraepelA Generalised Method for Empirical Game Theoretic Analysis.77-852018AAMAShttp://dl.acm.org/citation.cfm?id=3237402conf/atal/2018db/conf/atal/aamas2018.html#TuylsPLLG18Peter SunehagGuy LeverAudrunas GruslysWojciech Marian CzarneckiVinícius Flores ZambaldiMax JaderbergMarc LanctotNicolas SonneratJoel Z. LeiboKarl TuylsThore GraepelValue-Decomposition Networks For Cooperative Multi-Agent Learning Based On Team Reward.2085-20872018AAMAShttp://dl.acm.org/citation.cfm?id=3238080conf/atal/2018db/conf/atal/aamas2018.html#SunehagLGCZJLSL18Kris CaoAngeliki LazaridouMarc LanctotJoel Z. LeiboKarl TuylsStephen ClarkEmergent Communication through Negotiation.2018ICLR (Poster)https://openreview.net/forum?id=Hk6WhagRWconf/iclr/2018db/conf/iclr/iclr2018.html#CaoLLLTC18Sriram Srinivasan 0005Marc LanctotVinícius Flores ZambaldiJulien PérolatKarl TuylsRémi MunosMichael BowlingActor-Critic Policy Optimization in Partially Observable Multiagent Environments.3426-34392018NeurIPShttps://proceedings.neurips.cc/paper/2018/hash/e22dd5dabde45eda5a1a67772c8e25dd-Abstract.htmlhttp://papers.nips.cc/paper/7602-actor-critic-policy-optimization-in-partially-observable-multiagent-environmentsconf/nips/2018db/conf/nips/nips2018.html#SrinivasanLZPTM18Karl TuylsJulien PérolatMarc LanctotJoel Z. LeiboThore GraepelA Generalised Method for Empirical Game Theoretic Analysis.2018abs/1803.06376CoRRhttp://arxiv.org/abs/1803.06376db/journals/corr/corr1803.html#abs-1803-06376Kris CaoAngeliki LazaridouMarc LanctotJoel Z. LeiboKarl TuylsStephen ClarkEmergent Communication through Negotiation.2018abs/1804.03980CoRRhttp://arxiv.org/abs/1804.03980db/journals/corr/corr1804.html#abs-1804-03980Martin SchmidNeil BurchMarc LanctotMatej MoravcikRudolf KadlecMichael BowlingVariance Reduction in Monte Carlo Counterfactual Regret Minimization (VR-MCCFR) for Extensive Form Games using Baselines.2018abs/1809.03057CoRRhttp://arxiv.org/abs/1809.03057db/journals/corr/corr1809.html#abs-1809-03057Sriram Srinivasan 0005Marc LanctotVinícius Flores ZambaldiJulien PérolatKarl TuylsRémi MunosMichael BowlingActor-Critic Policy Optimization in Partially Observable Multiagent Environments.2018abs/1810.09026CoRRhttp://arxiv.org/abs/1810.09026db/journals/corr/corr1810.html#abs-1810-09026Joel Z. LeiboVinícius Flores ZambaldiMarc LanctotJanusz MareckiThore GraepelMulti-agent Reinforcement Learning in Sequential Social Dilemmas.464-4732017AAMAShttp://dl.acm.org/citation.cfm?id=3091194conf/atal/2017db/conf/atal/aamas2017.html#LeiboZLMG17Marc LanctotVinícius Flores ZambaldiAudrunas GruslysAngeliki LazaridouKarl TuylsJulien PérolatDavid SilverThore GraepelA Unified Game-Theoretic Approach to Multiagent Reinforcement Learning.4190-42032017NIPShttps://proceedings.neurips.cc/paper/2017/hash/3323fe11e9595c09af38fe67567a9394-Abstract.htmlhttp://papers.nips.cc/paper/7007-a-unified-game-theoretic-approach-to-multiagent-reinforcement-learningconf/nips/2017db/conf/nips/nips2017.html#LanctotZGLTPSG17Joel Z. LeiboVinícius Flores ZambaldiMarc LanctotJanusz MareckiThore GraepelMulti-agent Reinforcement Learning in Sequential Social Dilemmas.2017abs/1702.03037CoRRhttp://arxiv.org/abs/1702.03037db/journals/corr/corr1702.html#LeiboZLMG17Todd HesterMatej VeceríkOlivier PietquinMarc LanctotTom SchaulBilal PiotAndrew SendonarisGabriel Dulac-ArnoldIan OsbandJohn P. AgapiouJoel Z. LeiboAudrunas GruslysLearning from Demonstrations for Real World Reinforcement Learning.2017abs/1704.03732CoRRhttp://arxiv.org/abs/1704.03732db/journals/corr/corr1704.html#HesterVPLSPSDOA17Peter SunehagGuy LeverAudrunas GruslysWojciech Marian CzarneckiVinícius Flores ZambaldiMax JaderbergMarc LanctotNicolas SonneratJoel Z. LeiboKarl TuylsThore GraepelValue-Decomposition Networks For Cooperative Multi-Agent Learning.2017abs/1706.05296CoRRhttp://arxiv.org/abs/1706.05296db/journals/corr/corr1706.html#SunehagLGCZJLSL17Marc LanctotVinícius Flores ZambaldiAudrunas GruslysAngeliki LazaridouKarl TuylsJulien PérolatDavid SilverThore GraepelA Unified Game-Theoretic Approach to Multiagent Reinforcement Learning.2017abs/1711.00832CoRRhttp://arxiv.org/abs/1711.00832db/journals/corr/corr1711.html#abs-1711-00832Karl TuylsJulien PérolatMarc LanctotGeorg OstrovskiRahul SavaniJoel Z. LeiboToby OrdThore GraepelShane LeggSymmetric Decomposition of Asymmetric Games.2017abs/1711.05074CoRRhttp://arxiv.org/abs/1711.05074db/journals/corr/corr1711.html#abs-1711-05074David SilverThomas HubertJulian SchrittwieserIoannis AntonoglouMatthew LaiArthur GuezMarc LanctotLaurent SifreDharshan KumaranThore GraepelTimothy P. LillicrapKaren SimonyanDemis HassabisMastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm.2017abs/1712.01815CoRRhttp://arxiv.org/abs/1712.01815db/journals/corr/corr1712.html#abs-1712-01815Branislav BosanskýViliam LisýMarc LanctotJirí CermákMark H. M. WinandsAlgorithms for computing strategies in two-player simultaneous move games.1-402016237Artif. Intell.https://doi.org/10.1016/j.artint.2016.03.005https://www.wikidata.org/entity/Q59209595db/journals/ai/ai237.html#BosanskyLLCW16David SilverAja HuangChris J. MaddisonArthur GuezLaurent SifreGeorge van den Driessche 0002Julian SchrittwieserIoannis AntonoglouVedavyas PanneershelvamMarc LanctotSander DielemanDominik GreweJohn NhamNal KalchbrennerIlya SutskeverTimothy P. LillicrapMadeleine LeachKoray KavukcuogluThore GraepelDemis HassabisMastering the game of Go with deep neural networks and tree search.484-4892016529Nat.7587https://doi.org/10.1038/nature16961https://www.wikidata.org/entity/Q28005460db/journals/nature/nature529.html#SilverHMGSDSAPL16Chrisantha FernandoDylan BanarseMalcolm ReynoldsFrederic BesseDavid PfauMax JaderbergMarc LanctotDaan WierstraConvolution by Evolution: Differentiable Pattern Producing Networks.109-1162016GECCOhttps://doi.org/10.1145/2908812.2908890conf/gecco/2016db/conf/gecco/gecco2016.html#FernandoBRBPJLW16Ziyu Wang 0001Tom SchaulMatteo HesselHado van HasseltMarc LanctotNando de FreitasDueling Network Architectures for Deep Reinforcement Learning.1995-20032016ICMLhttp://proceedings.mlr.press/v48/wangf16.htmlconf/icml/2016db/conf/icml/icml2016.html#WangSHHLF16Audrunas GruslysRémi MunosIvo DanihelkaMarc LanctotAlex GravesMemory-Efficient Backpropagation Through Time.4125-41332016NIPShttps://proceedings.neurips.cc/paper/2016/hash/a501bebf79d570651ff601788ea9d16d-Abstract.htmlhttp://papers.nips.cc/paper/6221-memory-efficient-backpropagation-through-timeconf/nips/2016db/conf/nips/nips2016.html#GruslysMDLG16Chrisantha FernandoDylan BanarseMalcolm ReynoldsFrederic BesseDavid PfauMax JaderbergMarc LanctotDaan WierstraConvolution by Evolution: Differentiable Pattern Producing Networks.2016abs/1606.02580CoRRhttp://arxiv.org/abs/1606.02580db/journals/corr/corr1606.html#FernandoBRBPJLW16Audrunas GruslysRémi MunosIvo DanihelkaMarc LanctotAlex GravesMemory-Efficient Backpropagation Through Time.2016abs/1606.03401CoRRhttp://arxiv.org/abs/1606.03401db/journals/corr/corr1606.html#GruslysMDLG16Viliam LisýMarc LanctotMichael H. BowlingOnline Monte Carlo Counterfactual Regret Minimization for Search in Imperfect Information Games.27-362015AAMAShttp://dl.acm.org/citation.cfm?id=2772887conf/atal/2015db/conf/atal/aamas2015.html#LisyLB15Johannes HeinrichMarc LanctotDavid SilverFictitious Self-Play in Extensive-Form Games.805-8132015ICMLhttp://proceedings.mlr.press/v37/heinrich15.htmlconf/icml/2015db/conf/icml/icml2015.html#HeinrichLS15Ziyu Wang 0001Nando de FreitasMarc LanctotDueling Network Architectures for Deep Reinforcement Learning.2015abs/1511.06581CoRRhttp://arxiv.org/abs/1511.06581db/journals/corr/corr1511.html#WangFL15Tom PepelsMark H. M. WinandsMarc LanctotReal-Time Monte Carlo Tree Search in Ms Pac-Man.245-25720146IEEE Trans. Comput. Intell. AI Games3https://doi.org/10.1109/TCIAIG.2013.2291577https://www.wikidata.org/entity/Q56883109db/journals/tciaig/tciaig6.html#PepelsWL14Marc LanctotFurther developments of extensive-form replicator dynamics using the sequence-form representation.1257-12642014AAMAShttp://dl.acm.org/citation.cfm?id=2617448conf/atal/2014db/conf/atal/aamas2014.html#Lanctot14Marc LanctotMark H. M. WinandsTom PepelsNathan R. SturtevantMonte Carlo Tree Search with heuristic evaluations using implicit minimax backups.1-82014CIGhttps://doi.org/10.1109/CIG.2014.6932903conf/cig/2014db/conf/cig/cig2014.html#LanctotWPS14Mandy J. W. TakMarc LanctotMark H. M. WinandsMonte Carlo Tree Search variants for simultaneous move games.1-82014CIGhttps://doi.org/10.1109/CIG.2014.6932889conf/cig/2014db/conf/cig/cig2014.html#TakLW14Tom PepelsTristan CazenaveMark H. M. WinandsMarc LanctotMinimizing Simple and Cumulative Regret in Monte-Carlo Tree Search.1-152014CGW@ECAIhttps://doi.org/10.1007/978-3-319-14923-3_1conf/ecai/2014cgwdb/conf/ecai/cgw2014.html#PepelsCWL14Tom PepelsMandy J. W. TakMarc LanctotMark H. M. WinandsQuality-based Rewards for Monte-Carlo Tree Search Simulations.705-7102014ECAIhttps://doi.org/10.3233/978-1-61499-419-0-705conf/ecai/2014db/conf/ecai/ecai2014.html#PepelsTLW14Marc J. V. PonsenSteven de JongMarc LanctotComputing Approximate Nash Equilibria and Robust Best-Responses Using Sampling.2014abs/1401.4591CoRRhttp://arxiv.org/abs/1401.4591db/journals/corr/corr1401.html#PonsenJL14Marc LanctotMark H. M. WinandsTom PepelsNathan R. SturtevantMonte Carlo Tree Search with Heuristic Evaluations using Implicit Minimax Backups.2014abs/1406.0486CoRRhttp://arxiv.org/abs/1406.0486db/journals/corr/corr1406.html#LanctotWPS14Marc LanctotMark H. M. WinandsLOA Wins Lines of Action Tournament.239-240201336J. Int. Comput. Games Assoc.4https://doi.org/10.3233/ICG-2013-36416db/journals/icga/icga36.html#LanctotW13Marc LanctotMark H. M. WinandsSIA Wins Surakarta Tournament.241201336J. Int. Comput. Games Assoc.4https://doi.org/10.3233/ICG-2013-36418db/journals/icga/icga36.html#LanctotW13aMarkus EsserMichael GrasMark H. M. WinandsMaarten P. D. SchaddMarc LanctotImproving Best-Reply Search.125-1372013Computers and Gameshttps://doi.org/10.1007/978-3-319-09165-5_11conf/cg/2013db/conf/cg/cg2013.html#EsserGWSL13Todd W. NellerMarc LanctotDevika SubramanianStephanie E. AugustModel AI Assignments 2013.2013EAAIhttps://doi.org/10.1609/aaai.v27i3.19009conf/eaai/2013db/conf/eaai/eaai2013.html#NellerLSA13Marc LanctotViliam LisýMark H. M. WinandsMonte Carlo Tree Search in Simultaneous Move Games with Applications to Goofspiel.28-432013CGW@IJCAIhttps://doi.org/10.1007/978-3-319-05428-5_3https://www.wikidata.org/entity/Q59209609conf/ijcai/2013cgwdb/conf/ijcai/cgw2013.html#LanctotLW13Marc LanctotAbdallah SaffidineJoel VenessChristopher ArchibaldMark H. M. WinandsMonte Carlo *-Minimax Search.2013IJCAIhttp://www.aaai.org/ocs/index.php/IJCAI/IJCAI13/paper/view/6862http://ijcai.org/Abstract/13/093conf/ijcai/2013db/conf/ijcai/ijcai2013.html#LanctotSVAW13580-586Viliam LisýVojtech KovaríkMarc LanctotBranislav BosanskýConvergence of Monte Carlo Tree Search in Simultaneous Move Games.2112-21202013NIPShttps://proceedings.neurips.cc/paper/2013/hash/1579779b98ce9edb98dd85606f2c119d-Abstract.htmlhttp://papers.nips.cc/paper/5145-convergence-of-monte-carlo-tree-search-in-simultaneous-move-gamesconf/nips/2013db/conf/nips/nips2013.html#LisyKLB13Marc LanctotAbdallah SaffidineJoel VenessChristopher ArchibaldMark H. M. WinandsMonte Carlo *-Minimax Searchhttp://arxiv.org/abs/1304.60572013CoRRabs/1304.6057db/journals/corr/corr1304.html#abs-1304-6057Viliam LisýVojtech KovaríkMarc LanctotBranislav BosanskýConvergence of Monte Carlo Tree Search in Simultaneous Move Games.2013CoRRhttp://arxiv.org/abs/1310.8613abs/1310.8613db/journals/corr/corr1310.html#LisyKLB13Richard G. GibsonMarc LanctotNeil BurchDuane SzafronMichael BowlingGeneralized Sampling and Variance in Counterfactual Regret Minimization.2012AAAIhttps://doi.org/10.1609/aaai.v26i1.8241conf/aaai/2012db/conf/aaai/aaai2012.html#GibsonLBSB121355-1361Michael JohansonNolan BardMarc LanctotRichard G. GibsonMichael BowlingEfficient Nash equilibrium approximation through Monte Carlo counterfactual regret minimization.837-8462012AAMAShttp://dl.acm.org/citation.cfm?id=2343816conf/atal/2012db/conf/atal/aamas2012.html#JohansonBLGB12Marc LanctotRichard G. GibsonNeil BurchMichael BowlingNo-Regret Learning in Extensive-Form Games with Imperfect Recall.2012ICMLhttp://icml.cc/2012/papers/58.pdfconf/icml/2012db/conf/icml/icml2012.html#LanctotGBB12Richard G. GibsonNeil BurchMarc LanctotDuane SzafronEfficient Monte Carlo Counterfactual Regret Minimization in Games with Many Player Actions.1889-18972012NIPShttps://proceedings.neurips.cc/paper/2012/hash/3df1d4b96d8976ff5986393e8767f5b2-Abstract.htmlhttp://papers.nips.cc/paper/4569-efficient-monte-carlo-counterfactual-regret-minimization-in-games-with-many-player-actionsconf/nips/2012db/conf/nips/nips2012.html#GibsonBLS12Marc LanctotRichard G. GibsonNeil BurchMartin ZinkevichMichael H. BowlingNo-Regret Learning in Extensive-Form Games with Imperfect Recallhttp://arxiv.org/abs/1205.06222012CoRRabs/1205.0622db/journals/corr/corr1205.html#abs-1205-0622Marc J. V. PonsenSteven de JongMarc LanctotComputing Approximate Nash Equilibria and Robust Best-Responses Using Sampling.575-605201142J. Artif. Intell. Res.https://doi.org/10.1613/jair.3402db/journals/jair/jair42.html#PonsenJL11Joel VenessMarc LanctotMichael H. BowlingVariance Reduction in Monte-Carlo Tree Search.1836-18442011NIPShttps://proceedings.neurips.cc/paper/2011/hash/d736bb10d83a904aefc1d6ce93dc54b8-Abstract.htmlhttp://papers.nips.cc/paper/4288-variance-reduction-in-monte-carlo-tree-searchconf/nips/2011db/conf/nips/nips2011.html#VenessLB11Marc J. V. PonsenMarc LanctotSteven de JongMCRNR: Fast Computing of Restricted Nash Responses by Means of Sampling.2010Interactive Decision Theory and Game Theoryhttp://aaai.org/ocs/index.php/WS/AAAIW10/paper/view/1985conf/aaai/2010gamedb/conf/aaai/game2010.html#PonsenLJ10Marc LanctotKevin WaughMartin ZinkevichMichael H. BowlingMonte Carlo Sampling for Regret Minimization in Extensive Games.1078-10862009NIPShttps://proceedings.neurips.cc/paper/2009/hash/00411460f7c92d2124a67ea0f4cb5f85-Abstract.htmlhttp://papers.nips.cc/paper/3713-monte-carlo-sampling-for-regret-minimization-in-extensive-gamesconf/nips/2009db/conf/nips/nips2009.html#LanctotWZB09Franisek SailerMichael BuroMarc LanctotAdversarial Planning Through Strategy Simulation.80-872007CIGhttps://doi.org/10.1109/CIG.2007.368082conf/cig/2007db/conf/cig/cig2007.html#SailerBL07John P. AgapiouThomas W. Anthony 0001Thomas Anthony 0001Ioannis AntonoglouChristopher ArchibaldStephanie E. AugustMohammad Gheshlaghi AzarYoram BachrachPierre BaldiDavid BalduzziDylan BanarseGuy BarashNolan BardNathalie BeauguerlangeMarc G. BellemareHeymann BenjaminFrederic BesseAvishkar BhoopchandVincent de BoerSebastian BorgeaudBranislav BosanskýMichael H. BowlingMichael BowlingJames BradburyNoam BrownKalesha BullardNeil BurchMichael BuroKris CaoMauricio Castillo-EffenTristan CazenaveSarah H. CenJiri CermakJirí CermákSarath ChandarNiyati ChhayaAlden ChristiansonPeter ClarkStephen ClarkJerome T. ConnorWojciech Czarnecki 0001Wojciech M. CzarneckiWojciech Marian CzarneckiLuca D'Amico-WongIvo DanihelkaVibhavari DasagiJoshua DavidsonElnaz DavoodiSander DielemanDavid DingRyan D'OrazioGeorge van den Driessche 0002Edgar A. Duéñez-GuzmánGabriel Dulac-ArnoldVincent DumoulinIain DunningTom EcclesRomuald ElieHuáscar EspinozaMarkus EsserRichard Everett 0001Timo EwaldsEitan FarchiGabriele FarinaRyan Faulkner 0001Chrisantha FernandoJakob N. FoersterRoy FoxNando de FreitasChristopher W. GeibIan GempRichard G. GibsonThore GraepelMichael GrasAlex GravesAmy GreenwaldAmy R. GreenwaldDominik GreweAudrunas GruslysArthur GuezOdd Erik GundersenAbhinav Gupta 0002Andras GyorgyDemis HassabisHado van HasseltNicolas HeessSeán Ó hÉigeartaighJohannes HeinrichDaniel HennesJosé Hernández-OralloMatteo HesselTodd HesterG. Zacharias HollandDaniel HorganDan HorganChiori HoriAja HuangXiaowei Huang 0001Thomas HubertThomas C. HudsonEdward Hughes 0001Max JaderbergKokil JaidkaMichael JohansonSteven de JongRudolf KadlecMichael KaisersNal KalchbrennerPavan KapanipathiKoray KavukcuogluSarah KerenMina KhanSeokhwan KimJ. Zico KolterAnna KoopVojtech KovaríkJános KramárChristian KroerDharshan KumaranMatthew LaiDanny LangeJohn B. LanierHugo LarochelleKate LarsonAngeliki LazaridouMadeleine LeachShane LeggJoel Z. LeiboJean-Baptiste LespiauGuy LeverZun Li 0002Timothy P. LillicrapViliam LisýSiqi Liu 0002Edward LockhartNicolas LoizouChris J. MaddisonAleksandra MalyshevaYiran MaoJanusz MareckiLuke MarrisDavid R. MartinezMarwan MattarMausamStephen McAleerStephen Marcus McAleerJulian J. McAuleyKevin R. McKeeMartin MichalowskiReuth MirskyIoannis MitliagkasSubhodeep MoitraMatej MoravcikMatej MoravcíkDustin MorrillRoozbeh MottaghiShibl MouradPaul MullerRémi MunosTodd W. NellerJohn NhamShayegan OmidshafieiToby OrdPedro A. OrtegaIan OsbandJoseph C. OsbornGeorg OstrovskiSherjil OzairVedavyas PanneershelvamChristos H. PapadimitriouEmilio ParisottoDavid C. ParkesPaavo ParmasRoma PatelTom PepelsJulien PérolatSarah PerrinDavid PfauOlivier PietquinGeorgios PiliourasBilal PiotTobias PohlenToby PohlenMarc J. V. PonsenNicolas PorcelJohn QuanMalcolm ReynoldsMark Rowland 0001Jonah Ryan-DavisBrennan SaetaAbdallah SaffidineFranisek SailerTuomas SandholmReca SarfatiRahul SavaniMaarten P. D. SchaddTom SchaulMartin SchmidJulian SchrittwieserJohn SchultzAndrew SendonarisArash Shaban-NejadOnn ShehoryLaurent SifreDavid SilverKaren SimonyanSatinder Singh 0001Max Olan SmithSamuel SokotaH. Francis SongNicolas SonneratSriram Srinivasan 0005Biplav SrivastavaWilliam W. StreileinFlorian StrubNathan R. SturtevantDevika SubramanianPeter SunehagMichal SustrIlya SutskeverDavid SychrovskyDuane SzafronCsaba SzepesváriAndrea TacchettiMandy J. W. TakKartik TalamadupulaBrian TannerEugene TarassovFinbarr TimbersJulian TogeliusKarl TuylsSatyaki UpadhyayMatej VeceríkJoel VenessBart De VylderKevin A. WangKevin Wang 0003Zhe WangZiyu Wang 0001Kevin WaughMichael P. WellmanDaan WierstraMark H. M. WinandsJames R. WrightKoichiro YoshinoVinícius Flores ZambaldiHugh ZhangQuanshi ZhangMartin ZinkevichImed Zitouni