Link to original content: https://dblp.dagstuhl.de/pid/209/4885.html

dblp: Miljan Martic

2020 – today

2021
[i9]
persistent URL: https://dblp.org/rec/journals/corr/abs-2103-03938
Grégoire Delétang, Jordi Grau-Moya, Miljan Martic, Tim Genewein, Tom McGrath, Vladimir Mikulik, Markus Kunesch, Shane Legg, Pedro A. Ortega:
Causal Analysis of Agent Behavior for AI Safety. CoRR abs/2103.03938 (2021)
2020
[c4]
persistent URL: https://dblp.org/rec/conf/nips/KrakovnaONML20
Victoria Krakovna, Laurent Orseau, Richard Ngo, Miljan Martic, Shane Legg:
Avoiding Side Effects By Considering Future Tasks. NeurIPS 2020
[c3]
persistent URL: https://dblp.org/rec/conf/nips/MikulikDMGMLO20
Vladimir Mikulik, Grégoire Delétang, Tom McGrath, Tim Genewein, Miljan Martic, Shane Legg, Pedro A. Ortega:
Meta-trained agents implement Bayes-optimal agents. NeurIPS 2020
[i8]
persistent URL: https://dblp.org/rec/journals/corr/abs-2010-07877
Victoria Krakovna, Laurent Orseau, Richard Ngo, Miljan Martic, Shane Legg:
Avoiding Side Effects By Considering Future Tasks. CoRR abs/2010.07877 (2020)
[i7]
persistent URL: https://dblp.org/rec/journals/corr/abs-2010-11223
Vladimir Mikulik, Grégoire Delétang, Tom McGrath, Tim Genewein, Miljan Martic, Shane Legg, Pedro A. Ortega:
Meta-trained agents implement Bayes-optimal agents. CoRR abs/2010.11223 (2020)
[i6]
persistent URL: https://dblp.org/rec/journals/corr/abs-2010-12237
Tim Genewein, Tom McGrath, Grégoire Delétang, Vladimir Mikulik, Miljan Martic, Shane Legg, Pedro A. Ortega:
Algorithms for Causal Reasoning in Probability Trees. CoRR abs/2010.12237 (2020)

2010 – 2019

2019
[c2]
persistent URL: https://dblp.org/rec/conf/ijcai/KrakovnaOML19
Victoria Krakovna, Laurent Orseau, Miljan Martic, Shane Legg:
Penalizing Side Effects using Stepwise Relative Reachability. AISafety@IJCAI 2019
2018
[i5]
persistent URL: https://dblp.org/rec/journals/corr/abs-1806-01186
Victoria Krakovna, Laurent Orseau, Miljan Martic, Shane Legg:
Measuring and avoiding side effects using relative reachability. CoRR abs/1806.01186 (2018)
[i4]
persistent URL: https://dblp.org/rec/journals/corr/abs-1811-07871
Jan Leike, David Krueger, Tom Everitt, Miljan Martic, Vishal Maini, Shane Legg:
Scalable agent alignment via reward modeling: a research direction. CoRR abs/1811.07871 (2018)
[i3]
persistent URL: https://dblp.org/rec/journals/corr/abs-1812-05979
Miljan Martic, Jan Leike, Andrew Trask, Matteo Hessel, Shane Legg, Pushmeet Kohli:
Scaling shared model governance via model splitting. CoRR abs/1812.05979 (2018)
2017
[c1]
persistent URL: https://dblp.org/rec/conf/nips/ChristianoLBMLA17
Paul F. Christiano, Jan Leike, Tom B. Brown, Miljan Martic, Shane Legg, Dario Amodei:
Deep Reinforcement Learning from Human Preferences. NIPS 2017: 4299-4307
[i2]
persistent URL: https://dblp.org/rec/journals/corr/abs-1706-03741
Paul F. Christiano, Jan Leike, Tom B. Brown, Miljan Martic, Shane Legg, Dario Amodei:
Deep reinforcement learning from human preferences. CoRR abs/1706.03741 (2017)
[i1]
persistent URL: https://dblp.org/rec/journals/corr/abs-1711-09883
Jan Leike, Miljan Martic, Victoria Krakovna, Pedro A. Ortega, Tom Everitt, Andrew Lefrancq, Laurent Orseau, Shane Legg:
AI Safety Gridworlds. CoRR abs/1711.09883 (2017)
