


Link to original content: https://dblp.org/pid/258/4701.rss
dblp: Newton Cheng
https://dblp.org/pid/258/4701.html
dblp person page RSS feed
Last updated: Thu, 25 Apr 2024 05:41:32 +0200
Language: en-US
Update frequency: daily
License: released under the CC0 1.0 license
Contact: dblp@dagstuhl.de (dblp team)
Category: Computers/Computer_Science/Publications/Bibliographies

Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training. CoRR abs/2401.05566 (2024)
Link: https://doi.org/10.48550/arXiv.2401.05566
dblp record: https://dblp.org/rec/journals/corr/abs-2401-05566
Date: Mon, 01 Jan 2024 00:00:00 +0100

Question Decomposition Improves the Faithfulness of Model-Generated Reasoning. CoRR abs/2307.11768 (2023)
Link: https://doi.org/10.48550/arXiv.2307.11768
dblp record: https://dblp.org/rec/journals/corr/abs-2307-11768
Date: Sun, 01 Jan 2023 00:00:00 +0100

Measuring Faithfulness in Chain-of-Thought Reasoning. CoRR abs/2307.13702 (2023)
Link: https://doi.org/10.48550/arXiv.2307.13702
dblp record: https://dblp.org/rec/journals/corr/abs-2307-13702
Date: Sun, 01 Jan 2023 00:00:00 +0100

Towards Understanding Sycophancy in Language Models. CoRR abs/2310.13548 (2023)
Link: https://doi.org/10.48550/arXiv.2310.13548
dblp record: https://dblp.org/rec/journals/corr/abs-2310-13548
Date: Sun, 01 Jan 2023 00:00:00 +0100

Specific versus General Principles for Constitutional AI. CoRR abs/2310.13798 (2023)
Link: https://doi.org/10.48550/arXiv.2310.13798
dblp record: https://dblp.org/rec/journals/corr/abs-2310-13798
Date: Sun, 01 Jan 2023 00:00:00 +0100

Topological Link Models of Multipartite Entanglement. Quantum 6: 741 (2022)
Link: https://doi.org/10.22331/q-2022-06-20-741
dblp record: https://dblp.org/rec/journals/quantum/BaoCHS22
Date: Sat, 01 Jan 2022 00:00:00 +0100

The Quantum Entropy Cone of Hypergraphs. CoRR abs/2002.05317 (2020)
Link: https://arxiv.org/abs/2002.05317
dblp record: https://dblp.org/rec/journals/corr/abs-2002-05317
Date: Wed, 01 Jan 2020 00:00:00 +0100