Papers
- Is Cosine-Similarity of Embeddings Really About Similarity?, Steck et al., 2024
- Refusal in LLMs is Mediated by a Single Direction (Pre-print Blog), Arditi et al., 2024
- Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning, Guo et al., 2014
- Distributed Deep Q-Learning, Ong et al., 2015
- Deep Reinforcement Learning from Self-Play in Imperfect-Information Games, Heinrich and Silver, 2016