home

about

writing

March / April / May 2024 Reading

May 29, 2024

Papers

  • Is Cosine-Similarity of Embeddings Really About Similarity?, Steck et al., 2024
  • Refusal in LLMs is Mediated by a Single Direction (Pre-print Blog), Arditi et al., 2024
  • Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning, Guo et al., 2014
  • Distributed Deep Q-Learning, Ong et al., 2015
  • Deep Reinforcement Learning from Self-Play in Imperfect-Information Games, Heinrich and Silver, 2016

Blogs

  • Compound AI Systems, BAIR
  • Practicing AI research, Jason Wei
  • Successful language model evals, Jason Wei
  • Some intuitions about large language models, Jason Wei