Adam Karvonen
About

Posts

  • Apr 13, 2025

    Frontier AI Models Still Fail at Basic Physical Tasks: A Manufacturing Case Study

  • Jul 21, 2024

    Using an LLM perplexity filter to detect weight exfiltration

  • Jun 12, 2024

    Evaluating Sparse Autoencoders with Board Games

  • Jun 11, 2024

    An Intuitive Explanation of Sparse Autoencoders for LLM Interpretability

  • Mar 20, 2024

    Manipulating Chess-GPT's World Model

  • Jan 3, 2024

    Chess-GPT's Internal World Model

subscribe via RSS

Adam Karvonen

  • Adam Karvonen
  • adam.karvonen@gmail.com
  • adamkarvonen
  • a_karvonen

Adam Karvonen's personal website