My research centers on interpretable and efficient reasoning in AI. I aim to make AI systems more trustworthy as they increasingly shape decisions in everyday life, while also helping ensure that advanced reasoning capabilities remain broadly accessible rather than concentrated among only a few actors.
My work spans a spectrum from verbalized reasoning to mechanistically interpretable reasoning. This includes making AI agents more capable and efficient through inference-time rule distillation, as well as developing model editing methods that improve how factual knowledge is updated inside large language models.