Interpretability Toolkit

Understanding transformer internals

2024 · Principal Investigator
Python · PyTorch · Transformers · Visualization

LLM Reasoning Benchmark

Evaluating logical reasoning in language models

2023 · Lead Researcher
Python · Evaluation · Benchmark Design