Research Scientist, Interpretability
About the position
Responsibilities
• Develop methods for understanding LLMs by reverse engineering algorithms learned in their weights
• Design and run robust experiments, both quickly in toy scenarios and at scale in large models
• Build infrastructure for running experiments and visualizing results
• Work with colleagues to communicate results internally and publicly
Requirements
• Have a strong track record of scientific research (in any field), and have done some work on interpretability
• Enjoy team science: working collaboratively to make big discoveries
• Are comfortable with messy experimental science. We're inventing the field as we work, and the first textbook is years away
• View research and engineering as two sides of the same coin. Every team member writes code, designs and runs experiments, and interprets results
• Can clearly articulate and discuss the motivations behind your work, and teach us about what you've learned. You like writing up and communicating your results, even when they're null
• Are familiar with Python (required for this role)
Benefits
• Competitive compensation
• Generous vacation and parental leave
• Flexible working hours
• Lovely office space in which to collaborate with colleagues
• Optional equity donation matching
Apply to this job