Research Engineer / Scientist, Anthropic

Alignment

$350-500k

Kubernetes

Mid and Senior level

San Francisco Bay Area

More information about location

AI safety and research company

Open for applications

AI safety and research company

1001+ employees

B2BArtificial IntelligenceDeep TechMachine LearningSaaS

Open for applications

$350-500k

Kubernetes

Mid and Senior level

San Francisco Bay Area

More information about location

1001+ employees

B2BArtificial IntelligenceDeep TechMachine LearningSaaS

Company mission

To create reliable, interpretable, and steerable AI systems.

Job

Company

Role

Who you are

You want to build and run elegant and thorough machine learning experiments to help us understand and steer the behavior of powerful AI systems
You care about making AI helpful, honest, and harmless, and are interested in the ways that this could be challenging in the context of human-level capabilities
You could describe yourself as both a scientist and an engineer
Have significant software, ML, or research engineering experience
Have some experience contributing to empirical AI research projects
Have some familiarity with technical AI safety research
Prefer fast-moving collaborative projects to extensive solo efforts
Pick up slack, even if it goes outside your job description
Care about the impacts of AI
Education requirements: We require at least a Bachelor's degree in a related field or equivalent experience

Desirable

Have experience authoring research papers in machine learning, NLP, or AI safety
Have experience with LLMs
Have experience with reinforcement learning
Have experience with Kubernetes clusters and complex shared codebases

What the job involves

As a Research Engineer on Alignment Science, you'll contribute to exploratory experimental research on AI safety, with a focus on risks from powerful future systems (like those we would designate as ASL-3 or ASL-4 under our Responsible Scaling Policy), often in collaboration with other teams including Interpretability, Fine-Tuning, and the Frontier Red Team
Our blog provides an overview of topics that the Alignment Science team is either currently exploring or has previously explored. Our current topics of focus include..
Scalable Oversight: Developing techniques to keep highly capable models helpful and honest, even as they surpass human-level intelligence in various domains
AI Control: Creating methods to ensure advanced AI systems remain safe and harmless in unfamiliar or adversarial scenarios
Alignment Stress-testing: Creating model organisms of misalignment to improve our empirical understanding of how alignment failures might arise
Automated Alignment Research: Building and aligning a system that can speed up & improve alignment research
Representative projects:
Testing the robustness of our safety techniques by training language models to subvert our safety techniques, and seeing how effective they are at subverting our interventions
Run multi-agent reinforcement learning experiments to test out techniques like AI Debate
Build tooling to efficiently evaluate the effectiveness of novel LLM-generated jailbreaks
Write scripts and prompts to efficiently produce evaluation questions to test models’ reasoning abilities in safety-relevant contexts
Contribute ideas, figures, and writing to research papers, blog posts, and talks
Run experiments that feed into key AI safety efforts at Anthropic, like the design and implementation of our Responsible Scaling Policy

Our take

Funded by Google and founded by ex-OpenAI employees, Anthropic has emerged as one of the leading companies in AI, focused on building reliable and safe AI systems. Since its launch, the company has rapidly grown into a major player in the industry, with its AI assistant Claude now widely recognized as a strong competitor to ChatGPT.

Like other chatbots, Claude is accessible through a conversational interface and supports a wide range of text-generation and reasoning tasks. A key point of differentiation remains its emphasis on safety, with Anthropic's "Constitutional AI" approach aiming to produce more aligned and less harmful outputs than earlier generations of AI systems.

While Claude has received significant praise for its performance and alignment, some users have notes areas for improvement, particularly in technical domains such as complex mathematics and precision in grammar. However, with substantial backing freom major tech players and multi-billion-dollar funding rounds, Anthropic has firmly established itself as a long-term competitor in the AI space.

Now the company is continuing to refine Claude's capabilities while expanding globally, including growing its presence in Europe. And its ongoing commitment to responsible AI development put it in a position to be not just a competitor, but a key influence on the direction of the industry as a whole.

Freddie

Company Specialist at Welcome to the Jungle

Insights

Top investors

190% employee growth in 12 months

Glassdoor (4.5)

Trustpilot (1.4)

Company

Funding (last 2 of 15 rounds)

May 2026

$65bn

SERIES H

Feb 2026

$30bn

SERIES G

Total funding: $124.3bn

Company benefits

Comprehensive health, dental, and vision insurance for you and your dependents
Inclusive fertility benefits via Carrot Fertility
22 weeks of paid parental leave
Flexible paid time off and absence policies
Mental health support for you and your dependents
Competitive salary and equity packages
Optional equity donation matching at a 1:1 ratio, up to 25% of your equity grant
Retirement plans with competitive matching
Life and income protection plans
$500/month flexible wellness and time saver stipend
Commuter benefits
Annual education stipend
Home office stipends
Relocation support for those moving for Anthropic
Daily meals and snacks in the office

Company HQ

SoMa, San Francisco, CA

Leadership

Dario Amodei

(Co-Founder & CEO)

Prior to co-founding Anthropic in 2021, Dario spent almost 5 years at OpenAI as a Team Lead for AI safety and as the Vice President of Research. They also worked as a Research Scientist at Google.

Jared Kaplan

(Co-Founder & Chief Science Officer)

Jared hs been a Professor at John Hopkins University since 2012. They are also an experienced Researcher having worked previously at OpenAI and the SLAC National Accelerator Laboratory.

Jack Clark

(Co-Founder)

Jack is an experienced Reporter having worked previously at the Register and Bloomberg. They also worked for OpenAI in Strategy and Communications and as a Policy Director.

Sam McCandlish

(Co-Founder & CTO)

Sam graduated from Stanford University with a PhD in Theoretical Physics. They previously worked as a Research Lead at OpenAI.

Tom Brown

(Co-Founder)

Tom previously worked as a Member of the Technical Staff for both OpenAI and Google Brain.

Salary benchmarks

We don't have enough data yet to provide salary benchmarks for this role.

Submit your salary to help other candidates with crowdsourced salary estimates.

Share this job

View 244 more jobs at Anthropic