Software Engineer, Anthropic

Safeguards

$240-325k

Offers optional equity donation matching

SQL

Python

Tensorflow

Scikit-Learn

PyTorch

Mid, Senior and Expert level

London

More information about location

AI safety and research company

Open for applications

AI safety and research company

1001+ employees

B2BArtificial IntelligenceDeep TechMachine LearningSaaS

Open for applications

$240-325k

Offers optional equity donation matching

SQL

Python

Tensorflow

Scikit-Learn

PyTorch

Mid, Senior and Expert level

London

More information about location

1001+ employees

B2BArtificial IntelligenceDeep TechMachine LearningSaaS

Company mission

To create reliable, interpretable, and steerable AI systems.

Job

Company

Role

Who you are

Bachelor’s degree in Computer Science, Software Engineering or comparable experience
3-8+ years of experience in a software engineering position, preferably with a focus on integrity, spam, fraud, or abuse detection
Proficiency in SQL, Python, and data analysis tools
Strong communication skills and ability to explain complex technical concepts to non-technical stakeholders

Desirable

Have experience building trust and safety mechanisms for AI/ML systems, such as fraud detection models or security monitoring tools or the infrastructure to support these systems at scale
Have experience with machine learning frameworks like Scikit-Learn, Tensorflow, or Pytorch, and experience building machine learning models
Have experience with prompt engineering, jailbreak attacks, and other adversarial inputs
Have worked closely with operational teams to build custom internal tooling

What the job involves

We are looking for software engineers to help build safety and oversight mechanisms for our AI systems
As a trust and safety software engineer, you will work to monitor models, prevent misuse, and ensure user well-being
This role will focus on building systems to detect unwanted model behaviors and prevent disallowed use of models
You will apply your technical skills to uphold our principles of safety, transparency, and oversight while enforcing our terms of service and acceptable use policies
Develop monitoring systems to detect unwanted behaviors from our API partners and potentially take automated enforcement actions; surface these in internal dashboards to analysts for manual review
Build abuse detection mechanisms and infrastructure
Surface abuse patterns to our research teams to harden models at the training stage
Build robust and reliable multi-layered defenses for real-time improvement of safety mechanisms that work at scale
Analyze user reports of inappropriate content or accounts

Application process

Deadline to apply: None. Applications will be reviewed on a rolling basis

Salary benchmarks

Our take

Funded by Google and founded by ex-OpenAI employees, Anthropic is an AI company focused on building a reliable, innovative AI system. While the company is relatively new to the market it has already launched a highly anticipated AI chat assistant called Claude which is set to rival OpenAI's ChatGPT.

Similar to other chatbots, Claude is accessible through a chat interface and can provide a number of conversation and text-processing tasks. A key point of differentiation is that Claude is designed to produce less harmful outputs than that of the other chatbots that came before it.

While there is much praise for the chat assistant, there is still room for improvement with feedback reporting that Claude is worse at maths and grammar compared to its rival. However, with industry giants like Google backing this company from an early stage, it is clear Anthropic has the potential to become a leading player within the AI industry.

Freddie

Company Specialist at Welcome to the Jungle

Insights

Top investors

190% employee growth in 12 months

Glassdoor (4.3)

Trustpilot (1.7)

Company

Funding (last 2 of 12 rounds)

Mar 2025

$3.5bn

SERIES E

Jan 2025

$1bn

GROWTH EQUITY VC

Total funding: $16.3bn

Company benefits

Comprehensive health, dental, and vision insurance for you and your dependents
Inclusive fertility benefits via Carrot Fertility
Generous subsidy for OneMedical
21 weeks of paid parental leave
Unlimited PTO
Optional equity donation matching at a 3:1 ratio, up to 50% of your equity grant
401(k) plan with 4% matching
$500/month flexible wellness stipend
Commuter coverage
Annual education stipend
A home office improvement stipend when you first join
Relocation support for those moving to the Bay Area

Company HQ

SoMa, San Francisco, CA

Leadership

Dario Amodei

(CEO & Co-Founder)

Spent almost 5 years at OpenAI as a Team Lead for AI safety and as the Vice President of Research. Also worked as a Research Scientist at Google.

Jared Kaplan

(Chief Science Officer & Co-Founder)

Has been a Professor at John Hopkins University for the past 10 years. Also an experienced researcher having worked previously at OpenAI and the SLAC National Accelerator Laboratory.

Jack Clark

(Co-Founder)

Experienced reporter having worked previously at the Register and Bloomberg. Also worked for OpenAI in Strategy and Communications and as a Policy Director.

Sam McCandlish

(Co-Founder)

Graduated from Stanford University with a PhD in Theoretical Physics. Previously worked as a Research Lead at OpenAI.

Tom Brown

(Co-Founder)

Previously worked as a Member of the Technical Staff for both OpenAI and Google Brain.

Share this job

View 162 more jobs at Anthropic