Staff Research Engineer, Cohere

Model Efficiency

Salary not provided
Senior and Expert level
Remote in Canada, US
New York
San Francisco Bay Area
Toronto

Cohere

Natural language processing AI platform

Open for applications

501-1000 employees

B2B, Artificial Intelligence, Machine Learning, SaaS

Company mission

To build machines that understand the world, and to make them safely accessible to all.

Role

Who you are

  • Have a PhD in Machine Learning or a related field
  • Understand LLM architecture, and how to optimize LLM inference given resource constraints
  • Have significant experience with one or more techniques that enhance model efficiency
  • Have strong software engineering skills
  • Have an appetite to work in a fast-paced, high-ambiguity start-up environment
  • Have published and presented at top-tier conferences and venues (ICLR, ACL, NeurIPS)
  • Are passionate about mentoring others
  • If you consider yourself a thoughtful worker, a lifelong learner, and a kind and playful team member, Cohere is the place for you

What the job involves

  • Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers
  • Cohere is a team of researchers, engineers, designers, and more, who are passionate about their craft
  • Large Language Models (LLMs) have demonstrated remarkable performance across various tasks
  • However, the substantial computational and memory requirements of LLM inference pose challenges for deployment
  • The mission of the model efficiency team is to push the limits of serving efficiency for our foundation models with techniques such as model architecture optimization, efficient algorithms, and software/hardware co-optimization
  • As a Staff Research Engineer in the model efficiency team, you will develop innovative solutions to boost the performance of LLM inference

Our take

The AI industry's biggest stakeholders insist that the next step in enhancing web experiences will be through natural language interactions. While this may be true, only companies with considerable resources can realistically close the gap needed to bring NLP to the forefront of web integration. Cohere was founded to address this shortfall between NLP development, widespread usage, and the resources required.

Cohere builds top-of-the-class NLP models and software, which it makes available to customers through APIs and custom solutions. In a strategic move, the company has partnered with Google Cloud, giving it access to resources that will undoubtedly push the field of NLP forward. This allows Cohere to serve as a middleman, offering a wide range of customers a product built on unmatched technology.

AI is currently a worldwide talking point, so it is no surprise that Cohere has seen rapid development and raised $270M in a 2023 funding round. The company plans to introduce a dialogue model that would resemble ChatGPT; in this case, however, its technology would be mainly accessible to developers and businesses.

Steph

Company Specialist at Welcome to the Jungle

Insights

Some candidates hear back within 2 weeks

192% employee growth in 12 months

Company

Funding (last 2 of 5 rounds)

Jul 2024: $500m (Series D)
Mar 2023: $270m (Series C)

Total funding: $940m

Company benefits

  • We cover 100% of premiums across health, dental, vision & travel
  • We offer RRSP, 401(k), and Pension Scheme contributions
  • We offer lunch and fitness studio credits and a quality time fund which can be spent on things like dog walking and laundry services
  • We want everyone who contributes to our success to get a ‘piece of the pie’
  • Work Remote & PTO
  • We know that some people work better at home, so we are remote-first and make sure that all meetings, events, and opportunities are set up for our distributed team
  • Every employee at Cohere gets 6 weeks of paid time off & US and Canadian federal holidays, and we provide unlimited sick days
  • All new parents (including those who adopt or go through a surrogate journey) are eligible to receive 100% salary for 6 months in Canada, the US, and the UK
  • The decision and journey to having kids is different for each person and family. Part of our commitment to diversity is supporting as wide a variety of these scenarios as we can, so we offer financial support for egg freezing and IVF for those in the US, Canada, and the UK
  • Every employee at Cohere receives an annual $2,000 education fund to use as they please

Company values

  • We build for a positive future
  • We build for the many
  • We always stay curious
  • Now never stops, and that's pretty fun

Company HQ

Grange Park, Toronto, ON

Leadership

Has a PhD in Computer Science from Oxford University. Previously worked as a Researcher at Google (Brain) and FOR.ai.

Previous experience working in research and engineering at FOR.ai, Cortex Labs, Pressly and Ranomics.

Previously worked as a Research Engineer at Google (Brain).
