Senior Site Reliability Engineer, DoubleVerify

$107-231k

This role will also be eligible for bonus/commission (as applicable) and equity

AWS
Kubernetes
GCP
Python
Bash
Linux
Go
Terraform
Splunk
Ansible
Azure
Chef
Prometheus
Grafana
Unix
Senior and Expert level
New York

3 days a week in office

DoubleVerify

Software platform for digital media

Open for applications

DoubleVerify

Software platform for digital media

1001+ employees

B2BSecurityMarketingPublishingAnalyticsSaaSAdvertising

Open for applications

$107-231k

This role will also be eligible for bonus/commission (as applicable) and equity

AWS
Kubernetes
GCP
Python
Bash
Linux
Go
Terraform
Splunk
Ansible
Azure
Chef
Prometheus
Grafana
Unix
Senior and Expert level
New York

3 days a week in office

1001+ employees

B2BSecurityMarketingPublishingAnalyticsSaaSAdvertising

Company mission

DV's mission is to give advertisers clarity and confidence in their digital investment - across buying platforms, channels and media formats.

Role

Who you are

  • Experience: 5+ years in site reliability engineering, DevOps, or a related field, with experience mentoring and educating other engineers
  • Technical Proficiency: Expertise in Linux/Unix systems administration, cloud platforms (AWS, GCP, or Azure), and container orchestration tools like Kubernetes
  • Programming Skills: Proficiency in scripting and programming languages such as Python, Go, or Bash for automation and tool development
  • Monitoring and Observability: Experience with monitoring and logging tools such as Prometheus, Grafana, Splunk, or Nagios. Proven ability to develop and track SLIs, SLOs, and SLAs
  • Automation and Infrastructure as Code: Hands-on experience automating infrastructure and deployments using tools like Terraform, Ansible, or Chef
  • Communication and Mentorship: Strong verbal and written communication skills, with a passion for mentoring and educating team members on technical concepts and SRE best practices
  • Problem-Solving Aptitude: Exceptional analytical skills with a proactive approach to identifying and resolving system issues
  • Team Collaboration: Ability to work both independently and collaboratively within a team environment

Desirable

  • Advanced Education: Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field
  • Certifications: Relevant industry certifications such as AWS Certified DevOps Engineer, Google Professional Cloud DevOps Engineer, or Certified Kubernetes Administrator (CKA)
  • Security Awareness: Familiarity with security best practices in cloud and containerized environments
  • Configuration Management: Experience with infrastructure as code and configuration management tools like Terraform, Ansible, or Chef

What the job involves

  • As a Senior Site Reliability Engineer (SRE) at DoubleVerify, you will play a critical role in building and scaling our SRE team
  • This dual-role position requires both hands-on technical expertise and a passion for mentoring and educating team members
  • You will be responsible for implementing and promoting SRE best practices, including the development and monitoring of Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Service Level Agreements (SLAs)
  • Your contributions will ensure the reliability, scalability, and performance of our digital media measurement platforms, directly impacting our mission of delivering media transparency and accountability
  • Team Development and Mentorship: Build and grow the SRE team by recruiting, mentoring, and educating team members on SRE principles, promoting a culture of reliability and automation
  • Technical Contributor: Contribute directly to the design, implementation, and maintenance of highly available infrastructure and services, with a focus on automation to minimize manual intervention
  • SLA/SLO/SLI Management: Define, monitor, and report on SLIs, SLOs, and SLAs to ensure alignment with business objectives and user expectations. Use these metrics to drive reliability improvements and guide decision-making
  • Incident Management and Response: Develop and implement robust incident response processes, including on-call rotations and post-incident reviews, to minimize downtime and prevent recurrence
  • Collaboration and Communication: Partner closely with development, operations, and product teams to integrate reliability into the software development lifecycle, promoting cross-functional collaboration
  • Continuous Improvement: Analyze system performance data to identify areas for improvement, implementing solutions to enhance reliability, scalability, and efficiency

Share this job

View 39 more jobs at DoubleVerify

Insights

Top investors

16% employee growth in 12 months

Company

Company benefits

  • Health and Fitness Reimbursement
  • Unlimited paid time off policy
  • Work from home opportunities
  • Health insurance
  • 401k
  • Healthcare coverage
  • Virtual company events
  • Tuition reimbursement

Funding (last 2 of 4 rounds)

Oct 2020

$350m

GROWTH EQUITY VC

Aug 2011

$33m

LATE VC

Total funding: $396.5m

Our take

The importance of digital advertising continues to grow, but it can be difficult for companies to protect their brand identity online. DoubleVerify solves this problem using software that monitors campaign performance in real-time.

Since it was first launched, DoubleVerify has successfully carved out an impressive market share, and now boasts customers such as digital advertising giant Facebook as long-time customers. Its success is largely owed to the quality of its technology, which far outperforms that of its rivals. DoubleVerify has consolidated its success by expanding the range of languages its tools support. It has also expanded into new markets such as Australia and New Zealand, a key indicator of the company’s continued growth.

The digital ad space is changing constantly, and DoubleVerify will need to remain as agile and adaptive as it is at present in order to compete with rivals such as Tremor and Quantcast. A 2021 IPO was followed by two rounds of funding and several product releases, including a tiered brand suitability tool as well as key executive hires.

Freddie headshot

Freddie

Company Specialist at Welcome to the Jungle