Senior Site Reliability Engineer, Kentik

Platform Engineering

$186-251k

+ Stock Options

React
AWS
GCP
JavaScript
Python
Kafka
Redis
Bash
Go
Node.js
Postgres
Ruby
Terraform
Jenkins
Ansible
Saltstack
MySQL
Azure
Puppet
Chef
Prometheus
Grafana
Slack
Zoom
Git
JSON
REST API
Senior and Expert level
Remote in US
Kentik

Network observability, security, and performance

Job no longer available

201-500 employees

B2B, Security, Networking, SaaS

Company mission

To provide clients with an infrastructure visibility solution to make their business-critical operations run seamlessly.

Role

Who you are

  • While prior experience in a remote environment is not required, we highly value strong collaboration and communication skills, and a high level of independence and autonomy
  • 5+ years of experience in Systems Administration, Datacenter/IT, and/or SRE-related projects
  • Experience working with *nix system command line (e.g. ssh, grep, awk)
  • Detailed understanding of how major internet protocols work (TCP/IP, DNS, HTTP, TLS)
  • Experience with or desire to learn about microservices, containers and orchestration
  • Network administration experience: concepts such as routing, firewalls (iptables), and peering sound familiar
  • A passion for documenting code, processes, and infrastructure in runbooks and wikis
  • Strong collaboration and communication skills. Kentik is a fully remote, global company, so we are looking for someone who can work well in an asynchronous environment using tools such as Slack, Zoom, Google Docs, Git, etc.
  • Experience with a configuration management (infrastructure as code) platform such as Ansible, Puppet, Chef, SaltStack, or CFEngine
  • Experience with metrics monitoring solutions such as Grafana, Prometheus, and OpenTelemetry
  • A strong preference for automation, coding in Bash, Python, Ruby, or Go
  • Experience with public cloud (AWS, GCP, Azure, etc.) architectures and with managing cloud technologies using Terraform

What the job involves

  • Kentik's Platform Engineering group is responsible for storing, enriching, and querying traffic metadata and metrics from the world's largest networks
  • Our platform actively monitors infrastructure, triggers automated responses to outages and attacks, and plays a critical role in delivering complete network observability to our customers
  • Our platform ingests trillions of records and serves hundreds of thousands of queries for our users each day
  • The scale that this group services is tremendous
  • Platform Engineering consists of four teams: (1) Ingest and Storage, (2) Query, (3) Network Data, and (4) SRE
  • As a senior engineer in Platform Engineering, you will have the opportunity to co-own, design, and implement state-of-the-art reliability engineering to help make sure our data-intensive platform continues to play a critical role for some of the most influential companies on the internet
  • We have built a team of world-class engineers, network experts, and technology thought leaders in a remote-friendly culture from day one
  • Ensure our real-time, scalable, microservices-based infrastructure is set up for growth and working efficiently. Our infrastructure runs on our own hardware, across multiple locations as well as all major cloud vendors
  • Work on tools and processes to better monitor our platform as well as ensure its stability through our rapid growth
  • Deep-dive into diverse topics, from NetFlow and IP routing to database replication strategies and HTTP optimization
  • Collaborate with engineering and infrastructure teams on finding solutions from an operational perspective
  • Contribute code, code reviews, and tools or patches to all kinds of existing code
  • Write design documents or collaborate on colleagues’ docs to introduce new features or changes into our infrastructure
  • Provide valuable feedback on team goals, projects, and processes. We believe in continuously improving our team

Insights

10% employee growth in 12 months

Company

Company benefits

  • Dental and vision insurance
  • Parental leave
  • Stock options
  • Flexible paid time off
  • 401k plan
  • Work from home opportunities
  • Health insurance

Funding (last 2 of 5 rounds)

Oct 2021

$40m

SERIES C

May 2020

$23.5m

LATE VC

Total funding: $101.7m

Our take

The rise in internet use has resulted in an often unmanageable amount of network traffic across websites, data centers, and most recently cloud services. The resulting mass of information is hard to interpret and thus difficult to use for troubleshooting and engagement insights.

Kentik provides an integrated network monitoring dashboard for enterprises that offers visual data illustrations and insights based on traffic. This allows administrators to identify bottlenecks in their networks, while creating opportunities to strengthen cyber protection and monitor customer experiences.

The Kentik dashboard’s approach to network management is unique due to its focus on the visualization of information. This makes insights about network limitations and bottlenecks understandable and manageable for enterprises, regardless of whether staff have knowledge of networking data.

Freddie

Company Specialist at Welcome to the Jungle