Avatar for Datadog

Modern monitoring & analytics. See inside any stack, any app, at any scale, anywhere

Software Engineer - Web Reliability

$105k – $145k AngelList Est.
Apply now

About Datadog:

We're on a mission to build the best platform in the world for engineers to understand and scale their systems, applications, and teams. We operate at high scale—trillions of data points per day—providing always-on alerting, metrics visualization, logs, and application tracing for tens of thousands of companies. Our engineering culture values pragmatism, honesty, and simplicity to solve hard problems the right way.

The team:

At Datadog, Web Reliability Engineers are strong developers as well as have a good background in systems; blending deep and practical knowledge of infrastructure design and deployments. They are at the frontline maintaining and expanding the capabilities of our web-facing applications and infrastructure.

The opportunity:

We’re looking for Web Reliability Engineers to join our new Web Reliability Engineering team. Today Datadog runs in multiple datacenters, with hundreds of engineers contributing to our APIs. As we continue to grow we have found challenges specific to the tooling our teams use to develop on our API’s. Additionally this team needs to build reliable tooling to enable developers to easily depend on a uniform set of tooling that can also give them accurate observability into their APIs.

You will:

  • Provide internal tooling and frameworks that empower teams to develop, maintain and manage web-facing applications.
  • Codify proven practices to improve developer experience and service reliability.
  • Explore new ways to strengthen, automate, deploy and manage our web facing infrastructure
  • Define the future of our API platform and supporting infrastructure.
  • Enable engineering teams to self-service, self-report day-to-day operations dealing with web-facing applications.


  • You have experience contributing to a software engineering team
  • Experience in 24x7 production environments
  • You have production experience with distributed web applications, e.g. haproxy, ALB, redis, authn/authz
  • You have a track record as an engineer in the operations of a large site
  • You value correctness and efficiency; you leave no stone unturned when diagnosing production issues
  • You handle infrastructure with code because automation lets you focus on the more difficult and rewarding problems

Bonus points:

  • You have experience with building large scale web-facing applications in a heterogeneous environment.
  • You are fully fluent in python or go.
Paris • EugeneRemote
Hires remotely
Job type
Visa sponsorship
Not Available

Medical insurance

Retirement savings plan

Open paid time off

Catered lunches

Snacks & drinks

Fitness fund

Commuter benefits

Outings & events

Referral bonus

Datadog at a glance

Modern monitoring & analytics. See inside any stack, any app, at any scale, anywhere

Datadog focuses on SaaS, Enterprise Software, Information Technology, Analytics, and Software. Their company has offices in New York City, San Francisco, New York, Boston, and Chicago. They have a very large team that's between 1001-5000 employees. To date, Datadog has raised $147.9M of funding; their latest round was closed on September 2019 at a valuation of $11B.

You can view their website at https://www.datadoghq.com or find them on Twitter and LinkedIn.

More jobs at Datadog

View all jobs

Open-Source Software Engineer - .NET / C#

Open-Source Software Engineer - .NET / C#

Software Engineer - Alerting

Software Engineer - Compute

Software Engineer - Cloud Metrics

Similar jobs to Software Engineer - Web Reliability at Datadog

Avatar for Data&Data
Real-time, omni-channel detection of digital counterfeiting and grey market sales
Avatar for EXPLOY
Difficult roads often lead to beautiful destinations. "Bon Voyage"
Avatar for Adok
Make meetings engaging with hands-on collaboration
Avatar for Shine
All-in-one bank account for entrepreneurs ✨
Avatar for DIMPL
Allowing businesses to get paid on time
Avatar for Project POP
Notre mission : transformer le management en entreprise
Avatar for Selectra
Global leader for utility price comparison
Avatar for MANSA
Mansa reinvents the traditional bank’s scoring model