Avatar for Wayfair

Wayfair is a technology leader, reinventing the way people shop for their homes

Senior Site Reliability Engineer, Configuration Management

$145k – $200k AngelList Est.
Apply now

Senior Site Reliability Engineer, Configuration Management

Boston, MA

Wayfair is a leader in the e-commerce space for all things home. We live and breathe modern technologies. We are a “move fast break things, rethink old standards” team with a startup feel but working with platforms at a massive scale.

We’re looking for smart, logical thinkers who produce and advocate for performant and scalable architecture. We care about thought leadership, community involvement, and the ever-changing SRE landscape. We’re particularly interested in engineers who can help us develop our Platform scaling and Config management strategy and help us adopt, implement and support popular mainstream configuration management platforms like HashiCorp Consul, Puppet, HashiCorp Vault into our existing infrastructure for the purposes of automation and ease of use for both internal and external stakeholders.

On the Platform Scaling team as a Senior/Staff Site Reliability Engineer you’ll have a multitude of opportunities to flex your strengths as well as learn new things while directly assisting our internal customers. We contribute to (and create) bleeding-edge open source projects and continuously push the envelope to explore the future of e-commerce and modern infrastructure systems. Our current scale is in 20,000+ systems comprising 50+ platforms and services (and growing fast!) across multiple global geo locales and GCP regions.

What You’ll Do:

  • Manage central platforms as a service for rapid growth and scale that enable a developer community of 2,000 write and deploy code multiple times/day
  • Develop monitoring, define SLAs, SLOs and error budgets for mission critical platforms while helping coordinate product launches and reliability exercises
  • Write clean, high-performance, and well tested, infrastructure code with a focus on reusability and automation (Shell, Python, GoLang, Puppet)
  • Help determine the future roadmap of platforms and services in service discovery, configuration orchestration, and secret management
  • Create and maintain detailed documentation for both self-service and onboarding
  • Help build our team out by mentoring junior engineers and help develop their skills while assisting them on projects

What You’ll Need:

  • 6+ years of experience in systems and/or software engineering and the SRE and DevOps paradigms
  • Experience in one or more programming languages used in modern infrastructure paradigms (Ruby, Python, Go, PHP, etc.), as well as familiarity with version control platforms such as Git
  • Experience working with configuration and orchestration management tools (Puppet, Ansible, HashiCorp Consul and HashiCorp Vault)
  • Experience deploying and managing infrastructure within a public cloud provider as a part of a hybrid environment with high availability requirements
  • Expertise in performance testing tools and SRE best practices

Good things to have:

  • Experience managing a full application stack with high availability requirements.
  • Knowledge of Hashicorp product - Consul, Vault.
  • Involvement in some on-premise to cloud migration
  • Experience with performance tuning on Linux kernels.
  • Expertise in performance testing tools and best practices
  • Ability to communicate effectively, both verbally and in writing
  • Proven ability to collaborate and work well within a team.

About Us:

Wayfair is one of the world’s largest online destinations for the home. Whether you work in our global headquarters in Boston or Berlin, or in our warehouses or offices throughout the world, we’re reinventing the way people shop for their homes. Through our commitment to industry-leading technology and creative problem-solving, we are confident that Wayfair will be home to the most rewarding work of your career. If you’re looking for rapid growth, constant learning, and dynamic challenges, then you’ll find that amazing career opportunities are knocking.

No matter who you are, Wayfair is a place you can call home. We’re a community of innovators, risk-takers, and trailblazers who celebrate our differences, and know that our unique perspectives make us stronger, smarter, and well-positioned for success. We value and rely on the collective voices of our employees, customers, community, and suppliers to help guide us as we build a better Wayfair – and world – for all. Every voice, every perspective matters. That’s why we’re proud to be an equal opportunity employer. We do not discriminate on the basis of race, color, ethnicity, ancestry, religion, sex, national origin, sexual orientation, age, citizenship status, marital status, disability, gender identity, gender expression, veteran status, or genetic information.

Wayfair at a glance

Wayfair is a technology leader, reinventing the way people shop for their homes

Wayfair focuses on E-Commerce, Home & Garden, Home Decor, and Furniture. Their company has offices in Boston and Berlin. They have a large team that's between 501-1000 employees. To date, Wayfair has raised $358M of funding; their latest round was closed on October 2014 at a valuation of $2.4B.

You can view their website at http://www.wayfair.com/ or find them on Twitter, Facebook, and LinkedIn.

More jobs at Wayfair

View all jobs

Site Reliability Engineer, Configuration Management

Senior Manager, Site Reliability Engineering

Senior Manager, Site Reliability Engineering

Executive Leadership, Engineering

Industrial Engineer

Similar jobs to Senior Site Reliability Engineer, Configuration Management at Wayfair

Avatar for Ourglass
A private space to share your day with 12 friends
Avatar for HigherMe
Helping retail & hourly employers find, screen, and hire better employees faster
Avatar for Staked
Institutional grade staking and lending-based passive yield services for cryptocurrency
Avatar for Squarelink
End-to-end onboarding solution for decentralized applications
Avatar for 1upHealth
Healthcare API patform for applications to connect to EHR data in minutes