Our software is designed to make your storage management much, much easier
Site Reliability Engineer
About the position:
As a Qumulo Member of Technical Staff in our SRE team, you will create, maintain, and operate our managed service, which allows end users to provision performant and easy-to-use scale out storage at a click of a button. You will also develop, maintain, and operate our telemetry service which collects data from our worldwide fleet of clusters.
As a member of our SRE team, you will contribute to the research and implement cutting edge tools and technologies to support our services. You will collaborate with many roles on our engineering and customer success teams to help us deliver an outstanding file data platform to our customers.
About the company:
At Qumulo, we are building a file data platform that is the foundation of our customers’ innovation and growth. Increasingly, businesses depend on hundreds of terabytes to petabytes of unstructured file data to store data like videos, images, security logs, sensor inputs, and genome sequences. Qumulo’s platform enables customers to easily manage that scale and gives them the flexibility to use that data either in their data centers or in the public cloud.
Founded in 2012, Qumulo is a Seattle-based company with over 300 employees world-wide. Our mission is simple: to enable customers to manage their file data at scale with unrivaled freedom, control and real-time visibility. Learn more about how we are helping our wide variety of customers innovate at www.qumulo.com.
At Qumulo, our values define who we are and bring us together. At Qumulo, you will work with individuals who are data driven and have a desire to work in collaborative team environments. Our engineering team emphasizes good engineering practices and you will work in the areas of database, operating systems, and distributed system theory. Our team is passionate about learning and solving hard problems in a customer focused way. In building the future of storage, our engineering culture is action oriented and motivated to see results for our customers.
As a member of our SRE team, you will work collaboratively with our development teams to build a managed service that is easy to operate and has extremely high uptimes. You will also optimize and operate our telemetry service which receives data from our worldwide fleet of clusters. You will operate and continuously improve our services’ reliability, scalability, performance, security, and uptime. Through evaluating new tools and technologies, you will help to implement those that better our service. We are extremely collaborative, have rigorous execution standards, and strive for continuous improvement.
- Previous experience running 24 x 7 production operation for customer facing services, including on call duties.
- Experience in systems automation and infrastructure as code, utilizing modern tooling for workflow automation and CI/CD.
- Ability to monitor systems utilizing industry standard tools
- Experience implementing container/container-fleet-orchestration technologies such as Kubernetes, ECS, or Docker.
- Can effectively use scripting languages such as Python / Ruby / Bash
- Proven experience in Linux system administration and troubleshooting.
- Proficiency in a cloud service such ase AWS, Azure, or GCP.
Qumulo is an Equal Opportunity Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, age, disability, military status, or national origin or any other characteristic protected under federal, state, or applicable local law.
Employment is contingent upon successful completion of background screening
Qumulo at a glance
Qumulo focuses on Enterprise Software, Storage, and Big Data. Their company has offices in Seattle. They have a large team that's between 201-500 employees. To date, Qumulo has raised $222.3M of funding; their latest round was closed on June 2018.