Avatar for Aware

Enabling collaboration for everyone through security, compliance and insight

Site Reliability Engineer (SRE) - Cloud platform

$110k – $150k • 0.1% – 0.3%
Apply now

We are looking for a highly motivated and experienced Senior Site Reliability Engineer (SRE) to join our rapidly growing engineering team.

As a critical member of our Engineering team, you are responsible for building out and optimizing Aware SaaS production infrastructure, scaling and monitoring cloud services, and developing solutions to extend the platform with automation. You will work closely with both product and machine learning software engineers to design infrastructure and coordinate the production deployment of Aware platform assets. This position is a full-time, full-remote opportunity.


50% time in project works - building new tools, solutions, and reusable script templates for the cloud service(s)

Responsible for monitoring the availability, latency, performance, efficiency, Aware production cloud service(s) with best-in-class technology

Lead the implementation and success of launch on new features and capabilities.

Build & Release Management, Configuration Management and Monitoring system with code automations

Demand Forecasting and Capacity Planning for cloud infrastructures

Accountability in communicating with Internal and external stakeholders on production reliability metrices and changes

Mentor and and knowledge sharing with other engineering team members

Work with Engineering team members for on-call duty & rotation

About You

Proficient in one or more programming languages (e.g. C#, Java, Python, C++ )

5+ years of experience in hands-on coding and deploying solutions on cloud platforms (E.g., Azure, AWS, Google Cloud)

3+ years of experience with container technology deployment and cloud services configuration management

3+ years of experience with Linux

Experience with fast-paced, agile development team, and customers satisfaction focused product team

Experience with on-call, production incidents response and perform RCA

Experience with log management tools, building monitoring dashboard

Experience with manage SLA and contribute to the cross-function team

Aware Technologies

Cloud Native Computing Fundation solutions, Linux, Docker, Kubernetes, Elasticsearch, PostgreSQL, Service Bus, RabbitMQ, Redis, NoSQL Storage, Git with integrated CI/CD systems.

Hires remotely in
North America
Job type
Visa sponsorship
Not Available
5+ years
Hiring contact

James Tsai

Avatar for James Tsai

Aware at a glance

Enabling collaboration for everyone through security, compliance and insight

Aware focuses on Enterprise Software, Data Security, Software Compliance, and Cloud Security. Their company has offices in Columbus. They have a small team that's between 11-50 employees.

You can view their website at https://www.awarehq.com or find them on Twitter, Facebook, and LinkedIn.

More jobs at Aware

View all jobs

Data Engineer - Data Platform

Business Development Representative

Similar jobs to Site Reliability Engineer (SRE) - Cloud platform at Aware

Avatar for Pacific MGMT
working with top brands and athletes to deliver real results
Avatar for Vantage Point Logistics
Inbound freight data solutions for health care institutions and higher education
Avatar for Dispatch Goods
A system of trackable, reusable food containers
Avatar for Aware
Enabling collaboration for everyone through security, compliance and insight
Avatar for Goken India
To empower our associates to thrive in client environment and build better products