Site Reliability Engineer (SRE) - Cloud Platform
$110k – $150k • 0.1% – 0.3%
We are looking for a highly motivated and experienced Senior Site Reliability Engineer (SRE) to join our rapidly growing engineering team.
As a critical member of our Engineering team, you will be responsible for building out and optimizing the Aware SaaS production infrastructure, scaling and monitoring cloud services, and developing automation to extend the platform. You will work closely with both product and machine learning software engineers to design infrastructure and coordinate the production deployment of Aware platform assets. This is a full-time, fully remote position.
Spend 50% of your time on project work: building new tools, solutions, and reusable script templates for the cloud service(s)
Monitor the availability, latency, performance, and efficiency of Aware production cloud service(s) with best-in-class technology
Lead the implementation and successful launch of new features and capabilities
Automate build and release management, configuration management, and monitoring systems with code
Forecast demand and plan capacity for cloud infrastructure
Be accountable for communicating production reliability metrics and changes to internal and external stakeholders
Mentor and share knowledge with other engineering team members
Share on-call duty and rotation with Engineering team members
Proficient in one or more programming languages (e.g., C#, Java, Python, C++)
5+ years of hands-on experience coding and deploying solutions on cloud platforms (e.g., Azure, AWS, Google Cloud)
3+ years of experience with container technology deployment and cloud services configuration management
3+ years of experience with Linux
Experience working on fast-paced, agile development teams and customer-satisfaction-focused product teams
Experience with on-call duty, production incident response, and root cause analysis (RCA)
Experience with log management tools and building monitoring dashboards
Experience managing SLAs and contributing to cross-functional teams
Cloud Native Computing Foundation solutions, Linux, Docker, Kubernetes, Elasticsearch, PostgreSQL, Service Bus, RabbitMQ, Redis, NoSQL storage, Git with integrated CI/CD systems.