AustinRecruiter Since 2001
the smart solution for Austin jobs

Site Reliability Engineer (SRE) 100% remote

Company: MissionStaff
Location: Austin
Posted on: January 16, 2022

Job Description:

Are you looking for a 100% remote SRE position, Look no further: Through powerful analytics, this company transforms data into intelligence, in a fast and efficient manner. Through leading-edge, proprietary technology and a massive data repository, their data and analytical solutions harness the power of data fusion, uncovering the relevance of disparate data points and converting them into comprehensive and insightful views of people, businesses, assets, and their interrelationships. Currently they are looking for a SRE to join their growing team.

The Senior Site Reliability Engineer is responsible for ensuring availability, minimizing latency, and maximizing performance, capacity and scalability of software services across multiple AWS accounts. This person will join a growing technical team, leveraging automation platforms and their subject matter expertise to ensure that systems are highly available, security compliant and performant.

What You Will DO:
Develop strategies for continuous monitoring and analysis to reduce both downtime and required manual intervention
Build nontrivial internal tooling to support and enable engineering workflows
Design and write automation that investigate how our infrastructure handles failure and scaling
Monitor the breadth of our full platform stack (hosts, applications, and performance)
Embrace and encourage the adoption of the DevOps culture and philosophies
Write and maintain detailed documentation, including architectural diagrams
Guide and mentor peers and colleagues on best practice approaches to full stack monitoring, log analysis, and infrastructure/application performance management).

What you Bring
3-5 years of experience with customer facing production environment(s) using containerization and orchestration tools
3-5 years of experience with building observability systems using products like Elastic Search, Logstash, Prometheus, Kibana and AWS CloudWatch
3-5 years of combined experience in SRE/DevOps or Software Development roles in a full stack engineering environment
Strong communication skills, confidently representing your expertise to peers and stakeholders across the organization
Must have experience enriching alerts for faster root-cause detection and incident resolution
Experience with Infrastructure as Code solutions, particularly Terraform/Terragrunt and/or AWS SAM/CloudFormation
Strong scripting experience in Bash and/or Python
Experience with configuration management software such as Ansible, Chef, Puppet or Salt
Experience in leveraging enterprise cloud monitoring frameworks such as Datadog, Blue Matador, NewRelic, etc.
Industry Certifications (AWS Solutions Architect Professional or DevOps Engineer Professional) a big plus

Unlimited PTO- typically comes to about 2-3 weeks, ranges with different managers
You'll be part of a culture you can be proud of. Friendly and inclusive - it's what makes them unique they support and help you from the moment you join.
They will work with you to make the right development choices for your career. The skills you gain will help you to get the most out of your time with them, and make you more marketable in the future.

The Offer
Competitive Salary: Up to $170K DOE

Keywords: MissionStaff, Austin , Site Reliability Engineer (SRE) 100% remote, Engineering , Austin, Texas

Click here to apply!

Didn't find what you're looking for? Search again!

I'm looking for
in category

Log In or Create An Account

Get the latest Texas jobs by following @recnetTX on Twitter!

Austin RSS job feeds