Site Reliability Performance Test engineer
Company: Cygnus Professionals Inc
Posted on: November 9, 2018
Hi, my name is Yateesh from Cygnus Professionals. I am looking for a Site Reliability Performance Test engineer in Austin, TX for one of my clients. Please find the information below. You can reach me on (Ext 9057).

Job Description:

Title: Site Reliability Performance Test engineer
Location: Austin, TX
Duration: Full Time

Preferred Qualifications:
- Proficient in production monitoring concepts and implementation, including synthetic, real user, application performance, system, log, time-series, and dashboarding, with tools such as AppDynamics, Dynatrace, New Relic, Splunk, Grafana, and ELK
- Proficient in production systems design, including High Availability, Disaster Recovery, Performance, Efficiency, and Security
- Proficient in a modern scripting language (preferably Python)
- Proficient in a modern infrastructure automation toolkit such as Puppet or Chef
- Proficient in a Linux or Unix based environment
- Deep understanding of modern microservices-based architectures and operations
- Experience in destructive testing methodologies and tools such as Chaos Monkey
- Experience in CI/CD automation
- Experience in a version control system such as Git or SVN
- Experience in a cloud computing platform and the associated automation patterns it provides
- Experience in defensive coding practices and patterns for high availability
- Exposure to a modern object-oriented programming language (preferably Java)

As a member of our Reliability Engineering team, you will be responsible for scaling some of the largest software products in Retail by automating the application infrastructure, deployment, and monitoring of those products in production. You will also be part of a 24x7 on-call team that leads the triage of incidents for your products, using your expertise to mitigate problems as soon as possible. Our "own what you build" mentality empowers you to make decisions quickly and deliver reliability improvements without the red tape that typically surrounds enterprise environments.
Our Reliability Engineering motto is: Enable Speed with High Availability. You should have a passion for automating as much as possible and constantly be on the lookout for areas where operational and code efficiencies can be improved. You will work directly with product engineering teams leveraging XP principles, and, when you aren't automating all the things, you will be proactively executing destructive tests and participating in "game day" exercises and related activities to improve the operational readiness of your product(s).

Major Tasks, Responsibilities, and Key Accountabilities:
- Writes custom code or scripts to automate infrastructure, monitoring services, and test cases
- Writes custom code or scripts to do "destructive testing" to ensure adequate resiliency in production
- Creates meaningful dashboards, logging, alerting, and responses to ensure that issues are captured and addressed proactively
- Identifies unsecured code areas and implements fixes as they are discovered, with or without tooling
- Contributes to foundational code elements that can be reused many times by a product
- Contributes to meaningful architecture diagrams and other documentation needed for security reviews or other interested parties
- Defines Service Level Objectives for product(s) to constantly measure their reliability in production and help prioritize backlog work
- Fields questions from other product teams or support teams
- Monitors tools and participates in conversations to encourage collaboration across product teams
- Provides application support for software running in production
- Proactively monitors production Service Level Objectives for product(s)
- Proactively reviews the Performance and Capacity of all aspects of production: code, infrastructure, data, and message processing
- Triages high priority issues and outages as they arise
- Participates in and leads learning activities around modern software design and development core practices (communities of practice)
- Proactively views articles, tutorials, and videos to learn about new technologies and best practices being used within other technology organizations
- Attends conferences and learns how to apply new technologies where appropriate
Keywords: Cygnus Professionals Inc, Austin, Site Reliability Performance Test engineer, IT / Software / Systems, Austin, Texas