My Shortlist

Your shortlisted jobs will appear here. To view your shortlist: Login Or Register

More Jobs Like This
Date Added: Wed 09/06/2021

Site Reliability Engineer (SRE) With Google Cloud Platform

Memphis, TN, US
Add To Shortlist Apply Now


Job Type: Permanent, FullTime

COOK SYSTEMS has an immediate need for a Site Reliability Engineer (SRE) with Google Cloud Platform Site Reliability Engineers (SREs) are responsible for maintaining the availability and uptime of infrastructure. SREs use software engineering principles to solve operational challenges to create reliable infrastructure. We try to reduce the toil from our everyday work using as much automation as possible. Responsibilities Implement SRE principles and practices across organization to improve performance and efficiency Research and implement solutions to build an always-up, always-available, resilient services Integrate and automate existing manual solutions and processes Participate in an on-call rotation for availability incidents Plan for growth and capacity of the infrastructure Troubleshoot and support productions issues Participates on cross functional company project teams responsible for implementing technology. Investigates anomaliesoutages and determines steps to reproduce, root cause, and solutions options. Monitors environment performance and provides all necessary reporting analysis. Attends relevant conferenceseminars to remain current on new and upcoming technology. Skills Requirements - Candidates must have Google Cloud Platform experience - Google Cloud Platform cloud architect and Certified Kubernetes Administrator certifications will also help a candidate stand out Hands-on experience with cloud service providers(at least one of Google Cloud Platform, AWS) Hands-on experience with at least one configuration management software (AnsibleChefPuppet) Experience with setting up Logging (e.g. ELK) and Monitoring(e.g. Prometheus) solutions Working knowledge of containers and any one container orchestration platform(KubernetesNomadMesosSwarm) Understanding and experience in at least one CICD pipeline (JenkinsTravisCircleCIGitlab etc.) Good understanding of UnixLinux operating systems and its internals Well-versed with Linux CLI Apart from shell scripting(shbash), proficient with one other programming language(PythonRubyGoPerl) Working knowledge of any one distributed version control systems (gitbzrhg) Ability to write good technical user document Exposure to managing Infrastructure as Code with tools like TerraformCloudFormation or using Cloud Provider SDKs Experience Requirement At least 4-6 years of hands-on DevOpsSRE experience At least 1-2 years of experience developing code, either maintaining scripts or applications eoe
Apply Now