  • Location:

    Chattanooga, Tennessee

  • Job title:

    Site Reliability Engineer

  • Sector:

    Technology

  • Job type:

    Direct Hire

  • Job ref:

    6671

The Site Reliability Engineer will drive the design, development, maintenance, and support of the Platform Infrastructure on Amazon Web Services. The client uses a wide range of tools and applications in the Platform Infrastructure, all of which must be configured and managed to remain highly available.

This role requires a strong technical understanding of AWS, Kubernetes, Helm, and Terraform, along with a solid understanding of web application operation, networks, and network security. The Site Reliability Engineer must also communicate effectively and work cross-functionally with the Software Development team on the design and deployment of new applications and services, as well as on troubleshooting and resolving issues.

 

Responsibilities

  • Design, configure, and maintain development, test, and production cloud environments on a global enterprise scale.
  • Provide production support and troubleshooting for global production instances.
  • Administer, secure, and maintain multiple Kubernetes clusters to support a global SaaS service.
  • Assist product engineers in development and deployment of backend applications.
  • Build out and augment infrastructure services such as monitoring, logging, VPNs, and automated load testing.
  • Evaluate and develop new technology that contributes to the availability, quality, and automation of key systems.
  • Conduct periodic on-call duties on a scheduled rotation.
  • Other duties as required.

 

Qualifications

  • BS in Computer Science or a related technical field, or equivalent practical experience.
  • Proficiency in Go, Python, JavaScript, or an equivalent language.
  • 2+ years of experience managing production web software stacks.
  • Strong experience with script development and Linux/Unix systems.
  • Experience with containers, schedulers, and service-oriented architecture.
  • Experience with highly available, globally distributed systems.
  • Solid working understanding of network security.
  • Strong interpersonal, presentation, written and verbal communication skills.
  • Ability to respond quickly and manage time effectively in a fast-paced, dynamic environment.
  • Working experience with agile processes.
  • Ability to understand and solve complex problems.
  • Project management skills and attention to detail.


#LI-KO
#LI-REMOTE

 

ehire.com/jobs

A Human Approach to Staffing

Our Company is committed to the principles of equal employment. We are committed to complying with all federal, state, and local laws providing equal employment opportunities, and all other employment laws and regulations. It is our intent to maintain a work environment which is free of harassment, discrimination, or retaliation because of sex, gender, race, religion, color, national origin, physical or mental disability, genetic information, marital status, age, sexual orientation, gender identity, military service, veteran status, or any other status protected by federal, state, or local laws. The Company is dedicated to the fulfillment of this policy in regard to all aspects of employment, including but not limited to recruiting, hiring, placement, transfer, training, promotion, rates of pay, and other compensation, termination, and all other terms, conditions, and privileges of employment.