Home > Find Jobs

Job Search

A tropical beach
Propelus company logo

Propelus

USA

Posted on: 05 November 2023

Experience

n/a

Work

n/a

Employee Type

n/a

Salary Range

n/a

Site Reliability Engineer

Propelus is modernizing how professionals, their employers, regulators, and associations work better together. For over 20 years, Propelus solutions — CE Broker, EverCheck, and Immuware — have propelled the progress of millions of dedicated professionals in their career journey. Our market-leading workforce compliance management technology, full-lifecycle continuing education software, and vital data simplify total professional management for a happier workforce, better operations, and safer communities.

We are looking for a Site Reliability Engineer (SRE) to build out, maintain, and automate our rapidly expanding infrastructure.

The SRE will be part of a talented team of engineers that demonstrate superb technical competency, delivering mission critical infrastructure and ensuring the highest levels of availability (24x7x365), performance and security of our Clear to Work healthcare platform. The ideal candidate will have a background in IT, computer systems engineering, or systems engineering and analysis.

Day-to-Day Responsibilities:

  • Be on an on-call (PagerDuty) rotation to respond to incidents that impact business services and provide support for service engineers with customer incidents.

  • Use your on-call shift to prevent incidents from ever happening.

  • Build monitoring that alerts on symptoms rather than on outages and proactively ensures the highest levels of systems and infrastructure availability.

  • Document every action so your findings turn into repeatable actions and then into automation (create necessary runbooks).

  • Monitor application performance to detect potential bottlenecks, identify possible solutions, and work with developers to implement recommendations.

  • Follow security best practices and maintain capacity provisioning and redundancy strategies.

What you bring to the team:

  • BS/MS degree in Computer Science, Engineering, or equivalent professional experience

  • Proven 2+ years of working with an on-call dynamic, conducting Incident procedures, and communication. Including the ability to triage and resolve issues and being responsible for the stability and performance of critical business services

  • Proven 2+ years doing performance monitoring and diagnosing service disruptions with APM, Log management systems, and RUM, using tools such as New Relic, Dynatrace, Datadog, Splunk, Logrocket, or out-of-the-box AWS monitoring services.

  • Proven 3+ years of working experience in Software Development or Operations positions applying DevOps principles

  • Proven 2+ years of working with Scrum methodology + project management (technical definition, planning, estimation, and execution of projects)

  • Proven working experience configuring, troubleshooting, and monitoring UNIX /Linux and containerized (docker, k8s) distributed systems. 

  • Proven experience with Configuration Management Tools (Ansible, Chef, AWS SSM)

  • Intermediate Cloud experience with Production Workloads in AWS

  • Intermediate programming/scripting skills (e.g., Shell, Python, Javascript)

  • Experience with Infrastructure as Code (Terraform, Pulumi, or Cloudformation) 

  • Critical thinking, problem solving, proactive, and analytical skills.

  • Proven experience defining and implementing applications reliability targets with Service Level Objectives (SLOs) based on Service Level Indicators (SLIs)

  • Commitment to the highest standards of personal and business ethics and conduct.

Benefits and Perks for Propelus employees include but are not limited to:
  • Awarded one of BuiltIn's 2023 Best Place to Work and 7 years running by Outside Magazine!

  • Professional development allowance to help you grow in the ways that mean the most to you.

  • Flexibility for balancing work with the rest of life and ample PTO, including paid time off for volunteering and for becoming a new parent.

  • 401K with company matching, as well as financial planning education and resources.

  • Employees choose from HSA, FSA and traditional insurance options for medical, dental, and vision coverage for themselves and dependents.

  • Wellness benefits - we’ll help you pay for fitness endeavors and organic produce delivery services.

  • Check us out for yourself at our careers page or our Propelus culture Instagram accounts.

We are an equal opportunity employer and value diversity at Propelus. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. Candidates from all backgrounds are encouraged to apply.

This position is scheduled to work 40 hours per week, M-F unless required otherwise by projects. This job is open to candidates authorized to work in the US and located within US borders.

Tags

AWS
cloud
docker
javascript
python
Apply to job