SRE Architect Job at Advantage Solutions, Saint Louis, MO

  • Advantage Solutions
  • Saint Louis, MO

Job Description

As an SRE Architect specializing in DevOps, monitoring, and diagnostics, you will play a critical role in ensuring the reliability, availability, and performance of our mission-critical services. You will design and implement end-to-end monitoring solutions, build observability pipelines, and help create scalable systems for proactive incident detection, diagnostics, and root cause analysis. In this role, you will work closely with engineering, product, and operations teams to drive a culture of reliability and continuous improvement.

Monitoring & Observability:

  • Design and implement comprehensive monitoring and alerting solutions for production systems across multiple environments (cloud, on-prem, hybrid).
  • Develop and refine metrics collection and visualization strategies using tools like Prometheus, Grafana, OpenTelemetry, and others.
  • Build dashboards and custom monitoring solutions to ensure system health, performance, and security.
  • Establish SLIs (Service Level Indicators), SLOs (Service Level Objectives), and SLAs (Service Level Agreements) to align with business goals.

Incident Management & Diagnostics:

  • Develop and implement tools and systems for real-time diagnostics and root cause analysis during incidents.
  • Lead post-mortem analysis and drive remediation of systemic issues to prevent future incidents.
  • Design diagnostic tools and automation to reduce mean time to detection (MTTD) and mean time to resolution (MTTR).
  • Collaborate with engineering teams to define monitoring standards and ensure that new features and services meet reliability and observability requirements.

System Design & Architecture:

  • Architect scalable, resilient, and highly available systems with observability baked in from the start.
  • Apply SRE principles to design and optimize services for reliability, availability, and performance.
  • Identify and address single points of failure, bottlenecks, and other operational risks in production environments.

Automation & Tooling:

  • Create, maintain, and improve automation tools that enhance monitoring, diagnostics, and incident response.
  • Integrate monitoring and observability tools into CI/CD pipelines for proactive issue detection and remediation.
  • Contribute to the development of custom diagnostic tools for troubleshooting complex, distributed systems.

Collaboration & Knowledge Sharing:

  • Collaborate with software engineering, platform engineering, and DevOps teams to ensure seamless integration of monitoring and diagnostics practices.
  • Mentor and coach junior SREs and other team members on best practices for observability and incident management.
  • Stay up to date with the latest industry trends and innovations in monitoring, diagnostics, and reliability engineering.

Education & Training Experience:

  • Experience with advanced observability techniques, such as synthetic monitoring, canary deployments, and feature flags.
  • Certification in cloud platforms (AWS, GCP, Azure) or monitoring tools (e.g., Prometheus Certified Associate).
  • Previous experience in an SRE or DevOps leadership role.
  • Knowledge of serverless architecture, microservices, and edge computing environments.
  • Strong experience in distributed systems, cloud platforms (AWS, GCP, Azure), and containers and container orchestration (Docker, Kubernetes).
  • Deep knowledge of monitoring tools such as Datadog and Cloud Monitoring.
  • Proficient in instrumentation techniques (e.g., OpenTelemetry, StatsD, custom metrics).
  • Experience with log aggregation and analysis tools like ELK Stack (Elasticsearch, Logstash, Kibana), Splunk, or similar.
  • Expertise in alerting and notification systems, including PagerDuty, Opsgenie, or VictorOps.

Architect position

This is an individual contributor position.

Travel required: 5%
