Sr. Site Reliability Engineer

Qode
Texas

Role: Sr. Site Reliability Engineer (SRE) – Unified Observability & AIOps

Location: Austin, TX / Fort Mill, SC (Hybrid)

Job Type: Full Time

Role Summary

We are seeking a Senior SRE with strong expertise in Unified Observability, proactive detection, AIOps, and GenAI-driven operations to support complex, distributed financial services platforms. The role requires hands-on experience designing SLI/SLO-driven monitoring , dynamic thresholds , intelligent alerting , and AI/ML-based anomaly detection across multi-stream architectures.

Key Responsibilities

Observability & Reliability Engineering

  • Design and implement unified observability dashboards across metrics, logs, traces, events, and topology
  • Define and manage SLIs, SLOs, and error budgets aligned to business outcomes
  • Build actionable dashboards for operations, engineering, and leadership
  • Implement alerting strategies using static and dynamic thresholds

Proactive Detection & AIOps

  • Leverage AI/ML/AIOps to detect anomalies, predict incidents, and reduce MTTR
  • Transition monitoring from reactive alerts to proactive insights
  • Implement noise reduction, alert correlation, and root cause analysis
  • Apply baseline modeling, seasonality detection, and anomaly scoring

Distributed Systems & Dependency Analysis

  • Monitor and troubleshoot multi-service architectures involving:
  • Microservices
  • Downstream APIs
  • Kafka / streaming platforms
  • Cloud infrastructure (Terraform, IaC)
  • Identify whether issues originate from:
  • Upstream/downstream dependencies
  • Streaming platform
  • Infrastructure
  • Application code

Tooling & Platforms

  • Deep hands-on experience with Dynatrace (mandatory)
  • Experience with:
  • OpenTelemetry
  • Prometheus / Grafana
  • ELK / EFK
  • Cloud-native monitoring (AWS/Azure/GCP)
  • Strong JSON-based telemetry manipulation and enrichment

GenAI & LLM Enablement

  • Apply GenAI / LLMs for:
  • Incident summarization
  • Root cause explanation
  • Runbook recommendations
  • Auto-remediation suggestions
  • Collaborate with platform teams to operationalize GenAI safely

Required Skills & Experience

✅ 15+ years in SRE / Production Engineering

✅ Strong Unified Observability background (not infra-only)

✅ Hands-on Dynatrace experience (metrics, traces, logs, Davis AI)

✅ SLI/SLO engineering experience in production systems

✅ Experience implementing dynamic thresholds and anomaly detection

✅ Knowledge of AI/ML concepts applied to Ops (AIOps)

✅ Distributed systems troubleshooting expertise

✅ Experience with Kafka or streaming data platforms

Differentiators (Highly Valued)

  • Experience in financial services or regulated environments
  • Proven reduction of alert noise and MTTR using AIOps
  • GenAI / LLM integration into operations workflows
Posted 2026-04-21

Recommended Jobs

Transmission Technician

Randall Reed's Planet Ford 635
Garland, TX

The responsibilities of a Transmission Technician include diagnosing, maintaining, repairing, rebuilding and installing transmissions on customer vehicles. The ideal candidate will have proper certifi…

View Details
Posted 2025-08-28

Associate Athletics Director for Ticket Sales and Retention

University of Texas at San Antonio
San Antonio, TX

: Location: San Antonio, TX Regular/Temporary: Regular Job ID: 13520 Full/Part Time: Full Time Org Marketing Statement The University of Texas at San Antonio is a Tier One research universi…

View Details
Posted 2026-04-21

Doors and Windows Warranty Service Technician

UTS
Plainview, TX

UTS, LLC is looking for experienced independent contractors to perform warranty service work for door and window manufacturers and retailers across the United States. Our company brokers inspection, r…

View Details
Posted 2026-04-09

Cashier (Bookkeeping)

Huntsville Memorial Hospital
Huntsville, TX

POSITION PURPOSE Under general supervision of the Director, the Cashier performs detailed processing of all documents relating to the payment of hospital debts and general accounting duties L…

View Details
Posted 2026-02-03

OTR Company Driver Class A

CDN Logistics, Inc.
Dallas, TX

$5,000 Sign-On Bonus CDN Logistics is a Veteran-Owned 53ft dry van company based out of Northlake, IL. We are currently hiring Class A CDL OTR Company Drivers to join our fleet. Reach out t…

View Details
Posted 2026-04-15

Senior Loan Processor

M/I Homes
Dallas, TX

M/I Homes has been building new homes of outstanding quality and superior design for more than 40 years. Founded in 1976 by Irving and Melvin Schottenstein and guided by Irving’s drive to always “tre…

View Details
Posted 2026-01-15

Backend Software Engineer

SGS Consulting
Texas

Job Responsibilities: Collaborate with cross-functional teams and partners to design and build scalable backend services and APIs. Help partners integrate their apps/services on to Client's fam…

View Details
Posted 2025-11-14

Patient Financial Specialist Lead-Business Office

CHRISTUS Health
San Antonio, TX

Description Summary: The associate is responsible for the duties and services that are of a support nature to the Revenue Cycle division of CHRISTUS Health. The associate ensures that all proc…

View Details
Posted 2026-04-03

Pediatric Echo Sonographer (Baytown)

Houston Methodist Baytown Hospital
Baytown, TX

At Houston Methodist, the Pediatric Echo Sonographer position is responsible for performing a range of routine to complex echocardiograms including but not limited to Transthoracic Echocardiogram or T…

View Details
Posted 2026-04-06

Anesthesiologist Dallas TX

HEALTHCARE RECRUITMENT COUNSELORS
Dallas, TX

Anesthesiologist Dallas (DFW) TX $715k+ potentially exceeding $770k total compensation plus a dditional productivity incentives and profit-sharing $25k Sign on Bonus We are l…

View Details
Posted 2026-04-15