SRE Architect
Job Description: SRE Architect
Location: Austin, TX Hybrid)
Employment Type: Full-Time
Experience Level: Architect
Role Overview
We are seeking an experienced Site Reliability Engineer (SRE) Architect to design, build, and scale highly reliable, resilient, and observable systems. This role is ideal for a hands-on architect who can define SRE strategy, influence engineering practices, and partner closely with development, platform, and security teams.
The position requires onsite or hybrid presence in Austin, TX , with collaboration across distributed teams.
Key Responsibilities
Architecture & Reliability
- Define and own the SRE architecture strategy , including reliability, availability, scalability, and performance standards.
- Design resilient, fault-tolerant systems for cloud-native and hybrid environments .
- Establish and govern SLIs, SLOs, and error budgets across platforms and services.
- Lead capacity planning, resilience testing, and chaos engineering initiatives .
Platform & Cloud Engineering
- Architect and operate platforms on AWS/GCP/Azure (multi-cloud or hybrid setups).
- Design and manage Kubernetes-based platforms (EKS/GKE/AKS).
- Drive Infrastructure as Code (IaC) practices using Terraform, Ansible , or similar tools.
- Standardize environments, deployment patterns, and runtime configurations.
Operational Excellence
- Build and maintain observability frameworks using tools such as Prometheus, Grafana, Datadog, ELK, Splunk, or equivalent.
- Lead incident management , root cause analysis (RCA), and post-incident reviews.
- Reduce MTTR through automation, tooling, and process improvements.
- Participate in and improve on-call models , escalation policies, and runbooks.
DevOps & Automation
- Partner with engineering teams to embed CI/CD best practices .
- Drive automation across provisioning, deployments, testing, and operations.
- Improve system reliability by eliminating manual operational toil.
Security & Governance
- Architect secure platforms aligned with enterprise security standards.
- Implement best practices for secrets management, access control, compliance, and audits .
- Collaborate with Security and Compliance teams on governance models.
Leadership & Collaboration
- Act as a technical mentor and thought leader within SRE and platform teams.
- Influence engineering culture toward reliability-focused design.
- Partner with product, application, and infrastructure teams to deliver business outcomes.
Required Qualifications
- 10+ years of experience in SRE, DevOps, Platform Engineering, or Systems Architecture .
- Strong experience designing and operating large-scale distributed systems .
- Deep hands-on expertise with cloud platforms (AWS/GCP/Azure) .
- Advanced experience with Kubernetes and containerized workloads .
- Strong knowledge of Linux internals, networking, storage, and system performance .
- Proven experience implementing IaC and configuration management .
- Proficiency in one or more programming/scripting languages (Python, Go, Bash, etc.).
- Strong understanding of observability, monitoring, and alerting strategies .
- Excellent communication and stakeholder management skills.
Preferred Qualifications
- Experience in multi-cloud or regulated environments .
- Background supporting high-throughput, high-availability, or data-intensive systems .
- Experience with Kafka, Spark, or large-scale data platforms .
- Exposure to fintech, healthcare, enterprise SaaS, or hyperscale platforms .
- Prior experience as Principal Engineer, Architect, or Lead SRE .
Work Model
- Hybrid / Onsite role based in Austin, TX
- Requires regular collaboration with local and global teams
Why Join Us
- Architect systems at enterprise scale
- Influence platform and reliability strategy across teams
- Work with modern cloud-native technologies
- High-impact role with strong visibility and ownership
Recommended Jobs
Software Engineer III - C++/QT
TCP is committed to cultivating a diverse and inclusive team. However, we are not able to sponsor visas for this role. About TCP (TimeClock Plus): For more than 30 years, TCP has helped organiz…
PRN Registered Nurse
Job Description Job Description About Discovery At Home Discovery At Home is part of the Discovery Senior Living family of companies, a recognized industry leader for performance, innovation a…
Marketing Team Leader
**Position Overview:** We are seeking an innovative and results-driven Marketing Team Leader to spearhead our marketing initiatives and drive brand awareness. The ideal candidate will possess strong l…
Food Photographer for Restaurant Shoots in San Antonio, TX
The Role : We are seeking a talented Photographer to capture stunning images for restaurant shoots in San Antonio, TX . The ideal candidate will work closely with restaurant owners and chefs to…
Part-Time Sales Receptionist
About Milan Laser Hair Removal Milan Laser Hair Removal is one of the nation’s premier laser hair removal providers. That’s because we only use top of the line lasers, and all our treatments are perf…
Production Team Partner - Linen Bagger & Folder - UniFirst
Our Production Team is Kind of a Big Deal! UniFirst is seeking a reliable and hardworking Production Team Partner to join our UniFirst Family. As a Team Partner in the Production Department, you wi…
Warehouse Devices Deployment Technician
Job Responsibilities: Diagnosing and troubleshooting mechanical, configuration, and compatibility issues with equipment Maintain and calibrate printers used for labeling and documentation purpo…
Sales, Strategic Accounts Director - Image Guided Therapy (South)
JOB DESCRIPTION Job Title Sales, Strategic Accounts Director - Image Guided Therapy (South) Job Description The Director of IGT Strategic Accounts will work with 2 IGTS Districts and b…
Paralegal - Trial Litigation (HOUSTON)
Job ID#: 35057 Galleria firm is growing and adding a trial paralegal to the team duties include document review, case evaluation, document production, organization and maintenance of large amou…
CUP Manager (Hiring Immediately)
CUP Manager Position Summary: We are seeking a skilled and motivated CUP Manager to serve as a supervisor over a plant operations and repair team. The CUP Manager provides the necessary o…