DevOps Engineer
Senior DevOps Engineer with a strong background in infrastructure, compute, and storage automation to join our Storage and Compute Platform Management team. This is a contractor role focused on building scalable, reliable, and automated infrastructure systems that power our high-performance computing (HPC) and storage environments.
The successful candidate will play a key role in automating the provisioning, configuration, monitoring, and management of our compute and storage infrastructure, which supports multimegawatt CPU and GPU farms used for cutting-edge quantitative research and machine learning workloads. This is an exciting opportunity for someone passionate about infrastructure at scale, automation, and performance, with a forward-thinking mindset and a collaborative attitude.
Key Responsibilities
- Design, develop, and maintain automation frameworks for provisioning and managing HPC and storage infrastructure.
- Implement infrastructure-as-code and configuration management best practices to ensure consistency and repeatability. Collaborate with platform teams to improve scalability, reliability, and observability of systems.
- Troubleshoot performance, reliability, and scale issues across a variety of infrastructure components.
- Drive continuous improvement through automation, performance tuning, and capacity planning.
- Support the deployment and operations of distributed systems and services used across the organization.
The ideal candidate will have:
Extensive experience in infrastructure engineering, with a focus on compute and storage platforms in large-scale or high-performance environments.
A solid track record of leading and delivering successful technical infrastructure projects.
Strong experience with Python programming, particularly for automation, scripting, and systems integration.
Deep familiarity with CI/CD practices, pipelines, and tools (e.g., Jenkins, GitLab CI, ArgoCD).
Expertise in configuration management and infrastructure-as-code tools such as Ansible, Terraform, and Puppet.
Proven experience in monitoring and observability using tools such as Prometheus, Grafana, ELK stack, or similar.
Solid knowledge of Linux system administration and networking fundamentals.
Hands-on experience with containerization and orchestration platforms (Docker and Kubernetes).
Familiarity with public cloud services (AWS, Azure, GCP) and hybrid infrastructure models.
Exposure to HPC (High Performance Computing) environments and/or large-scale storage infrastructure is highly desirable.
A proactive and collaborative mindset, with a focus on continuous improvement and innovation.
Recommended Jobs
Medical Case Coordinator
Job Description Job Description Job Title: Medical Case Coordinator Location: Schertz, TX (Onsite) Job Type: Contract-to-hire Pay: $18.00 - $23.00 / Hourly Benefits: This position …
Electrical Engineer 4, Black & Veatch Corporation, Houston, TX
Duties Function as a technical specialist in a lead substation design role encompassing distribution, subtransmission and transmission level voltage classes. Apply advanced engineering techniques …
Outside Sales Professional
We are dedicated to enhancing not only the lives of our valued clients but also the personal and professional growth of our people. Weve grown 32% since last year, creating a need to expand our sales …
Utility Worker
Job Description Job Description Hill Country Transit District Job Description Job Title: Utility Worker Department: Maintenance Reports To: Director of Maintenance FLSA Status: Non-Ex…
Line Cook
Job Description Job Description Line Cook – Cypress at Fountain Place Private Members Club Job Description Job Title: Line cook Type: Full-Time Location: Dallas, TX About Us: Join …
Automotive Technician - Belton, TX
External Description Now Hiring - New Opportunities Apply Today, Interview Tomorrow! Very Competitive Experience Based Pay + Full Benefit Package! Walk In Interviews Welcome! Store#4246 Addre…
Leadership & Lifestyle Coach - Remote
Join our expanding global team as a Leadership & Lifestyle Coach (Remote) in the fast-growing personal development sector. We’re seeking talented professionals to help drive international growth. …
Manager of Quality Systems
For us, working at Safran is more than just a job; it's a passion. There's the unique opportunity to lead the way in aerospace and defense and contribute to creating a safer and more sustainable worl…
.NET Developer
Roles&Responsibilities: Full Stack Developer (.NET + React)Experience Required: 3+ Years (Hands-on)Type: Full-Time(Long term)Mandatory Technical Skills (3+ Years Hands-on):Backend:C / .NET (latest ver…
Planning & Inventory Supervisor
THE COMPANY Join a highly respected Tier 1 automotive manufacturer known for its commitment to operational excellence, advanced manufacturing, and quality-driven culture. With a global customer base …