Senior AWS Cloud Site Reliability Engineer (SRE) with AWS Database experience
Program Overview
About The Role
We are seeking an experienced and motivated Senior AWS Cloud Site Reliability Engineer (SRE) to join our dynamic team. As an AWS Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud infrastructure on Amazon Web Services (AWS). The ideal candidate will have a strong background in AWS services, a deep understanding of infrastructure as code, and deep expertise with relational databases. The AWS Site Reliability Engineer (SRE) will collaborate closely with cross-functional teams, including development, quality assurance, and operations, to ensure seamless software releases and continuous improvement of our release processes.
What you will do:
- Infrastructure Automation: Design, implement, and manage infrastructure as code (IaC) solutions using tools like AWS CloudFormation, Terraform, or Helm Charts to automate continuous database deployment and scaling processes. Collaborate with development teams to integrate continuous deployment practices and ensure the reliability of applications and databases.
- Monitoring and Alerting: Implement robust monitoring and alerting systems to proactively identify and address potential issues before they impact system performance. Analyze system metrics, logs, and alerts to troubleshoot and resolve issues promptly.
- Performance Optimization: Conduct performance analysis and optimization of AWS infrastructure components to enhance system efficiency and reduce latency. Identify and implement improvements to enhance system reliability and resilience.
- Incident Response: Participate in on-call rotations to respond to and resolve incidents promptly. Conduct post-incident reviews to identify root causes and implement preventive measures.
- Security and Compliance: Work closely with security teams to implement and enforce best practices for securing AWS environments. Ensure compliance with industry standards and regulations related to cloud infrastructure.
- Communication: Facilitate clear communication across teams, providing updates on release status, known issues, and any potential impact on stakeholders. Coordinate communication of release schedules and changes to all relevant parties.
- Release Planning and Coordination: Collaborate with development, QA, and operations teams to plan and coordinate database schema releases. Define release scope, schedule, and dependencies to ensure timely and smooth deployments. Create and submit change records as required for process and audit compliance. Participate in Technical Change Advisory and Review boards as required.
- Release Automation: Develop and maintain automated deployment pipelines using industry-standard tools such as GitLab CI/CD, Liquibase, or similar. Automate and streamline release processes to improve efficiency and reduce manual errors.
- Continuous Improvement: Proactively identify areas for process improvement within the release management lifecycle. Implement feedback loops to capture lessons learned from each release and apply improvements iteratively. Stay up to date with industry best practices, emerging technologies, and trends related to database automation.
- Quality Assurance: Collaborate with QA teams to establish and execute release validation procedures. Ensure releases are thoroughly tested and meet quality standards before deployment. Drive continuous improvement by analyzing release management trends, identifying recurring issues, and working with teams to implement solutions.
Qualifications
Required Qualifications:
- Bachelor's degree and 8 years of experience, or a high school diploma and 12 years of experience.
- Proven experience as a Site Reliability Engineer or in a similar role, with a strong emphasis on relational databases.
- In-depth knowledge of AWS services like RDS and DynamoDB and expertise in managing cloud infrastructure.
- Advanced-level programming and/or scripting in 3 or more of the following languages and tools: Python, Java, Chef, Helm, Playwright, Bash, JavaScript, Terraform.
- Strong understanding of DevOps principles and continuous integration/continuous deployment (CI/CD) pipelines.
- Proficiency in CI/CD tools such as GitLab CI/CD, Liquibase, or others.
- Familiarity with infrastructure as code (IaC) tools like CloudFormation, Terraform, Helm Charts, or similar technologies.
- Hands-on experience with version control systems (GitLab, GitHub, AWS CodeCommit) and branching strategies.
- Experience with containerization and orchestration tools (e.g., Amazon Elastic Container Service (ECS), Amazon Elastic Kubernetes Service (EKS), Docker, Kubernetes).
- Familiarity with monitoring tools (e.g., CloudWatch, Prometheus, Grafana, Datadog) and log analysis.
- Attention to detail, with a focus on maintaining high-quality software releases.
- Solid understanding of Agile methodologies and their application in release management.
- Excellent problem-solving and troubleshooting skills.
- Strong communication and collaboration skills.
- Must be a US citizen.
- Must be able to obtain and maintain the required agency clearance (6C Public Trust).
Preferred Qualifications:
- Relevant certifications in DevOps or related fields are a plus.
- High Risk Public Trust or Secret Clearance preferred.
- 3 or more years in an SRE or Platform Engineering group supporting high-availability or critical platforms and applications.
- 2 or more years managing relational databases.
SCA / Union / Intern Rate or Range
Details
Target Salary Range: $104,000 - $166,000. This represents the typical salary range for this position. Salary is determined by various factors, including but not limited to, the scope and responsibilities of the position, the individual’s experience, education, knowledge, skills, and competencies, as well as geographic location and business and contract considerations. Depending on the position, employees may be eligible for overtime, shift differential, and a discretionary bonus in addition to base pay.
Benefits Statement: Peraton offers eligible employees a variety of benefits including medical, dental, vision, life, health savings account, short/long term disability, EAP, parental leave, 401(k), paid time off (PTO) for vacation, and company paid holidays. A full listing of available benefits can be viewed at
Application Duration Statement: The application period for the job is estimated to be 30 days from the job posting date. However, this timeline may be shortened or extended depending on business needs and the availability of qualified candidates.
EEO: Equal opportunity employer, including disability and protected veterans, or other characteristics protected by law.