DevOps Engineer
Job Description (Summary of Responsibilities):
· Cloud Infrastructure Management: Design, implement, and manage cloud-based infrastructure on AWS and Azure, ensuring optimal scalability, performance, and security.
· CI/CD Pipeline Development: Develop and maintain CI/CD pipelines using GitHub Actions for automated code deployments and testing.
· System Monitoring and Incident Management:
  · Implement and configure Datadog for comprehensive system monitoring.
  · Develop and maintain Datadog dashboards to visualize system performance and metrics.
  · Set up proactive alerts in Datadog to detect and respond to incidents swiftly, ensuring high system reliability and uptime.
  · Conduct root cause analysis of incidents and implement corrective actions using Datadog insights.
· Collaboration with AI Teams: Work closely with AI teams to support the operational aspects of LLMs, including deployment strategies and performance tuning.
· Infrastructure as Code (IaC): Implement IaC using tools like Terraform or AWS CloudFormation to automate infrastructure provisioning and management.
· Container Orchestration: Manage container orchestration systems such as Kubernetes or AWS ECS.
· Operational Support for LLMs: Provide operational support for LLMs, focusing on performance optimization and reliability.
· Scripting and Automation: Utilize scripting languages such as Python and Bash for automation and task management.
· Security and Compliance: Ensure compliance with security standards and best practices, implementing robust security measures.
· Documentation: Document system configurations, procedures, and best practices for internal and external stakeholders.
· DevOps Collaboration: Work with development teams to optimize deployment workflows, introduce best practices for DevOps, and improve overall efficiency.
· Technology and Industry Awareness: Stay up to date with emerging technologies and industry trends, and suggest improvements and upgrades based on them.
Qualifications and Skills Required:
· Extensive experience with AWS and Azure cloud platforms.
· Proficiency in developing CI/CD pipelines using GitHub Actions.
· Strong experience with Datadog for system monitoring, including implementation, configuration, and maintenance.
· Demonstrated ability to create and maintain Datadog dashboards for performance visualization.
· Proven expertise in setting up alerts and conducting incident response with Datadog.
· Hands-on experience with container orchestration systems such as Kubernetes or AWS ECS.
· Proficiency in Infrastructure as Code (IaC) tools like Terraform or AWS CloudFormation.
· Familiarity with operational aspects of Large Language Models (LLMs) is highly desirable.
· Strong scripting skills in Python, Bash, or similar languages.
· In-depth knowledge of security standards and best practices.
· Excellent documentation skills.
· Proven ability to work collaboratively with development and AI teams.
· Commitment to staying current with industry trends and emerging technologies.