DevOps Engineer
Job Description (Summary of Responsibilities):
· Cloud Infrastructure Management: Design, implement, and manage cloud-based infrastructure on AWS and Azure, ensuring optimal scalability, performance, and security.
· CI/CD Pipeline Development: Develop and maintain CI/CD pipelines using GitHub Actions for automated code deployments and testing.
· System Monitoring and Incident Management:
· Implement and configure Datadog for comprehensive system monitoring.
· Develop and maintain Datadog dashboards to visualize system performance and metrics.
· Set up proactive alerts in Datadog to detect and respond to incidents swiftly, ensuring high system reliability and uptime.
· Conduct root cause analysis of incidents and implement corrective actions using Datadog insights.
· Collaboration with AI Teams: Work closely with AI teams to support the operational aspects of LLMs, including deployment strategies and performance tuning.
· Infrastructure as Code (IaC): Implement IaC using tools like Terraform or AWS CloudFormation to automate infrastructure provisioning and management.
· Container Orchestration: Manage container orchestration systems such as Kubernetes or AWS ECS.
· Operational Support for LLMs: Provide operational support for LLMs, focusing on performance optimization and reliability.
· Scripting and Automation: Utilize scripting languages such as Python and Bash for automation and task management.
· Security and Compliance: Ensure compliance with security standards and best practices, implementing robust security measures.
· Documentation: Document system configurations, procedures, and best practices for internal and external stakeholders.
· DevOps Collaboration: Work with development teams to optimize deployment workflows, introduce best practices for DevOps, and improve overall efficiency.
· Technology and Industry Awareness: Stay up-to-date with emerging technologies and industry trends to suggest improvements and upgrades.
Qualifications and Skills Required:
· Extensive experience with AWS and Azure cloud platforms.
· Proficiency in developing CI/CD pipelines using GitHub Actions.
· Strong experience with Datadog for system monitoring, including implementation, configuration, and maintenance.
· Demonstrated ability to create and maintain Datadog dashboards for performance visualization.
· Proven expertise in setting up alerts and conducting incident response with Datadog.
· Hands-on experience with container orchestration systems such as Kubernetes or AWS ECS.
· Proficiency in Infrastructure as Code (IaC) tools like Terraform or AWS CloudFormation.
· Familiarity with operational aspects of Large Language Models (LLMs) is highly desirable.
· Strong scripting skills in Python, Bash, or similar languages.
· In-depth knowledge of security standards and best practices.
· Excellent documentation skills.
· Proven ability to work collaboratively with development and AI teams.
· Commitment to staying current with industry trends and emerging technologies
Recommended Jobs
(Job RF -1114) Customer Success Manager
Ash & Harris Executive Search is looking for a Senior Strategic Customer Success Manager Overview: This role is critical for building impeccable relationships and acting as a trusted adviser to e…
Product Manager
At Perry Weather, we build technology that helps organizations stay safe, operational, and confident when weather conditions change. From the PGA and top construction companies to thousands of school…
Material Handler 2
What if you were given the opportunity and responsibility to make a difference? At International Paper, you control your destiny. We offer challenging assignments and total rewards in countries aroun…
Turbine and Generator Lifing Consultant
Description: Engage and lead within a dynamic team providing client focused solutions for simple and combined cycle gas and steam turbines, conventional steam turbines, hydro, and nuclear-powered …
Customer Success Manager
About Megaport Megaport has transformed the way IT gets connected. We're global leaders in Network as a Service (NaaS), changing the way businesses reach the cloud. We're also a leading partner to A…
Travel- RN -CVOR
ATC Healthcare is looking for Registered Nurses! Registered Nurses provide skilled nursing services to patients in a variety of healthcare settings. The Registered Nurse, or RN, is responsible for wor…
Master Level Clinician (Therapist)
Position Summary: We are hiring a Masters-Level Clinician to deliver high-quality, trauma-informed therapy to individuals, couples, and families. The clinician will conduct assessments, crea…
Senior Infrastructure Engineer
About us: Piping Technology & Products is a fast-growing American manufacturer, providing specialty piping products to customers worldwide. With over 40 acres of operations and more than 600 employe…
Full Time Orthopedics Job Plainview, TX
Covenant Health is seeking a full-time Orthopedic Surgeon to join its hospital-based team in Plainview, Texas. This role is integral to providing high-quality orthopedic care in a facility recognized…
Nuclear Medicine Technologist PRN
JOB SUMMARY Responsible for preparing and administering radiopharmaceutical drugs that patients receive orally, by injection, or through inhalation, prior to performing nuclear imaging (PET Scans, SP…