DevOps Engineer
Job Description (Summary of Responsibilities):
· Cloud Infrastructure Management: Design, implement, and manage cloud-based infrastructure on AWS and Azure, ensuring optimal scalability, performance, and security.
· CI/CD Pipeline Development: Develop and maintain CI/CD pipelines using GitHub Actions for automated code deployments and testing.
· System Monitoring and Incident Management:
· Implement and configure Datadog for comprehensive system monitoring.
· Develop and maintain Datadog dashboards to visualize system performance and metrics.
· Set up proactive alerts in Datadog to detect and respond to incidents swiftly, ensuring high system reliability and uptime.
· Conduct root cause analysis of incidents and implement corrective actions using Datadog insights.
· Collaboration with AI Teams: Work closely with AI teams to support the operational aspects of LLMs, including deployment strategies and performance tuning.
· Infrastructure as Code (IaC): Implement IaC using tools like Terraform or AWS CloudFormation to automate infrastructure provisioning and management.
· Container Orchestration: Manage container orchestration systems such as Kubernetes or AWS ECS.
· Operational Support for LLMs: Provide operational support for LLMs, focusing on performance optimization and reliability.
· Scripting and Automation: Utilize scripting languages such as Python and Bash for automation and task management.
· Security and Compliance: Ensure compliance with security standards and best practices, implementing robust security measures.
· Documentation: Document system configurations, procedures, and best practices for internal and external stakeholders.
· DevOps Collaboration: Work with development teams to optimize deployment workflows, introduce best practices for DevOps, and improve overall efficiency.
· Technology and Industry Awareness: Stay up-to-date with emerging technologies and industry trends to suggest improvements and upgrades.
Qualifications and Skills Required:
· Extensive experience with AWS and Azure cloud platforms.
· Proficiency in developing CI/CD pipelines using GitHub Actions.
· Strong experience with Datadog for system monitoring, including implementation, configuration, and maintenance.
· Demonstrated ability to create and maintain Datadog dashboards for performance visualization.
· Proven expertise in setting up alerts and conducting incident response with Datadog.
· Hands-on experience with container orchestration systems such as Kubernetes or AWS ECS.
· Proficiency in Infrastructure as Code (IaC) tools like Terraform or AWS CloudFormation.
· Familiarity with operational aspects of Large Language Models (LLMs) is highly desirable.
· Strong scripting skills in Python, Bash, or similar languages.
· In-depth knowledge of security standards and best practices.
· Excellent documentation skills.
· Proven ability to work collaboratively with development and AI teams.
· Commitment to staying current with industry trends and emerging technologies
Recommended Jobs
Diesel Mechanic
Diesel Mechanic Location : Fort Worth, TX Schedule: Monday – Friday, 8:00 AM to 5:00 PM Compensation: $18+/hr. (based on experience) + full benefits Position Overview: Southwest Inte…
Interventional Rad Tech
Interested in a career with both meaning and growth? Whether your abilities are in direct patient care or one of the many other areas of healthcare administration and support, everyone at Parkland wo…
Seasonal PT Brand Ambassador
Position Overview Part-Time Brand Ambassadors have a customer first mindset and are passionate about providing a personalized and inspiring shopping experience that exceeds the cust…
Procurement Manager
Airtable is the no-code app platform that empowers people closest to the work to accelerate their most critical business processes. More than 500,000 organizations, including 80% of the Fortune 100, …
Product Manager
Job Type Full-time Description We are HALO! We connect people and brands to create unforgettable, meaningful, and lasting experiences that build brand engagement and loyalty for our ov…
Networking and Industrial Controls Systems Engineer (Cybersecurity) (REMOTE)
Networking and Industrial Controls Systems Engineer (Cybersecurity) Siemens Industry – Process Automation (PA): The PA business unit has an Industrial Networking/Cyber Security Engineer position o…
Finance Advisor
*We are currently hiring Finance Advisors located in all States EXCEPT we are not able to move forward with candidates that live in: CA, NY, NJ, WA, DC, IL, OR and MUST BE LOCATED WITHIN THE UN…
Explore Houston: Your Next Adventure in Labor & Delivery!
Registered Nurse - Labor & Delivery - Travel - (LD RN) Embark on a thrilling adventure as a Labor and Delivery Nurse in vibrant Houston! Join a top-tier 425-bed Level 3 Trauma center and thrive in a …
CNA -Care Lead
Aloma Healthcare, Inc. seeks a talented and experienced CNA Care Lead to join us for a full-time role as an individual contributor. The CNA Care Lead will be responsible for leading patient care teams…