PANTHEON LAB SDN BHD
Location
any
Contact Info
- Login to view contact number
- Login to view email Copy link
Senior DevOps Engineer (Gen AI Tech)
Views: 159
Posted on 05 Nov 2024
Job Description
Company Overview:
We, at Pantheon Lab Limited, push the boundaries of reality.
As a seasoned GEN AI Tech startup, our mission is to cultivate innovation through the implementation of cutting-edge deep learning technologies. Our solutions empower our global clients to effectively visualize and intellectualize their virtual machines.
By redefining the engagement with Digital Humans and Generative AI, we pioneer the integration of the digital human ecosystem into mainstream applications. Our expertise enables businesses to shape a more innovative and efficient landscape, driving their success forward.
https://www.pantheonlab.ai/
https://www.linkedin.com/company/pantheonlab/
Role Summary:
We are dedicated to building scalable, resilient, and secure infrastructure for our growing suite of applications. As a Senior DevOps Engineer, you will architect and maintain the infrastructure that powers our AI solutions while optimizing and automating deployment processes. Your role will span the development lifecycle—from automation and deployment to monitoring and incident management—collaborating closely with software engineers in Hong Kong, Singapore, and Taiwan while working remotely, to ensure our environments are efficient, secure, and resilient.
- Type: [Full Time/Contract]
- Work Mode: [Remote]
- Salary Range: RM 8000 - RM 15000[max]
- Architect, deploy, and maintain cloud infrastructure across AWS, Digital Ocean, and GPU-optimized hosting providers.
• Manage Infrastructure as Code (IaC) with Terraform for efficient and reliable infrastructure scaling.
• Build and optimize CI/CD pipelines using GitHub Actions for seamless, automated deployments.
• Containerize applications with Docker and manage orchestration with Kubernetes and ArgoCD, supporting large-scale, high-availability deployments (50+ container environments).
• Drive observability initiatives, including the implementation of monitoring, logging, and tracing solutions, using Kibana, Sentry, and other relevant tools to provide real-time insights into system performance and reliability.
• Collaborate with the team to troubleshoot and resolve production issues, ensuring uptime and responsiveness.
• Strengthen security across our infrastructure through best practices in access control, vulnerability management, and threat response.
• Oversee network architecture to support low-latency, secure connectivity across services and troubleshoot networking issues as needed.
Why Join Us:- Hands-on experience working on innovative APAC projects.
- MNC culture, collaborative, and inclusive work environment.
- Opportunities for professional growth and development.
Interested candidates, please apply using one of the following methods:
- Email your resume to jobs@pantheonlab.ai with the subject line " [Job Title] - [Company Name] - Developer Kaki"
- Apply directly through this link:https://www.pantheonlab.ai/career
Note: For candidates with specific experiences (e.g., developing systematic strategies with a verifiable track record), please mention this in your application for potential additional opportunities.
Requirements
Qualifications & Job Requirements:
- Experience: 3+ years of experience in a DevOps or Site Reliability Engineering role, with a proven record of working on large-scale deployments (50+ container environments).
- Education: Degree or above
Technical Skills:
- Expertise in observability tools for monitoring, logging, and tracing, particularly Kibana and Sentry.
• Proficiency in IaC tools, especially Terraform.
• Hands-on experience with Docker, Kubernetes, and ArgoCD for containerization and orchestration.
• Strong experience with CI/CD pipelines and version control, ideally GitHub Actions.
• Proficiency in working with major cloud providers (AWS, GCP, Azure), along with Digital Ocean and GPU-optimized hosting services.
• Strong understanding of networking principles, cybersecurity best practices, and scalability in cloud environments.
• Knowledge of JavaScript, Golang, and Python for infrastructure scripting.
• Experience with PostgreSQL and Redis is a plus.
Industry Knowledge:
• experience in an AI or machine learning-focused company, especially with infrastructure designed for high-performance GPU computing.
Soft Skills:
- Experience working remotely with teams across multiple countries.
- Strong problem-solving skills and the ability to work independently or collaboratively.
- Excellent communication and interpersonal skills.
- Proficient in both written and spoken English and Chinese.
• Ability to manage multi-cloud environments and build cloud-agnostic solutions.
Add any other relevant qualifications
Job Type
Senior
Mode
Remote
Candidate Type
Any
Salary Range
RM 8,000.00 - RM 15,000.00
Address
Unit 508, 5th Floor, Building 16W, No. 16 Science Park West Avenue, Hong Kong Science Park, Hong Kong