DevOps Engineer
About the Company
Join a fast-growing organization revolutionizing AI computing with cutting-edge cloud services powered by advanced GPU technology. The company’s mission is to eliminate hardware limitations and deliver scalable, efficient solutions for AI workloads.
About the Role
We are seeking an experienced DevOps Engineer to ensure the reliability, performance, and scalability of our corporate infrastructure. This critical role involves collaborating across teams to design, implement, and maintain highly available and resilient systems. This position is based in Las Vegas, NV.
Key Responsibilities
- Incident Response: Lead incident response efforts, conduct root cause analysis, and implement solutions to prevent future occurrences.
- System Reliability: Develop, implement, and maintain monitoring and alerting systems to proactively identify and resolve issues.
- Automation: Build and automate processes to improve operational efficiency and reduce manual tasks.
- Capacity Planning: Analyze system performance, optimize resource utilization, and make recommendations for scaling.
- Security: Collaborate with security teams to identify and mitigate vulnerabilities in infrastructure and systems.
- Collaboration: Partner with development teams to ensure applications are designed for high reliability and performance.
- On-Call Support: Participate in on-call rotations to address incidents outside regular business hours.
Required Skills and Qualifications
- Strong knowledge of system administration and networking concepts.
- Proficiency in scripting languages (e.g., Python, Bash) and configuration management tools (e.g., Ansible, Puppet, Chef).
- Hands-on experience with cloud platforms (AWS, Azure, GCP).
- Expertise in containerization technologies (Docker, Kubernetes).
- Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack).
- Excellent troubleshooting and problem-solving skills.
- Strong communication and collaboration abilities.
Preferred Qualifications
- Experience with Infrastructure as Code tools (Terraform, CloudFormation).
- Familiarity with CI/CD pipelines and automation tools (e.g., Jenkins, GitLab CI/CD).
- Experience managing databases (MySQL, PostgreSQL, MongoDB).
- Relevant certifications in cloud platforms or DevOps technologies (e.g., AWS Certified DevOps Engineer, Google Cloud Professional Cloud DevOps Engineer, Certified Kubernetes Administrator).
Benefits
The company offers a competitive salary and a comprehensive benefits package, including:
- Stock options.
- 100% paid medical, dental, and vision benefits for employees.
- Life insurance and short-term disability coverage.
- Flexible spending account (FSA).
- 401(k) retirement plan.
- Flexible paid time off (PTO) and paid holidays.
- Parental leave.
- Mental health benefits through a leading provider.
Why Join Us?
This is an opportunity to work with cutting-edge technologies, collaborate with an exceptional team, and contribute to building innovative solutions that power AI computing at scale. If you’re passionate about DevOps, automation, and improving system reliability, this is the perfect role for you.