Infrastructure Engineer, ML Systems
San Francisco, CA · Full-time · On-site
Overview
We're looking for an Infrastructure Engineer to build and maintain the foundational systems that power Lucidic's optimization platform. You will ensure our systems are reliable, secure, and scalable as we grow.
What You'll Do
Design and maintain cloud infrastructure for ML workloads
Build CI/CD pipelines and deployment automation
Implement security best practices and compliance requirements (SOC 2, HIPAA)
Optimize infrastructure costs while maintaining performance
Set up monitoring, alerting, and incident response processes
Support the team in debugging production issues
What We Look For
Strong experience with cloud platforms (AWS preferred, GCP/Azure acceptable)
Experience with Infrastructure as Code (Terraform, Pulumi, etc.)
Familiarity with Kubernetes and container orchestration
Understanding of networking, security, and compliance requirements
Strong scripting skills (Python, Bash)
Nice to Have
Experience supporting ML workloads and GPU infrastructure
Background in enterprise software or regulated industries
Experience with security certifications (SOC 2, HIPAA)
Familiarity with cost optimization strategies for cloud infrastructure
About Lucidic AI
We build tooling that automatically evaluates and improves enterprise AI agents using search + learning loops across prompts, tools, and policies. Our customers care about reliability, auditability, and measurable gains.
