Software Reliability Engineer for AI

MixMode
Remote Full-time 🌐 English
Added to JobCollate: February 5, 2026

AI Summary Powered by Gemini

MixMode is seeking a Senior Software Reliability Engineer to enhance the reliability, performance, and scalability of their AI-powered cybersecurity solutions. This role involves strengthening distributed systems, refactoring code for better observability, and partnering with ML researchers to productionize models at scale, offering an opportunity to work with cutting-edge AI in a critical security domain.

Job Description

MixMode is a leading provider of AI-powered cybersecurity solutions at scale, pioneering a patented third-wave, context-aware AI approach that automatically learns and adapts to dynamic environments. The MixMode platform delivers self-supervised, real-time threat detection for known and unknown threats across cloud, hybrid, and on-premises environments. Large organizations with big data workloads – including those in enterprise, critical infrastructure, US Department of War and US Intelligence Community – trust MixMode to defend their most important assets. Backed by PSG and Entrada Ventures, MixMode is headquartered in Santa Barbara, California. Learn more at www.mixmode.ai.

Job Title: Senior Software Reliability Engineer for AI
Location: Santa Barbara, CA or Remote

Job Summary: We are looking for a Senior Software Engineer to improve the reliability, performance, and scalability of our production AI systems. This role focuses on understanding, refining, and strengthening existing distributed services across application, database, and Kubernetes layers. This individual will work closely with ML researchers to make our systems more robust, maintainable, flexible, and scalable.

Responsibilities:
• Own the reliability, performance, and operational health of production AI systems, focusing on improving complex, existing services.
• Lead efforts to refactor and harden the AI codebase to improve observability, maintainability, and resilience.
• Diagnose and resolve issues across distributed systems, including latency, throughput, data pipelines, and resource utilization.
• Design and build monitoring, alerting, and debugging tools for high-availability services.
• Partner with researchers and ML engineers to productionize models at scale.
• Establish best practices for testing, deployment, capac

Please mention the word **POLITENESS** and tag RMjAwMTo0MWQwOjcwMToxMTAwOjoxNjRh when applying to show you read the job post completely (#RMjAwMTo0MWQwOjcwMToxMTAwOjoxNjRh).
This is a beta feature to avoid spam applicants. Companies can search these words to find applicants that read this and see they're human.

Required Skills

software, design, lead, senior, operational, reliability, health, engineer, digital nomad