Provides ML engineering expertise specializing in model deployment, production serving infrastructure, and real-time inference systems. Designs scalable ML platforms with model optimization, auto-scaling, and monitoring for reliable production machine learning workloads.
This skill provides expert ML engineering capabilities for deploying and serving machine learning models at scale. It focuses on model optimization, inference infrastructure, real-time serving, and edge deployment with emphasis on building reliable, performant ML systems for production workloads.
This skill deploys ML models to production together with the infrastructure they depend on. It optimizes models for inference, builds serving pipelines, configures auto-scaling, implements monitoring, and verifies that deployed models meet their performance, reliability, and scalability requirements.
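As one illustration of the auto-scaling step, the sketch below applies the proportional rule that the Kubernetes Horizontal Pod Autoscaler documents: desired replicas = ceil(current replicas × observed metric / target metric). The function name, parameter names, and default bounds are hypothetical and chosen only for this example; the skill itself does not prescribe a particular scaler.

```python
import math


def desired_replicas(current_replicas, observed_load, target_load,
                     min_replicas=1, max_replicas=10):
    """Return a replica count using a proportional scaling rule.

    Hypothetical helper for illustration only. `observed_load` and
    `target_load` are per-replica metrics in the same unit, for example
    requests per second or percent GPU utilization.
    """
    if current_replicas < 1:
        raise ValueError("current_replicas must be at least 1")
    if target_load <= 0:
        raise ValueError("target_load must be positive")

    # Scale the replica count by the ratio of observed to target load,
    # rounding up so capacity is never under-provisioned by rounding.
    raw = math.ceil(current_replicas * observed_load / target_load)

    # Clamp to the configured bounds (assumed defaults, not from the source).
    return max(min_replicas, min(max_replicas, raw))


if __name__ == "__main__":
    # 4 replicas each seeing 90 req/s against a 60 req/s target.
    print(desired_replicas(4, 90.0, 60.0))
```

Clamping to a minimum and maximum keeps a noisy metric from scaling the service to zero or past a cost ceiling; a production scaler would typically also add a stabilization window to avoid flapping.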
Use when the user needs ML model deployment, production serving infrastructure, optimization strategies, or real-time inference systems. Designs and implements scalable machine learning systems with a focus on reliability and performance. Source: 404kidwiz/claude-supercode-skills.