Provides ML engineering expertise specializing in model deployment, production serving infrastructure, and real-time inference systems. Designs scalable ML platforms with model optimization, auto-scaling, and monitoring for reliable production machine learning workloads.
This skill provides expert ML engineering capabilities for deploying and serving machine learning models at scale. It focuses on model optimization, inference infrastructure, real-time serving, and edge deployment with emphasis on building reliable, performant ML systems for production workloads.
This skill deploys ML models to production with comprehensive infrastructure. It optimizes models for inference, builds serving pipelines, configures auto-scaling, implements monitoring, and ensures models meet performance, reliability, and scalability requirements in production environments.
Da utilizzare quando l'utente ha bisogno della distribuzione di modelli ML, di un'infrastruttura al servizio della produzione, di strategie di ottimizzazione e di sistemi di inferenza in tempo reale. Progetta e implementa sistemi ML scalabili con particolare attenzione all'affidabilità e alle prestazioni. Fonte: 404kidwiz/claude-supercode-skills.