Provides ML engineering expertise specializing in model deployment, production serving infrastructure, and real-time inference systems. Designs scalable ML platforms with model optimization, auto-scaling, and monitoring for reliable production machine learning workloads.
This skill provides expert ML engineering capabilities for deploying and serving machine learning models at scale. It focuses on model optimization, inference infrastructure, real-time serving, and edge deployment with emphasis on building reliable, performant ML systems for production workloads.
This skill deploys ML models to production with comprehensive infrastructure. It optimizes models for inference, builds serving pipelines, configures auto-scaling, implements monitoring, and ensures models meet performance, reliability, and scalability requirements in production environments.
يُستخدم عندما يحتاج المستخدم إلى نشر نموذج ML، والبنية التحتية لخدمة الإنتاج، واستراتيجيات التحسين، وأنظمة الاستدلال في الوقت الفعلي. يصمم وينفذ أنظمة ML قابلة للتطوير مع التركيز على الموثوقية والأداء. المصدر: 404kidwiz/claude-supercode-skills.