high-performance-inference

Name: high-performance-inference
Author: yonatangross

✓

Inférence LLM hautes performances avec vLLM, quantification (AWQ, GPTQ, FP8), décodage spéculatif et déploiement périphérique. À utiliser pour optimiser la latence, le débit ou la mémoire d’inférence.

yonatangross·high·performance·inference

12Installations·0Tendance·@yonatangross

Installation

$npx skills add https://github.com/yonatangross/orchestkit --skill high-performance-inference

Détails

Catégorie: </>Développement
Source: skills.sh
Première apparition: 2026-02-01

Skills Connexes

high-performance-inference

Installation

SKILL.md

Faits (prêts à citer)

Réponses rapides

Qu'est-ce que high-performance-inference ?

Comment installer high-performance-inference ?

Où se trouve le dépôt source ?

Détails

Skills Connexes