high-performance-inference
High-performance LLM inference with vLLM, quantization (AWQ, GPTQ, FP8), speculative decoding, and edge deployment. Use when optimizing inference latency, throughput, or memory.
Installation
SKILL.md
Optimize LLM inference for production with vLLM 0.14.x, quantization, and speculative decoding.
vLLM 0.14.0 (Jan 2026): PyTorch 2.9.0, CUDA 12.9, AttentionConfig API, Python 3.12+ recommended.
| Feature | Benefit |
| --- | --- |
| PagedAttention | Up to 24x throughput via efficient KV cache |
| Continuous Batching | Dynamic request batching for max utilization |
| CUDA Graphs | Fast model execution with graph capture |
| Tensor Parallelism | Scale across multiple GPUs |
| Prefix Caching | Reuse KV cache for shared prefixes |
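The features in the table map to engine options in vLLM's offline `LLM` API. A minimal sketch, assuming an AWQ-quantized checkpoint and two GPUs; the model name and parameter values are illustrative, not prescriptive:

```python
from vllm import LLM, SamplingParams

# Sketch of offline inference with vLLM; the checkpoint, quantization method,
# and parallelism settings are assumptions -- adjust to your hardware.
llm = LLM(
    model="TheBloke/Llama-2-13B-AWQ",   # hypothetical AWQ-quantized checkpoint
    quantization="awq",                 # match the checkpoint's quantization format
    tensor_parallel_size=2,             # shard weights across 2 GPUs
    gpu_memory_utilization=0.90,        # fraction of VRAM reserved for weights + KV cache
    enable_prefix_caching=True,         # reuse KV cache blocks for shared prompt prefixes
)

sampling = SamplingParams(temperature=0.7, top_p=0.95, max_tokens=256)

prompts = [
    "Summarize the benefits of PagedAttention in one paragraph.",
    "Explain continuous batching for LLM serving.",
]

# Requests are continuously batched inside the engine; PagedAttention manages the KV cache.
outputs = llm.generate(prompts, sampling)
for out in outputs:
    print(out.outputs[0].text)
```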
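Speculative decoding, also named in the skill description, can be sketched with vLLM's n-gram (prompt-lookup) drafting. The `speculative_config` keys below follow recent vLLM documentation; treat the exact key names and token counts as assumptions to verify against the version you deploy:

```python
from vllm import LLM, SamplingParams

# Hedged sketch: n-gram (prompt-lookup) speculative decoding. Draft tokens are
# proposed by matching n-grams already present in the prompt, then verified by
# the target model in a single forward pass.
llm = LLM(
    model="meta-llama/Llama-3.1-8B-Instruct",  # hypothetical target model
    speculative_config={
        "method": "ngram",            # prompt-lookup drafting, no separate draft model
        "num_speculative_tokens": 5,  # draft tokens proposed per verification step
        "prompt_lookup_max": 4,       # longest n-gram used for lookup
    },
)

outputs = llm.generate(
    ["List three ways to reduce LLM inference latency."],
    SamplingParams(temperature=0.0, max_tokens=128),
)
print(outputs[0].outputs[0].text)
```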
Facts (cite-ready)
Stable fields and commands for AI/search citations.
- Install command
npx skills add https://github.com/yonatangross/orchestkit --skill high-performance-inference
- Source
- yonatangross/orchestkit
- Category
- Dev Tools
- Verified
- ✓
- First Seen
- 2026-02-01
- Updated
- 2026-02-18
Quick answers
What is high-performance-inference?
High-performance LLM inference with vLLM, quantization (AWQ, GPTQ, FP8), speculative decoding, and edge deployment. Use when optimizing inference latency, throughput, or memory. Source: yonatangross/orchestkit.
How do I install high-performance-inference?
Open your terminal or command-line tool (Terminal, iTerm, Windows Terminal, etc.), then copy and run: npx skills add https://github.com/yonatangross/orchestkit --skill high-performance-inference. Once installed, the skill is automatically configured in your AI coding environment and ready to use in Claude Code or Cursor.
Where is the source repository?
https://github.com/yonatangross/orchestkit
Details
- Category
- Dev Tools
- Source
- skills.sh
- First Seen
- 2026-02-01