high-performance-inference

yonatangross/orchestkit

High-performance LLM inference with vLLM, quantization (AWQ, GPTQ, FP8), speculative decoding, and edge deployment. Use when optimizing inference latency, throughput, or memory.

12 installs · 0 trending · @yonatangross

Install

$ npx skills add https://github.com/yonatangross/orchestkit --skill high-performance-inference

Details

Category: Developer Tools
Source: skills.sh
First registered: 2026-02-01

Related Skills

dashboard-patterns

recharts-patterns

radix-primitives

responsive-patterns