·high-performance-inference

</>

high-performance-inference

✓

yonatangross/skillforge-claude-plugin

vLLM, 양자화(AWQ, GPTQ, FP8), 추측 디코딩 및 에지 배포를 통한 고성능 LLM 추론. 추론 대기 시간, 처리량 또는 메모리를 최적화할 때 사용합니다.

yonatangross·high·performance·inference

4설치·0트렌드·@yonatangross

설치

$npx skills add https://github.com/yonatangross/skillforge-claude-plugin --skill high-performance-inference

상세

카테고리: </>개발 도구
출처: skills.sh
최초 등록: 2026-02-01

관련 Skills

domain-driven-design

zustand-patterns

architecture-decision-record

code-review-playbook