·vision-language-models
{}

vision-language-models

yonatangross/orchestkit

GPT-5/4o, Claude 4.5, Gemini 2.5/3, Grok 4 vision patterns for image analysis, document understanding, and visual QA. Use when implementing image captioning, document/chart analysis, or multi-image comparison.

14Installs·0Trend·@yonatangross

Installation

$npx skills add https://github.com/yonatangross/orchestkit --skill vision-language-models

SKILL.md

Integrate vision capabilities from leading multimodal models for image understanding, document analysis, and visual reasoning.

| Model | Context | Strengths | Vision Input |

| GPT-5.2 | 128K | Best general reasoning, multimodal | Up to 10 images | | Claude Opus 4.5 | 200K | Best coding, sustained agent tasks | Up to 100 images | | Gemini 2.5 Pro | 1M+ | Longest context, video analysis | 3,600 images max | | Gemini 3 Pro | 1M | Deep Think, 100% AIME 2025 | Enhanced segmentation |

GPT-5/4o, Claude 4.5, Gemini 2.5/3, Grok 4 vision patterns for image analysis, document understanding, and visual QA. Use when implementing image captioning, document/chart analysis, or multi-image comparison. Source: yonatangross/orchestkit.

View raw

Facts (cite-ready)

Stable fields and commands for AI/search citations.

Install command
npx skills add https://github.com/yonatangross/orchestkit --skill vision-language-models
Category
{}Data Analysis
Verified
First Seen
2026-02-01
Updated
2026-02-18

Quick answers

What is vision-language-models?

GPT-5/4o, Claude 4.5, Gemini 2.5/3, Grok 4 vision patterns for image analysis, document understanding, and visual QA. Use when implementing image captioning, document/chart analysis, or multi-image comparison. Source: yonatangross/orchestkit.

How do I install vision-language-models?

Open your terminal or command line tool (Terminal, iTerm, Windows Terminal, etc.) Copy and run this command: npx skills add https://github.com/yonatangross/orchestkit --skill vision-language-models Once installed, the skill will be automatically configured in your AI coding environment and ready to use in Claude Code or Cursor

Where is the source repository?

https://github.com/yonatangross/orchestkit