What is smolvlm?
Local vision-language model for image analysis using SmolVLM-2B Source: tdimino/claude-code-minoan.
Local vision-language model for image analysis using SmolVLM-2B
Quickly install smolvlm AI skill to your development environment via command line
Source: tdimino/claude-code-minoan.
Analyze images locally using SmolVLM-2B, a state-of-the-art compact vision-language model optimized for Apple Silicon via mlx-vlm.
| Model | SmolVLM-2B-Instruct | | Size | 4GB | | Peak Memory | 5.8GB | | Speed | 94 tok/s (M-series) | | Supported Formats | PNG, JPG, JPEG, GIF, WebP |
"Model not found": First run downloads the model (4GB). Wait for completion.
Local vision-language model for image analysis using SmolVLM-2B Source: tdimino/claude-code-minoan.
Stable fields and commands for AI/search citations.
npx skills add https://github.com/tdimino/claude-code-minoan --skill smolvlmLocal vision-language model for image analysis using SmolVLM-2B Source: tdimino/claude-code-minoan.
Open your terminal or command line tool (Terminal, iTerm, Windows Terminal, etc.) Copy and run this command: npx skills add https://github.com/tdimino/claude-code-minoan --skill smolvlm Once installed, the skill will be automatically configured in your AI coding environment and ready to use in Claude Code, Cursor, or OpenClaw
https://github.com/tdimino/claude-code-minoan