video-understand
✓Video understanding and transcription with intelligent multi-provider fallback. Use when: (1) Transcribing video or audio content, (2) Understanding video content including visual elements and scenes, (3) Analyzing YouTube videos by URL, (4) Extracting information from local video files, (5) Getting timestamps, summaries, or answering questions about video content. Automatically selects the best available provider based on configured API keys - prefers full video understanding (Gemini/OpenRouter) over ASR-only providers. Supports model selection per provider.
Installation
SKILL.md
Multi-provider video understanding with automatic fallback and model selection.
| Priority | Provider | Capability | Env Var | Default Model |
| 1 | Gemini | Full video | GEMINIAPIKEY | gemini-3-flash-preview | | 2 | Vertex AI | Full video | GOOGLEAPPLICATIONCREDENTIALS | gemini-3-flash-preview | | 3 | OpenRouter | Full video | OPENROUTERAPIKEY | google/gemini-3-flash-preview | | 4 | OpenAI | ASR only | OPENAIAPIKEY | whisper-1 | | 5 | AssemblyAI | ASR + analysis | ASSEMBLYAIAPIKEY | best |
Video understanding and transcription with intelligent multi-provider fallback. Use when: (1) Transcribing video or audio content, (2) Understanding video content including visual elements and scenes, (3) Analyzing YouTube videos by URL, (4) Extracting information from local video files, (5) Getting timestamps, summaries, or answering questions about video content. Automatically selects the best available provider based on configured API keys - prefers full video understanding (Gemini/OpenRouter) over ASR-only providers. Supports model selection per provider. Source: jrusso1020/video-understand-skills.
Facts (cite-ready)
Stable fields and commands for AI/search citations.
- Install command
npx skills add https://github.com/jrusso1020/video-understand-skills --skill video-understand- Category
- *Creative Media
- Verified
- ✓
- First Seen
- 2026-02-05
- Updated
- 2026-02-18
Quick answers
What is video-understand?
Video understanding and transcription with intelligent multi-provider fallback. Use when: (1) Transcribing video or audio content, (2) Understanding video content including visual elements and scenes, (3) Analyzing YouTube videos by URL, (4) Extracting information from local video files, (5) Getting timestamps, summaries, or answering questions about video content. Automatically selects the best available provider based on configured API keys - prefers full video understanding (Gemini/OpenRouter) over ASR-only providers. Supports model selection per provider. Source: jrusso1020/video-understand-skills.
How do I install video-understand?
Open your terminal or command line tool (Terminal, iTerm, Windows Terminal, etc.) Copy and run this command: npx skills add https://github.com/jrusso1020/video-understand-skills --skill video-understand Once installed, the skill will be automatically configured in your AI coding environment and ready to use in Claude Code or Cursor
Where is the source repository?
https://github.com/jrusso1020/video-understand-skills
Details
- Category
- *Creative Media
- Source
- skills.sh
- First Seen
- 2026-02-05