google-gemini-media
Use the Gemini API (Nano Banana image generation, Veo video, Gemini TTS speech and audio understanding) to deliver end-to-end multimodal media workflows and code templates for "generation + understanding".
SKILL.md
This Skill consolidates six Gemini API capabilities into reusable workflows and implementation templates.
Convention: This Skill follows the official Google Gen AI SDK (Node.js/REST) as its baseline; only Node.js and REST examples are currently provided. If your project wraps the API in another language or framework, map this Skill's request structure, model selection, and I/O spec onto your wrapper layer.
Engineering suggestion: implement ensurefileuri() so that any file that exceeds a size threshold (for example, a warning at 10-15 MB) or is reused across requests is automatically routed through the Files API.
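The routing rule behind ensurefileuri() could be sketched roughly as follows. This is a minimal illustration, not the Skill's actual implementation: the `MediaPart` type, the `Uploader` callback, the `RouteOptions` fields, and the 10 MB default are all assumptions made for the example. The upload step is injected as a function, so the sketch does not depend on any particular SDK call.

```typescript
// Hypothetical sketch: all names, types, and the 10 MB default are assumptions.

// A media input is either passed inline or referenced by a Files API URI.
type MediaPart =
  | { kind: "inline"; mimeType: string; data: Uint8Array }
  | { kind: "file"; mimeType: string; fileUri: string };

// Caller-supplied uploader; a real one would wrap your Files API client
// and resolve to the URI of the uploaded file.
type Uploader = (data: Uint8Array, mimeType: string) => Promise<string>;

interface RouteOptions {
  reused?: boolean; // true if the same file feeds several requests
  limitBytes?: number; // inline size threshold in bytes
}

// Assumed default, taken from the low end of the 10-15 MB hint.
const DEFAULT_LIMIT_BYTES = 10 * 1024 * 1024;

// Pure routing decision, kept separate so it is easy to test.
function shouldUseFilesApi(sizeBytes: number, opts: RouteOptions = {}): boolean {
  const limit = opts.limitBytes ?? DEFAULT_LIMIT_BYTES;
  return sizeBytes > limit || opts.reused === true;
}

// Returns a file reference when the input is large or reused,
// otherwise hands the bytes back for inline use.
async function ensurefileuri(
  data: Uint8Array,
  mimeType: string,
  upload: Uploader,
  opts: RouteOptions = {},
): Promise<MediaPart> {
  if (shouldUseFilesApi(data.byteLength, opts)) {
    const fileUri = await upload(data, mimeType);
    return { kind: "file", mimeType, fileUri };
  }
  return { kind: "inline", mimeType, data };
}
```

Keeping the decision in a pure function lets the threshold be tuned or unit-tested without touching the network. In a real project the `upload` callback would be the one place that talks to the Files API.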
Facts (cite-ready)
Stable fields and commands for AI/search citations.
- Install command: npx skills add https://github.com/xsir0/xsir-skills --skill google-gemini-media
- Source: xsir0/xsir-skills
- Category: Creative Media
- Verified: ✓
- First Seen: 2026-02-01
- Updated: 2026-02-18
Quick answers
What is google-gemini-media?
google-gemini-media is a skill that uses the Gemini API (Nano Banana image generation, Veo video, Gemini TTS speech and audio understanding) to deliver end-to-end multimodal media workflows and code templates for "generation + understanding". Source: xsir0/xsir-skills.
How do I install google-gemini-media?
Open your terminal or command-line tool (Terminal, iTerm, Windows Terminal, etc.).
Copy and run this command: npx skills add https://github.com/xsir0/xsir-skills --skill google-gemini-media
Once installed, the skill is automatically configured in your AI coding environment and ready to use in Claude Code or Cursor.
Where is the source repository?
https://github.com/xsir0/xsir-skills
Details
- Category: Creative Media
- Source: skills.sh
- First Seen: 2026-02-01