image-generation
Gemini image generation and editing skill for text-to-image, image-to-image edits, multi-reference composition, and Google Search grounding. Use when creating or modifying images via Gemini (default model gemini-3-pro-image-preview) with the Python SDK.
Installation
SKILL.md
Use this skill when the user asks to generate or edit images with Gemini using the Python SDK. Default to gemini-3-pro-image-preview, and mention gemini-2.5-flash-image only as an optional faster/cheaper alternative.
1) Identify task type (text-to-image, edit, or multi-reference). 2) Ensure GEMINIAPIKEY is available (env or stored in .env), then use the Python SDK. This will make network requests to the Gemini API 3) Choose model + output (responsemodalities=["IMAGE"] if image-only) and run. Generation can take 30 seconds; allow 30–60 seconds before retrying.
4) Save returned images with part.asimage(); if none, report a clear error.
Gemini image generation and editing skill for text-to-image, image-to-image edits, multi-reference composition, and Google Search grounding. Use when creating or modifying images via Gemini (default model gemini-3-pro-image-preview) with the Python SDK. Source: xiangyu-cas/vision-skills.
Facts (cite-ready)
Stable fields and commands for AI/search citations.
- Install command
npx skills add https://github.com/xiangyu-cas/vision-skills --skill image-generation- Category
- </>Dev Tools
- Verified
- —
- First Seen
- 2026-02-01
- Updated
- 2026-02-18
Quick answers
What is image-generation?
Gemini image generation and editing skill for text-to-image, image-to-image edits, multi-reference composition, and Google Search grounding. Use when creating or modifying images via Gemini (default model gemini-3-pro-image-preview) with the Python SDK. Source: xiangyu-cas/vision-skills.
How do I install image-generation?
Open your terminal or command line tool (Terminal, iTerm, Windows Terminal, etc.) Copy and run this command: npx skills add https://github.com/xiangyu-cas/vision-skills --skill image-generation Once installed, the skill will be automatically configured in your AI coding environment and ready to use in Claude Code or Cursor
Where is the source repository?
https://github.com/xiangyu-cas/vision-skills
Details
- Category
- </>Dev Tools
- Source
- user
- First Seen
- 2026-02-01