·image-generation
</>

image-generation

xiangyu-cas/vision-skills

Gemini image generation and editing skill for text-to-image, image-to-image edits, multi-reference composition, and Google Search grounding. Use when creating or modifying images via Gemini (default model gemini-3-pro-image-preview) with the Python SDK.

0Installs·0Trend·@xiangyu-cas

Installation

$npx skills add https://github.com/xiangyu-cas/vision-skills --skill image-generation

SKILL.md

Use this skill when the user asks to generate or edit images with Gemini using the Python SDK. Default to gemini-3-pro-image-preview, and mention gemini-2.5-flash-image only as an optional faster/cheaper alternative.

1) Identify task type (text-to-image, edit, or multi-reference). 2) Ensure GEMINIAPIKEY is available (env or stored in .env), then use the Python SDK. This will make network requests to the Gemini API 3) Choose model + output (responsemodalities=["IMAGE"] if image-only) and run. Generation can take 30 seconds; allow 30–60 seconds before retrying.

4) Save returned images with part.asimage(); if none, report a clear error.

Gemini image generation and editing skill for text-to-image, image-to-image edits, multi-reference composition, and Google Search grounding. Use when creating or modifying images via Gemini (default model gemini-3-pro-image-preview) with the Python SDK. Source: xiangyu-cas/vision-skills.

View raw

Facts (cite-ready)

Stable fields and commands for AI/search citations.

Install command
npx skills add https://github.com/xiangyu-cas/vision-skills --skill image-generation
Category
</>Dev Tools
Verified
First Seen
2026-02-01
Updated
2026-02-18

Quick answers

What is image-generation?

Gemini image generation and editing skill for text-to-image, image-to-image edits, multi-reference composition, and Google Search grounding. Use when creating or modifying images via Gemini (default model gemini-3-pro-image-preview) with the Python SDK. Source: xiangyu-cas/vision-skills.

How do I install image-generation?

Open your terminal or command line tool (Terminal, iTerm, Windows Terminal, etc.) Copy and run this command: npx skills add https://github.com/xiangyu-cas/vision-skills --skill image-generation Once installed, the skill will be automatically configured in your AI coding environment and ready to use in Claude Code or Cursor

Where is the source repository?

https://github.com/xiangyu-cas/vision-skills

Details

Category
</>Dev Tools
Source
user
First Seen
2026-02-01