Use PaddleOCR to extract text content from images. Supports single image or batch processing.
This skill provides optical character recognition (OCR) capabilities using the PaddlePaddle/PaddleOCR-VL-1.5 model via the SiliconFlow API. Extract text from JPG, PNG, WebP, BMP, and GIF images.
| images | Image file path(s) or glob pattern (required) | | -k, --api-key | API key (default: from SILICONFLOWAPIKEY env) | | -m, --model | OCR model name (default: PaddlePaddle/PaddleOCR-VL-1.5) | | -p, --prompt | Recognition prompt for custom behavior | | -j, --json | Output results in JSON format | | -o, --output | Save results to specified file |
OCR skill using PaddleOCR model via SiliconFlow API. This skill should be used when the user asks to "recognize text from an image", "extract text from a photo", "OCR this image", "read text from screenshot", or mentions "PaddleOCR", "image text recognition", "text extraction from images". Source: aotenjou/silicon-paddleocr.