Multimodal AI service that extracts semantic content from documents, video, audio, and image files for RAG and automated workflows.
Content Understanding operations are asynchronous, long-running operations: the client starts an analysis and then waits for the service to report a final result.
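
The start-then-poll shape of a long-running operation can be sketched without the service at all. The block below is a minimal simulation, not the SDK's API: `FakeOperation`, `wait_for_result`, and the status strings `"Running"`, `"Succeeded"`, and `"Failed"` are all hypothetical names chosen for illustration. Azure SDKs for Python generally hide this loop behind a `begin_*` method that returns a poller with a `result()` method, so in practice you would rarely write it by hand.

```python
import time
from dataclasses import dataclass
from typing import Any, Callable, Dict


@dataclass
class FakeOperation:
    """Hypothetical stand-in for a server-side analysis job (not the real SDK)."""

    polls_until_done: int = 3
    _polls: int = 0

    def status(self) -> Dict[str, Any]:
        # Report "Running" until the configured number of polls is reached.
        self._polls += 1
        if self._polls < self.polls_until_done:
            return {"status": "Running"}
        return {"status": "Succeeded", "result": {"markdown": "# Example"}}


def wait_for_result(
    get_status: Callable[[], Dict[str, Any]],
    interval: float = 0.01,
    timeout: float = 5.0,
) -> Any:
    """Poll get_status until the operation succeeds, fails, or times out."""
    deadline = time.monotonic() + timeout
    while True:
        state = get_status()
        if state["status"] == "Succeeded":
            return state["result"]
        if state["status"] == "Failed":
            raise RuntimeError("analysis failed")
        if time.monotonic() >= deadline:
            raise TimeoutError("analysis did not finish in time")
        time.sleep(interval)


if __name__ == "__main__":
    operation = FakeOperation(polls_until_done=3)
    print(wait_for_result(operation.status))
```

The timeout matters more than it looks: video and audio analysis can take far longer than document analysis, so a fixed short deadline that works for PDFs may be too aggressive for other content types.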

| Analyzer | Content type | Purpose |
| --- | --- | --- |
| prebuilt-documentSearch | Documents | Extract markdown for RAG applications |
| prebuilt-imageSearch | Images | Extract content from images |
| prebuilt-audioSearch | Audio | Transcribe audio with timing |
| prebuilt-videoSearch | Video | Extract frames, transcripts, summaries |
| prebuilt-invoice | Documents | Extract invoice fields |

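
One way to use the analyzer list is to pick an analyzer ID from a file's extension before submitting it. This is only a sketch: the analyzer IDs come from the list above, but `pick_analyzer` and the extension mapping are hypothetical and do not reflect which formats the service actually accepts.

```python
from pathlib import PurePath

# Analyzer IDs taken from the list above, keyed by content type.
PREBUILT_ANALYZERS = {
    "document": "prebuilt-documentSearch",
    "image": "prebuilt-imageSearch",
    "audio": "prebuilt-audioSearch",
    "video": "prebuilt-videoSearch",
}

# Hypothetical extension mapping, for illustration only.
EXTENSION_TO_KIND = {
    ".pdf": "document",
    ".docx": "document",
    ".png": "image",
    ".jpg": "image",
    ".wav": "audio",
    ".mp3": "audio",
    ".mp4": "video",
}


def pick_analyzer(filename: str) -> str:
    """Return a prebuilt analyzer ID for a file, based on its extension."""
    suffix = PurePath(filename).suffix.lower()
    kind = EXTENSION_TO_KIND.get(suffix)
    if kind is None:
        raise ValueError(f"no analyzer mapped for extension {suffix!r}")
    return PREBUILT_ANALYZERS[kind]


if __name__ == "__main__":
    print(pick_analyzer("quarterly-report.pdf"))
```

`prebuilt-invoice` is left out of the mapping on purpose: it targets a specific document type rather than a file format, so choosing it requires knowing what the document contains.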
Azure AI Content Understanding SDK for Python. Use for multimodal content extraction from documents, images, audio, and video. Triggers: "azure-ai-contentunderstanding", "ContentUnderstandingClient", "multimodal analysis", "document extraction", "video analysis", "audio transcription". Source: sickn33/antigravity-awesome-skills.