Open source framework

The Image Intelligence Layer for AI Agents

Viscribe extracts structured data from images using AI models. Define the output schema, pass the image, pick the model, and get parsed results back instead of free-form text.

Open-source workflow
Bring your own model
Local paths and base64
[Image: Venice Grand Canal with Rialto Bridge and gondolas]
How It Works

Simple Image Calls, Structured Results

Install the library, point it at an image, and keep the result shape predictable.

Install package

```bash
pip install viscribe
```

Extract structured data

```python
from pydantic import BaseModel, Field
from viscribe.images import extract

class Receipt(BaseModel):
    merchant_name: str | None = Field(description="Store or business name")
    total_amount: float | None = Field(description="Final receipt total")
    date: str | None = Field(description="Receipt date if visible")
    line_items: list[str] = Field(description="Visible purchased items")

result = extract(
    image_path="examples/venice.png",
    output_schema=Receipt,
    instruction="Extract the receipt fields visible in the image.",
    model_config={
        "model": "gpt-5-mini",
        "api_key": "sk-...",
        "temperature": 1,
    },
)

print(result.data.model_dump())
```
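Most fields in `Receipt` are typed `... | None`, so the dictionary printed above can hold `None` wherever the model could not read a value. Here is a minimal sketch of flagging those gaps before the data moves downstream. The helper `missing_fields` and the sample values are illustrative and not part of Viscribe.

```python
from typing import Any


def missing_fields(data: dict[str, Any]) -> list[str]:
    """Return the names of fields that came back as None or as an empty list."""
    return [name for name, value in data.items() if value is None or value == []]


# Hypothetical output of result.data.model_dump() for a partially legible receipt.
sample = {
    "merchant_name": "Example Cafe",
    "total_amount": 12.5,
    "date": None,
    "line_items": [],
}

print(missing_fields(sample))
```

A check like this makes it easy to route incomplete results to a retry or a human review step.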
Model Provider Friendly

Use Your Existing AI Model Providers

Keep your provider choices. Viscribe focuses the image workflow while you route through OpenAI, Anthropic, Mistral, Groq, OpenRouter, Fireworks, or your own compatible setup.
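One way to switch providers without touching call sites is to build the `model_config` dictionary from environment variables. The sketch below assumes the three keys shown in the quick-start example (`model`, `api_key`, `temperature`) are sufficient. Whether a given provider needs extra settings in Viscribe is not covered here. The helper `build_model_config` is hypothetical, and the variable names are the ones each provider's own tooling conventionally uses.

```python
import os
from collections.abc import Mapping

# Conventional environment variable names for each provider's API key.
# Assumption: the key is passed to Viscribe explicitly via model_config.
API_KEY_ENV = {
    "openai": "OPENAI_API_KEY",
    "anthropic": "ANTHROPIC_API_KEY",
    "mistral": "MISTRAL_API_KEY",
    "groq": "GROQ_API_KEY",
}


def build_model_config(
    provider: str,
    model: str,
    env: Mapping[str, str] = os.environ,
    temperature: float = 1,
) -> dict:
    """Assemble a model_config dict shaped like the quick-start example."""
    if provider not in API_KEY_ENV:
        raise ValueError(f"No API key variable registered for provider {provider!r}")
    variable = API_KEY_ENV[provider]
    if variable not in env:
        raise KeyError(f"Set {variable} before calling extract")
    return {"model": model, "api_key": env[variable], "temperature": temperature}


# Demo with an in-memory environment so the sketch runs without real credentials.
demo_env = {"OPENAI_API_KEY": "demo-key"}
print(build_model_config("openai", "gpt-5-mini", env=demo_env)["model"])
```

Keeping keys out of source code also avoids committing a literal `api_key` value by accident.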

OpenAI
Anthropic
Google
Gemini
Mistral
Groq
OpenRouter
Fireworks
Azure
Bedrock
Cohere
DeepSeek
Perplexity
xAI
Meta
Hugging Face
Replicate
Together AI
Ollama
Cloudflare
NVIDIA
Opinionated Image Methods

Vision Tools Built for Jobs

Five focused image methods designed around what applications and agents actually need to do.

Image methods

Image Description

Generate concise descriptions and tags from product images, screenshots, documents, or visual datasets.

Objective captions for downstream AI workflows

Input

Venice canal image
Instruction
Describe the image and include useful tags.

Output

Description
A Venice canal scene with historic buildings, gondolas, and bright water framed by classic Italian architecture.
Tags
venice, canal, architecture, gondolas, travel
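The output above has two parts, a description and a list of tags. The name and signature of the dedicated description method are not shown on this page, so the sketch below only models that output shape with Pydantic, in the same style as the `Receipt` schema. The class name `ImageDescription` is an assumption.

```python
from pydantic import BaseModel, Field


class ImageDescription(BaseModel):
    description: str = Field(description="One or two objective sentences about the image")
    tags: list[str] = Field(description="Short lowercase keywords")


# A payload shaped like the example output for the Venice canal image.
payload = {
    "description": (
        "A Venice canal scene with historic buildings, gondolas, and bright "
        "water framed by classic Italian architecture."
    ),
    "tags": ["venice", "canal", "architecture", "gondolas", "travel"],
}

parsed = ImageDescription.model_validate(payload)
print(parsed.tags)
```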
Built for Agents & Automation

Vision Intelligence for Real Workflows

From product catalog automation to quality checks, Viscribe turns visual inputs into answers your application can use.

E-commerce automation

Product Catalog Automation

Extract product descriptions, attributes, categories, and variant details with a single schema-driven request.

10x faster catalog processing
Visual intelligence

Visual Monitoring

Track screenshots, public product images, and visual datasets for meaningful changes over time.

Change signals ready for review
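Once each capture is reduced to structured fields, spotting a change becomes a dictionary comparison. Below is a minimal sketch, assuming two extraction results have already been dumped to plain dictionaries. The helper `changed_fields` and the sample values are hypothetical.

```python
from typing import Any


def changed_fields(
    previous: dict[str, Any], current: dict[str, Any]
) -> dict[str, tuple[Any, Any]]:
    """Map each field whose value differs to its (old, new) pair."""
    keys = previous.keys() | current.keys()
    return {
        key: (previous.get(key), current.get(key))
        for key in sorted(keys)
        if previous.get(key) != current.get(key)
    }


# Hypothetical extractions from two screenshots of the same product page.
yesterday = {"price": "19.99", "badge": None, "in_stock": True}
today = {"price": "17.99", "badge": "Sale", "in_stock": True}

for field, (old, new) in changed_fields(yesterday, today).items():
    print(f"{field}: {old!r} -> {new!r}")
```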
AI agents

Agent Visual Reasoning

Give agents image tools that return focused, structured answers instead of free-form text.

Agent-native visual context
Quality extraction

Quality Verification

Extract pass/fail checks, differences, and review notes from QA images using explicit schemas.

Automated QA workflows
Document processing

Document Data Extraction

Extract structured fields from receipts, forms, invoices, labels, and photographed documents.

Schema-ready output
Developer tools

Developer Workflows

Use one matching extract method across Python and TypeScript with clean result wrappers.

Drop into existing apps
Open Source

Free to Use, Open to Shape

Use Viscribe in your project, bring your own model provider, and share what would make it better for your workflow.

Open Source

Build image workflows with focused SDKs and your existing model provider.

Free
One extract API: Model every image task with schemas
Python + TypeScript: Matching SDKs for app and agent code
Provider choice: Structured results from your model setup
Star the repo

Looking for a hosted solution?

Tell us what you want to build.

Reach out!

Frequently Asked Questions

Join the community

Help shape Viscribe with the builders using it.

Star the repo, join Discord, and tell us what you are building so we can keep Viscribe useful for real image workflows.

Stay up to date

Share a little context and we will keep you posted on Viscribe news.

Contact us