Is ViscribeAI open source?

Yes. ViscribeAI provides open-source Python and TypeScript packages for schema-driven image extraction.

Can I use my existing AI provider?

Yes. ViscribeAI is designed around model configuration so developers can use the model provider setup that already fits their stack.

Can I use local images?

Yes. ViscribeAI supports local image paths, base64 inputs, and image URLs depending on the runtime.

Open source framework

The Image
Intelligence
Layer for AI Agents

Q: Looking for a hosted solution?

Reach out to ViscribeAI at contact@viscribe.ai and tell us what you want to build.

Viscribe extracts structured data from images using AI models. Define the output schema, pass the image, pick the model, and get parsed results back instead of free-form text.

View on GitHub Join Discord

Open-source workflowBring your own modelLocal paths and base64

Venice Grand Canal with Rialto Bridge and gondolas

How It Works

Simple Image Calls, Structured Results

Install the library, point it at an image, and keep the result shape predictable.

Install package

bash

1pip install viscribe

Extract structured data

python

1from pydantic import BaseModel, Field2from viscribe.images import extract3 4class Receipt(BaseModel):5    merchant_name: str | None = Field(description="Store or business name")6    total_amount: float | None = Field(description="Final receipt total")7    date: str | None = Field(description="Receipt date if visible")8    line_items: list[str] = Field(description="Visible purchased items")9 10result = extract(11    image_path="examples/venice.png",12    output_schema=Receipt,13    instruction="Extract the receipt fields visible in the image.",14    model_config={15        "model": "gpt-5-mini",16        "api_key": "sk-...",17        "temperature": 1,18    },19)20 21print(result.data.model_dump())

Model Provider Friendly

Use Your Existing AI Model Providers

Keep your provider choices. Viscribe focuses the image workflow while you route through OpenAI, Anthropic, Mistral, Groq, OpenRouter, Fireworks, or your own compatible setup.

OpenAI

Anthropic

Google

Gemini

Mistral

Groq

OpenRouter

Fireworks

Azure

Bedrock

Cohere

DeepSeek

Perplexity

xAI

Vision Tools Built for Jobs

Five focused image methods designed around what applications and agents actually need to do.

Image methods

Image Description

Generate concise descriptions and tags from product images, screenshots, documents, or visual datasets.

Objective captions for downstream AI workflows

Input

Venice canal image

Instruction

Describe the image and include useful tags.

Output

Description

A Venice canal scene with historic buildings, gondolas, and bright water framed by classic Italian architecture.

Vision Intelligence for Real Workflows

From product catalog automation to quality checks, Viscribe turns visual inputs into answers your application can use.

E-commerce automation

Product Catalog Automation

Extract product descriptions, attributes, categories, and variant details with a single schema-driven request.

10x faster catalog processing

Visual intelligence

Visual Monitoring

Track screenshots, public product images, and visual datasets for meaningful changes over time.

Change signals ready for review

AI agents

Agent Visual Reasoning

Give agents image tools that return focused answers instead of free-form prompts.

Agent-native visual context

Quality extraction

Quality Verification

Extract pass/fail checks, differences, and review notes from QA images using explicit schemas.

Automated QA workflows

Document processing

Document Data Extraction

Extract structured fields from receipts, forms, invoices, labels, and photographed documents.

Schema-ready output

Developer tools

Developer Workflows

Use one matching extract method across Python and TypeScript with clean result wrappers.

Drop into existing apps

Open Source

Free to Use, Open to Shape

Use Viscribe in your project, bring your own model provider, and share what would make it better for your workflow.

Open Source

Build image workflows with focused SDKs and your existing model provider.

Free

One extract APIModel every image task with schemas

Python + TypeScriptMatching SDKs for app and agent code

Provider choiceStructured results from your model setup

Star the repo

Looking for a hosted solution?

Tell us what you want to build.

Reach out!

Frequently Asked Questions

Join the community

Help shape Viscribe with the builders using it.

Star the repo, join Discord, and tell us what you are building so we can keep Viscribe useful for real image workflows.

GitHub Discord

The Image
Intelligence
Layer for AI Agents

Simple Image Calls, Structured Results

Use Your Existing AI Model Providers

Vision Tools Built for Jobs

Image methods

Image Description

Structured Extraction

Image Classification

Visual Q&A

Image Comparison

Image Description

Input

Output

Vision Intelligence for Real Workflows

Product Catalog Automation

Visual Monitoring

Agent Visual Reasoning

Quality Verification

Document Data Extraction

Developer Workflows

Free to Use, Open to Shape

Open Source

Looking for a hosted solution?

Frequently Asked Questions

Help shape Viscribe with the builders using it.

Stay up to date

The Image Intelligence Layer for AI Agents

Simple Image Calls, Structured Results

Use Your Existing AI Model Providers

Vision Tools Built for Jobs

Image methods

Image Description

Structured Extraction

Image Classification

Visual Q&A

Image Comparison

Image Description

Input

Output

Vision Intelligence for Real Workflows

Product Catalog Automation

Visual Monitoring

Agent Visual Reasoning

Quality Verification

Document Data Extraction

Developer Workflows

Free to Use, Open to Shape

Open Source

Looking for a hosted solution?

Frequently Asked Questions

Is Viscribe open source?

Which languages are supported?

Can I use my existing AI provider?

Can I use local images?

Looking for a hosted solution?

Help shape Viscribe with the builders using it.

Stay up to date

The Image
Intelligence
Layer for AI Agents