Process a document using OCR (Optical Character Recognition).
This endpoint extracts text and structure from documents (PDFs, images) and returns the content in markdown format. It supports multiple document types and can extract images embedded within documents.
The extracted markdown preserves document structure including headings, tables, lists, and other formatting.
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
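As a minimal sketch of the header format described above, the helper below builds the Authorization header from a token. The function name bearer_headers and the Content-Type entry are assumptions for illustration, not part of this reference.

```python
from typing import Dict


def bearer_headers(token: str) -> Dict[str, str]:
    """Return HTTP headers carrying a Bearer token (hypothetical helper)."""
    if not token:
        raise ValueError("an auth token is required")
    return {
        # Form stated in the reference: "Bearer <token>"
        "Authorization": f"Bearer {token}",
        # Assumption: the request body is sent as JSON
        "Content-Type": "application/json",
    }
```

The returned dictionary can be passed as the headers argument of whichever HTTP client you use.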
Request model for OCR processing.
The OCR model to use
"mistral/mistral-ocr-latest"
The document to process
Specific page indices to process (0-based). If not specified, all pages are processed.
Whether to include base64-encoded images in the response
Maximum number of images to extract per page
x >= 1
Minimum size (width or height in pixels) for images to be included
x >= 1
Mistral-specific OCR parameters
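The request parameters described above could be assembled as in the following sketch. This reference does not show the JSON key names, so every key used here (model, document, pages, include_image_base64, image_limit, image_min_size) and the shape of the document value are assumptions, as is the function name build_ocr_payload. Only the constraints (0-based page indices, both image settings at least 1) and the model string come from the descriptions above. The Mistral-specific parameters are left out because their structure is not described.

```python
from typing import Any, Dict, List, Optional


def build_ocr_payload(
    document: Dict[str, Any],
    model: str = "mistral/mistral-ocr-latest",
    pages: Optional[List[int]] = None,
    include_image_base64: Optional[bool] = None,
    image_limit: Optional[int] = None,
    image_min_size: Optional[int] = None,
) -> Dict[str, Any]:
    """Assemble a JSON-serializable OCR request body.

    All key names are assumed for illustration; check them against
    the actual request schema before use.
    """
    # Page indices are 0-based, so negative values are rejected.
    if pages is not None and any(p < 0 for p in pages):
        raise ValueError("page indices must be >= 0")
    # Both image settings are documented with the range x >= 1.
    if image_limit is not None and image_limit < 1:
        raise ValueError("image_limit must be >= 1")
    if image_min_size is not None and image_min_size < 1:
        raise ValueError("image_min_size must be >= 1")

    payload: Dict[str, Any] = {"model": model, "document": document}
    optional = {
        "pages": pages,
        "include_image_base64": include_image_base64,
        "image_limit": image_limit,
        "image_min_size": image_min_size,
    }
    # Omit unset options so the server applies its own defaults
    # (for example, processing all pages when none are listed).
    payload.update({k: v for k, v in optional.items() if v is not None})
    return payload


# Hypothetical document reference; the real schema may differ.
example = build_ocr_payload(
    {"type": "document_url", "document_url": "https://example.com/sample.pdf"},
    pages=[0, 2],
    image_limit=5,
)
```

Validating the ranges on the client side is a design choice that surfaces mistakes before a request is sent; the server remains the authority on what it accepts.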
Successful Response
Response model for OCR processing.