OCR Model¶
Class: OCRModelBlockV1
Source: inference.core.workflows.core_steps.models.foundation.ocr.v1.OCRModelBlockV1
Retrieve the characters in an image using DocTR Optical Character Recognition (OCR).
This block returns the text within an image.
You may want to use this block in combination with a detections-based block (i.e. ObjectDetectionBlock). An object detection model could isolate specific regions from an image (i.e. a shipping container ID in a logistics use case) for further processing. You can then use a DynamicCropBlock to crop the region of interest before running OCR.
Using a detections model then cropping detections allows you to isolate your analysis on particular regions of an image.
Type identifier¶
Use the following identifier in step "type" field: roboflow_core/ocr_model@v1to add the block as
as step in your workflow.
Properties¶
| Name | Type | Description | Refs |
|---|---|---|---|
name |
str |
Unique name of step in workflows. | ❌ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to OCR Model in version v1.
- inputs:
Polygon Visualization,QR Code Generator,Halo Visualization,Camera Focus,SIFT Comparison,Classification Label Visualization,Circle Visualization,Mask Visualization,Morphological Transformation,Camera Calibration,Bounding Box Visualization,SIFT,Image Slicer,Image Preprocessing,Text Display,Image Slicer,Corner Visualization,Absolute Static Crop,Image Contours,Image Blur,Halo Visualization,Dynamic Crop,Dot Visualization,Polygon Zone Visualization,Reference Path Visualization,Label Visualization,Icon Visualization,Background Color Visualization,Image Threshold,Ellipse Visualization,Depth Estimation,Line Counter Visualization,Stability AI Outpainting,Contrast Enhancement,Triangle Visualization,Crop Visualization,Stitch Images,Blur Visualization,Perspective Correction,Morphological Transformation,Stability AI Image Generation,Relative Static Crop,Model Comparison Visualization,Color Visualization,Heatmap Visualization,Trace Visualization,Contrast Equalization,Camera Focus,Grid Visualization,Pixelate Visualization,Polygon Visualization,Image Convert Grayscale,Keypoint Visualization,Background Subtraction,Stability AI Inpainting - outputs:
Detections Stabilizer,Distance Measurement,Velocity,Instance Segmentation Model,Anthropic Claude,Google Vision OCR,Circle Visualization,Detections Filter,Google Gemini,Qwen 3.6 API,Detections Merge,CLIP Embedding Model,Byte Tracker,Roboflow Vision Events,Depth Estimation,Line Counter Visualization,Morphological Transformation,LMM,Model Comparison Visualization,MoonshotAI Kimi,Segment Anything 2 Model,Cache Set,Instance Segmentation Model,Twilio SMS/MMS Notification,OpenAI,Detections Classes Replacement,Twilio SMS Notification,SAM 3,Qwen-VL,S3 Sink,SAM 3,Halo Visualization,Local File Sink,Semantic Segmentation Model,SIFT Comparison,Multi-Label Classification Model,Camera Focus,Mask Visualization,Path Deviation,Anthropic Claude,MoonshotAI Kimi,Roboflow Dataset Upload,Text Display,Llama 3.2 Vision,PTZ Tracking (ONVIF),Path Deviation,GLM-OCR,Email Notification,Seg Preview,Dynamic Crop,Roboflow Custom Metadata,Detections Combine,Instance Segmentation Model,Mask Area Measurement,Overlap Analysis,Time in Zone,OpenRouter,Model Monitoring Inference Aggregator,Per-Class Confidence Filter,Webhook Sink,Google Gemma API,Stability AI Image Generation,SAM2 Video Tracker,Color Visualization,Heatmap Visualization,Contrast Equalization,YOLO-World Model,Google Gemini,Roboflow Dataset Upload,Slack Notification,Anthropic Claude,QR Code Generator,Detection Offset,Clip Comparison,OpenAI,Bounding Box Visualization,SAM 3,Florence-2 Model,Overlap Filter,Image Blur,Keypoint Detection Model,Detections Consensus,Dot Visualization,Polygon Zone Visualization,Label Visualization,Icon Visualization,Image Threshold,Cache Get,LMM For Classification,Object Detection Model,OC-SORT Tracker,Line Counter,Blur Visualization,Trace Visualization,Moondream2,Size Measurement,Perception Encoder Embedding Model,Florence-2 Model,CogVLM,Pixelate Visualization,Llama 3.2 Vision,Time in Zone,Byte Tracker,Keypoint Visualization,Polygon Visualization,Line Counter,Google Gemma,Classification Label Visualization,Morphological Transformation,Email Notification,Google Gemini,Image Preprocessing,Corner Visualization,Stitch OCR Detections,Detections Transformation,Halo Visualization,Byte Tracker,Roboflow Asset Library Attributes,ByteTrack Tracker,OpenAI,Reference Path Visualization,Detections Stitch,Background Color Visualization,Single-Label Classification Model,Ellipse Visualization,Stability AI Outpainting,Triangle Visualization,Crop Visualization,Qwen 3.5 API,Perspective Correction,Detection Event Log,Stitch OCR Detections,OpenAI-Compatible LLM,Detections List Roll-Up,BoT-SORT Tracker,Instance Segmentation Model,SORT Tracker,Time in Zone,Polygon Visualization,Pixel Color Count,OpenAI,Stability AI Inpainting
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
OCR Model in version v1 has.
Bindings
-
input
images(image): The image to infer on..
-
output
result(string): String value.predictions(object_detection_prediction): Prediction with detected bounding boxes in form of sv.Detections(...) object.parent_id(parent_id): Identifier of parent for step output.root_parent_id(parent_id): Identifier of parent for step output.prediction_type(prediction_type): String value with type of prediction.
Example JSON definition of step OCR Model in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/ocr_model@v1",
"images": "$inputs.image"
}