OCR Model¶
Class: OCRModelBlockV1
Source: inference.core.workflows.core_steps.models.foundation.ocr.v1.OCRModelBlockV1
Retrieve the characters in an image using DocTR Optical Character Recognition (OCR).
This block returns the text within an image.
You may want to use this block in combination with a detections-based block (i.e. ObjectDetectionBlock). An object detection model could isolate specific regions from an image (i.e. a shipping container ID in a logistics use case) for further processing. You can then use a DynamicCropBlock to crop the region of interest before running OCR.
Using a detections model then cropping detections allows you to isolate your analysis on particular regions of an image.
Type identifier¶
Use the following identifier in step "type" field: roboflow_core/ocr_model@v1to add the block as
as step in your workflow.
Properties¶
| Name | Type | Description | Refs |
|---|---|---|---|
name |
str |
Unique name of step in workflows. | ❌ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to OCR Model in version v1.
- inputs:
Stitch Images,Image Threshold,Stability AI Inpainting,Corner Visualization,Image Blur,Ellipse Visualization,Perspective Correction,Depth Estimation,Camera Calibration,Absolute Static Crop,Stability AI Image Generation,Grid Visualization,Background Color Visualization,Dynamic Crop,Image Slicer,Image Preprocessing,Relative Static Crop,Contrast Equalization,Background Subtraction,Morphological Transformation,Line Counter Visualization,SIFT,SIFT Comparison,Trace Visualization,Keypoint Visualization,Halo Visualization,Dot Visualization,Pixelate Visualization,Circle Visualization,Image Convert Grayscale,Icon Visualization,QR Code Generator,Halo Visualization,Camera Focus,Polygon Visualization,Color Visualization,Text Display,Reference Path Visualization,Crop Visualization,Mask Visualization,Heatmap Visualization,Triangle Visualization,Blur Visualization,Bounding Box Visualization,Label Visualization,Classification Label Visualization,Camera Focus,Polygon Visualization,Image Slicer,Image Contours,Model Comparison Visualization,Stability AI Outpainting,Polygon Zone Visualization - outputs:
Image Threshold,Email Notification,Corner Visualization,Roboflow Dataset Upload,Stitch OCR Detections,Stability AI Image Generation,Time in Zone,Dynamic Crop,Instance Segmentation Model,Image Preprocessing,Line Counter Visualization,Detections Combine,Trace Visualization,Halo Visualization,ByteTrack Tracker,Cache Get,Roboflow Custom Metadata,Pixelate Visualization,Circle Visualization,S3 Sink,Detections Classes Replacement,Twilio SMS Notification,Halo Visualization,Anthropic Claude,OC-SORT Tracker,Polygon Visualization,Detections Consensus,Roboflow Dataset Upload,Crop Visualization,Mask Visualization,Detection Offset,CLIP Embedding Model,Heatmap Visualization,Webhook Sink,Cache Set,Google Vision OCR,Detections List Roll-Up,Florence-2 Model,Florence-2 Model,Overlap Filter,Anthropic Claude,OpenAI,OpenAI,PTZ Tracking (ONVIF),Background Color Visualization,Anthropic Claude,SIFT Comparison,Keypoint Visualization,Time in Zone,Detections Filter,Stitch OCR Detections,LMM,Detections Merge,Detections Transformation,Perception Encoder Embedding Model,SAM 3,Seg Preview,Roboflow Vision Events,Detections Stitch,Triangle Visualization,Distance Measurement,Google Gemini,Path Deviation,Model Comparison Visualization,Stability AI Outpainting,Image Blur,Ellipse Visualization,OpenAI,Time in Zone,Depth Estimation,CogVLM,Google Gemini,Velocity,Morphological Transformation,LMM For Classification,Detection Event Log,Dot Visualization,GLM-OCR,Model Monitoring Inference Aggregator,Pixel Color Count,Icon Visualization,QR Code Generator,Detections Stabilizer,Camera Focus,SAM 3,Text Display,Reference Path Visualization,Instance Segmentation Model,Llama 3.2 Vision,SORT Tracker,Byte Tracker,Label Visualization,Classification Label Visualization,Byte Tracker,Segment Anything 2 Model,Polygon Zone Visualization,Stability AI Inpainting,Google Gemini,SAM 3,Perspective Correction,Size Measurement,Email Notification,Contrast Equalization,Line Counter,Path Deviation,Byte Tracker,Line Counter,Color Visualization,OpenAI,Local File Sink,Mask Area Measurement,Twilio SMS/MMS Notification,YOLO-World Model,Clip Comparison,Bounding Box Visualization,Blur Visualization,Polygon Visualization,Moondream2,Slack Notification
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
OCR Model in version v1 has.
Bindings
-
input
images(image): The image to infer on..
-
output
result(string): String value.predictions(object_detection_prediction): Prediction with detected bounding boxes in form of sv.Detections(...) object.parent_id(parent_id): Identifier of parent for step output.root_parent_id(parent_id): Identifier of parent for step output.prediction_type(prediction_type): String value with type of prediction.
Example JSON definition of step OCR Model in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/ocr_model@v1",
"images": "$inputs.image"
}