OCR Model¶
Class: OCRModelBlockV1
Source: inference.core.workflows.core_steps.models.foundation.ocr.v1.OCRModelBlockV1
Retrieve the characters in an image using DocTR Optical Character Recognition (OCR).
This block returns the text within an image.
You may want to use this block in combination with a detections-based block (i.e. ObjectDetectionBlock). An object detection model could isolate specific regions from an image (i.e. a shipping container ID in a logistics use case) for further processing. You can then use a DynamicCropBlock to crop the region of interest before running OCR.
Using a detections model then cropping detections allows you to isolate your analysis on particular regions of an image.
Type identifier¶
Use the following identifier in step "type" field: roboflow_core/ocr_model@v1to add the block as
as step in your workflow.
Properties¶
| Name | Type | Description | Refs |
|---|---|---|---|
name |
str |
Unique name of step in workflows. | ❌ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to OCR Model in version v1.
- inputs:
Background Color Visualization,Bounding Box Visualization,Polygon Zone Visualization,Line Counter Visualization,Ellipse Visualization,Polygon Visualization,Image Preprocessing,Triangle Visualization,SIFT,SIFT Comparison,Circle Visualization,Relative Static Crop,Absolute Static Crop,Polygon Visualization,Trace Visualization,Camera Calibration,Camera Focus,Camera Focus,Pixelate Visualization,Stability AI Outpainting,Heatmap Visualization,Stability AI Inpainting,Mask Visualization,Text Display,Contrast Equalization,Keypoint Visualization,Reference Path Visualization,Classification Label Visualization,Dot Visualization,Image Slicer,Color Visualization,Label Visualization,Dynamic Crop,Stitch Images,Halo Visualization,Image Contours,Image Threshold,Blur Visualization,Perspective Correction,Grid Visualization,Icon Visualization,Stability AI Image Generation,Morphological Transformation,Depth Estimation,Image Convert Grayscale,Background Subtraction,Image Slicer,QR Code Generator,Crop Visualization,Model Comparison Visualization,Image Blur,Corner Visualization,Halo Visualization - outputs:
Local File Sink,Background Color Visualization,Overlap Filter,Roboflow Dataset Upload,Polygon Zone Visualization,Path Deviation,Line Counter Visualization,Image Preprocessing,SAM 3,Instance Segmentation Model,SIFT Comparison,Segment Anything 2 Model,LMM,Clip Comparison,Stability AI Outpainting,Heatmap Visualization,Anthropic Claude,Text Display,Mask Visualization,Detection Event Log,OC-SORT Tracker,Roboflow Custom Metadata,Email Notification,OpenAI,Blur Visualization,Detections Combine,SAM 3,GLM-OCR,Velocity,Time in Zone,QR Code Generator,Mask Area Measurement,Webhook Sink,SAM 3,Twilio SMS Notification,CogVLM,S3 Sink,Line Counter,Detection Offset,Byte Tracker,Polygon Visualization,Trace Visualization,Pixel Color Count,Camera Focus,Pixelate Visualization,LMM For Classification,Reference Path Visualization,Line Counter,Contrast Equalization,Classification Label Visualization,Dynamic Crop,Halo Visualization,Anthropic Claude,Perspective Correction,Email Notification,Depth Estimation,Model Comparison Visualization,Instance Segmentation Model,Image Blur,ByteTrack Tracker,Seg Preview,OpenAI,Path Deviation,Google Vision OCR,Cache Set,Bounding Box Visualization,Ellipse Visualization,Size Measurement,Triangle Visualization,Florence-2 Model,Llama 3.2 Vision,Detections Stitch,YOLO-World Model,SORT Tracker,Label Visualization,Florence-2 Model,Time in Zone,Time in Zone,PTZ Tracking (ONVIF),Icon Visualization,Stability AI Image Generation,Anthropic Claude,Cache Get,Perception Encoder Embedding Model,Google Gemini,Distance Measurement,Google Gemini,Byte Tracker,Moondream2,Crop Visualization,OpenAI,Corner Visualization,Stitch OCR Detections,Stitch OCR Detections,Detections Stabilizer,Byte Tracker,Polygon Visualization,Detections Transformation,Model Monitoring Inference Aggregator,Circle Visualization,Slack Notification,OpenAI,Detections Filter,Stability AI Inpainting,Keypoint Visualization,Detections Merge,Twilio SMS/MMS Notification,Dot Visualization,Color Visualization,CLIP Embedding Model,Image Threshold,Detections Classes Replacement,Detections List Roll-Up,Morphological Transformation,Detections Consensus,Google Gemini,Halo Visualization,Roboflow Dataset Upload
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
OCR Model in version v1 has.
Bindings
-
input
images(image): The image to infer on..
-
output
result(string): String value.predictions(object_detection_prediction): Prediction with detected bounding boxes in form of sv.Detections(...) object.parent_id(parent_id): Identifier of parent for step output.root_parent_id(parent_id): Identifier of parent for step output.prediction_type(prediction_type): String value with type of prediction.
Example JSON definition of step OCR Model in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/ocr_model@v1",
"images": "$inputs.image"
}