OCR Model¶
Class: OCRModelBlockV1
Source: inference.core.workflows.core_steps.models.foundation.ocr.v1.OCRModelBlockV1
Retrieve the characters in an image using DocTR Optical Character Recognition (OCR).
This block returns the text within an image.
You may want to use this block in combination with a detections-based block (i.e. ObjectDetectionBlock). An object detection model could isolate specific regions from an image (i.e. a shipping container ID in a logistics use case) for further processing. You can then use a DynamicCropBlock to crop the region of interest before running OCR.
Using a detections model then cropping detections allows you to isolate your analysis on particular regions of an image.
Type identifier¶
Use the following identifier in step "type" field: roboflow_core/ocr_model@v1to add the block as
as step in your workflow.
Properties¶
| Name | Type | Description | Refs |
|---|---|---|---|
name |
str |
Unique name of step in workflows. | ❌ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to OCR Model in version v1.
- inputs:
Contrast Equalization,Image Contours,Image Slicer,Depth Estimation,Polygon Visualization,QR Code Generator,Image Blur,SIFT Comparison,Stitch Images,Dynamic Crop,Bounding Box Visualization,Text Display,Model Comparison Visualization,Camera Focus,SIFT,Line Counter Visualization,Blur Visualization,Morphological Transformation,Camera Calibration,Polygon Zone Visualization,Mask Visualization,Relative Static Crop,Keypoint Visualization,Circle Visualization,Camera Focus,Trace Visualization,Pixelate Visualization,Color Visualization,Absolute Static Crop,Image Slicer,Stability AI Inpainting,Reference Path Visualization,Dot Visualization,Label Visualization,Perspective Correction,Ellipse Visualization,Crop Visualization,Halo Visualization,Image Threshold,Grid Visualization,Image Convert Grayscale,Corner Visualization,Image Preprocessing,Classification Label Visualization,Background Color Visualization,Stability AI Outpainting,Icon Visualization,Triangle Visualization,Stability AI Image Generation,Background Subtraction - outputs:
Contrast Equalization,Llama 3.2 Vision,Clip Comparison,Anthropic Claude,Detections Transformation,Local File Sink,Polygon Visualization,QR Code Generator,Time in Zone,Image Blur,SIFT Comparison,Email Notification,CLIP Embedding Model,Roboflow Dataset Upload,Text Display,Model Comparison Visualization,Camera Focus,PTZ Tracking (ONVIF).md),Moondream2,LMM,Byte Tracker,Google Vision OCR,SAM 3,Mask Visualization,Anthropic Claude,Detections Merge,Circle Visualization,Seg Preview,Stability AI Inpainting,Pixelate Visualization,Time in Zone,Reference Path Visualization,Detection Offset,Time in Zone,Detections Filter,Instance Segmentation Model,Detections Combine,Perspective Correction,Halo Visualization,Image Threshold,Crop Visualization,Path Deviation,Ellipse Visualization,Overlap Filter,Florence-2 Model,Twilio SMS Notification,Detections Stabilizer,Perception Encoder Embedding Model,Corner Visualization,Line Counter,Image Preprocessing,Detections List Roll-Up,Icon Visualization,SAM 3,Segment Anything 2 Model,Detections Consensus,Cache Set,Depth Estimation,Pixel Color Count,Detections Stitch,Dynamic Crop,Bounding Box Visualization,Cache Get,Model Monitoring Inference Aggregator,YOLO-World Model,Detection Event Log,Instance Segmentation Model,Detections Classes Replacement,Line Counter Visualization,Morphological Transformation,Blur Visualization,Polygon Zone Visualization,Line Counter,Email Notification,Keypoint Visualization,Distance Measurement,Path Deviation,Roboflow Custom Metadata,OpenAI,Google Gemini,OpenAI,Trace Visualization,CogVLM,Size Measurement,Color Visualization,Byte Tracker,Dot Visualization,Label Visualization,Slack Notification,Florence-2 Model,Google Gemini,Google Gemini,LMM For Classification,OpenAI,Stitch OCR Detections,OpenAI,Classification Label Visualization,Stitch OCR Detections,Background Color Visualization,Roboflow Dataset Upload,Stability AI Outpainting,Twilio SMS/MMS Notification,SAM 3,Anthropic Claude,Byte Tracker,Triangle Visualization,Stability AI Image Generation,Webhook Sink,Velocity
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
OCR Model in version v1 has.
Bindings
-
input
images(image): The image to infer on..
-
output
result(string): String value.predictions(object_detection_prediction): Prediction with detected bounding boxes in form of sv.Detections(...) object.parent_id(parent_id): Identifier of parent for step output.root_parent_id(parent_id): Identifier of parent for step output.prediction_type(prediction_type): String value with type of prediction.
Example JSON definition of step OCR Model in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/ocr_model@v1",
"images": "$inputs.image"
}