OCR Model
Class: OCRModelBlockV1
Source: inference.core.workflows.core_steps.models.foundation.ocr.v1.OCRModelBlockV1
Retrieve the characters in an image using DocTR Optical Character Recognition (OCR).
This block returns the text within an image.
You may want to use this block in combination with a detections-based block (e.g. ObjectDetectionBlock). An object detection model can isolate specific regions of an image (e.g. a shipping container ID in a logistics use case) for further processing. You can then use a DynamicCropBlock to crop the region of interest before running OCR.
Running a detection model and then cropping its detections lets you focus your analysis on particular regions of an image.
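The detect, crop, then read pattern described above could be wired together roughly as follows. This is a minimal sketch, not a verified definition: the step names, the `yolov8n-640` model ID, the output name `text`, and the exact type identifiers and field names of the detection and crop steps are assumptions to check against those blocks' own pages. Only `roboflow_core/ocr_model@v1`, its `images` input, and its `result` output come from this page. The helper `unresolved_step_refs` is hypothetical and only checks that every `$steps.<name>.` selector points at a step that exists.

```python
import re

# Hypothetical workflow: detect regions, crop them, then run OCR on the crops.
WORKFLOW = {
    "version": "1.0",
    "inputs": [{"type": "WorkflowImage", "name": "image"}],
    "steps": [
        {
            # Assumed identifier and fields for the object detection block.
            "type": "roboflow_core/roboflow_object_detection_model@v1",
            "name": "detector",
            "images": "$inputs.image",
            "model_id": "yolov8n-640",  # placeholder model ID
        },
        {
            # Assumed identifier and fields for the dynamic crop block.
            "type": "roboflow_core/dynamic_crop@v1",
            "name": "crop",
            "images": "$inputs.image",
            "predictions": "$steps.detector.predictions",
        },
        {
            # The OCR block documented on this page, fed with the crops.
            "type": "roboflow_core/ocr_model@v1",
            "name": "ocr",
            "images": "$steps.crop.crops",
        },
    ],
    "outputs": [
        {"type": "JsonField", "name": "text", "selector": "$steps.ocr.result"},
    ],
}

STEP_REF = re.compile(r"^\$steps\.([^.]+)\.")


def unresolved_step_refs(workflow):
    """Return selectors that reference a step name missing from the workflow."""
    names = {step["name"] for step in workflow["steps"]}
    selectors = [
        value
        for step in workflow["steps"]
        for value in step.values()
        if isinstance(value, str)
    ]
    selectors += [output["selector"] for output in workflow["outputs"]]
    return [
        selector
        for selector in selectors
        if (match := STEP_REF.match(selector)) and match.group(1) not in names
    ]
```

A check like this catches a mistyped step name before the definition is sent anywhere. It does not validate field names or kinds.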
Type identifier
Use the following identifier in the step "type" field: roboflow_core/ocr_model@v1 to add the block as
a step in your workflow.
Properties
| Name | Type | Description | Refs |
|---|---|---|---|
| name | str | Unique name of step in workflows. | ❌ |
The Refs column indicates whether a property can be parametrised with dynamic values available
at workflow runtime. See Bindings for more info.
Available Connections
Compatible Blocks
Check what blocks you can connect to OCR Model in version v1.
- inputs: Mask Visualization, Circle Visualization, Classification Label Visualization, Halo Visualization, Blur Visualization, QR Code Generator, Dynamic Crop, Label Visualization, Image Blur, Corner Visualization, Image Convert Grayscale, Ellipse Visualization, SIFT, Image Preprocessing, Stability AI Outpainting, Halo Visualization, Stability AI Inpainting, Image Threshold, Background Color Visualization, Image Contours, Depth Estimation, Model Comparison Visualization, Trace Visualization, Morphological Transformation, Triangle Visualization, Absolute Static Crop, Relative Static Crop, Text Display, Stitch Images, Camera Calibration, Grid Visualization, Camera Focus, Perspective Correction, Color Visualization, Dot Visualization, Image Slicer, Pixelate Visualization, Polygon Visualization, Stability AI Image Generation, Reference Path Visualization, Keypoint Visualization, Polygon Visualization, Line Counter Visualization, Bounding Box Visualization, Contrast Equalization, Polygon Zone Visualization, SIFT Comparison, Camera Focus, Icon Visualization, Crop Visualization, Background Subtraction, Image Slicer
- outputs: Anthropic Claude, Mask Visualization, Classification Label Visualization, Instance Segmentation Model, Detections Consensus, Detections Merge, Webhook Sink, Email Notification, QR Code Generator, Dynamic Crop, CLIP Embedding Model, Google Gemini, LMM, SAM 3, Path Deviation, Image Blur, Detection Offset, Corner Visualization, Line Counter, Byte Tracker, Stability AI Outpainting, Cache Set, Segment Anything 2 Model, Halo Visualization, Stability AI Inpainting, Path Deviation, Trace Visualization, Google Vision OCR, Morphological Transformation, Triangle Visualization, Instance Segmentation Model, Detections Stitch, Text Display, Google Gemini, Slack Notification, Local File Sink, Detections Filter, Detections Stabilizer, Roboflow Dataset Upload, PTZ Tracking (ONVIF), Camera Focus, Color Visualization, Dot Visualization, Detections Combine, Polygon Visualization, Anthropic Claude, Llama 3.2 Vision, Line Counter Visualization, LMM For Classification, Byte Tracker, Contrast Equalization, Distance Measurement, Detections Classes Replacement, SIFT Comparison, Perception Encoder Embedding Model, Time in Zone, Velocity, Circle Visualization, Moondream2, Seg Preview, Halo Visualization, Florence-2 Model, Blur Visualization, Twilio SMS/MMS Notification, Label Visualization, Clip Comparison, Email Notification, Ellipse Visualization, OpenAI, Byte Tracker, Image Preprocessing, Model Monitoring Inference Aggregator, SAM 3, Detections List Roll-Up, OpenAI, Image Threshold, Model Comparison Visualization, Background Color Visualization, Size Measurement, OpenAI, Depth Estimation, Cache Get, Line Counter, Time in Zone, CogVLM, Roboflow Custom Metadata, Stitch OCR Detections, Perspective Correction, Anthropic Claude, Pixelate Visualization, Stability AI Image Generation, Reference Path Visualization, Keypoint Visualization, Twilio SMS Notification, Polygon Visualization, SAM 3, Bounding Box Visualization, Detection Event Log, Detections Transformation, Polygon Zone Visualization, YOLO-World Model, Overlap Filter, Icon Visualization, Time in Zone, Stitch OCR Detections, Crop Visualization, Google Gemini, Pixel Color Count, OpenAI, Florence-2 Model, Roboflow Dataset Upload
Input and Output Bindings
The available connections depend on the block's binding kinds. The binding kinds of
OCR Model in version v1 are listed below.
Bindings
- input
    - images (image): The image to infer on.
- output
    - result (string): String value.
    - predictions (object_detection_prediction): Prediction with detected bounding boxes in form of sv.Detections(...) object.
    - parent_id (parent_id): Identifier of parent for step output.
    - root_parent_id (parent_id): Identifier of parent for step output.
    - prediction_type (prediction_type): String value with type of prediction.
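As a rough illustration of consuming the `result` output: assuming the workflow exposes it under an output name such as `text` (a hypothetical name), and that results arrive as one dictionary per input image, the recognized strings could be gathered like this. When OCR runs on crops, the field may hold a list of strings rather than a single string, so the helper accepts both shapes. The sample data is made up.

```python
from typing import Any, Dict, List


def collect_text(results: List[Dict[str, Any]], field: str = "text") -> List[List[str]]:
    """Return, for each input image, the non-empty OCR strings found under `field`."""
    collected = []
    for entry in results:
        value = entry.get(field)
        if value is None:
            items = []
        elif isinstance(value, str):
            items = [value]  # single read for the whole image
        else:
            items = list(value)  # one read per crop
        collected.append(
            [text.strip() for text in items if isinstance(text, str) and text.strip()]
        )
    return collected


# Made-up results for two images: one whole-image read, one read per crop.
sample = [
    {"text": "ABCU 123456 7"},
    {"text": ["MSKU 765432 1", "", "  TGHU 111111 0 "]},
]
print(collect_text(sample))
```

Empty strings are dropped because a crop that contains no legible text would otherwise add blank entries.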
Example JSON definition of step OCR Model in version v1
{
    "name": "<your_step_name_here>",
    "type": "roboflow_core/ocr_model@v1",
    "images": "$inputs.image"
}
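To try this step definition, it has to sit inside a complete workflow definition. The sketch below wraps it in one and, assuming the `inference_sdk` package and a reachable inference server, sends it for execution. The server URL, API key, image path, and output name `ocr_text` are placeholders, and the `run_workflow(specification=..., images=...)` call should be checked against the SDK version you have installed.

```python
def build_ocr_workflow(step_name: str = "ocr") -> dict:
    """Wrap the OCR step in a minimal workflow with one image input and one output."""
    return {
        "version": "1.0",
        "inputs": [{"type": "WorkflowImage", "name": "image"}],
        "steps": [
            {
                "name": step_name,
                "type": "roboflow_core/ocr_model@v1",
                "images": "$inputs.image",
            },
        ],
        "outputs": [
            # `ocr_text` is a hypothetical output name chosen for this sketch.
            {
                "type": "JsonField",
                "name": "ocr_text",
                "selector": f"$steps.{step_name}.result",
            },
        ],
    }


def run_ocr(image_path: str, api_url: str, api_key: str):
    """Send the workflow to an inference server (needs network access)."""
    # Imported lazily so that building the definition works without the SDK.
    from inference_sdk import InferenceHTTPClient

    client = InferenceHTTPClient(api_url=api_url, api_key=api_key)
    return client.run_workflow(
        specification=build_ocr_workflow(),
        images={"image": image_path},
    )
```

Keeping the definition in a function makes it easy to reuse the same step under a different name without editing JSON by hand.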