OCR Model¶
Class: OCRModelBlockV1
Source: inference.core.workflows.core_steps.models.foundation.ocr.v1.OCRModelBlockV1
Retrieve the characters in an image using Optical Character Recognition (OCR).
This block returns the text within an image.
You may want to use this block in combination with a detections-based block (i.e. ObjectDetectionBlock). An object detection model could isolate specific regions from an image (i.e. a shipping container ID in a logistics use case) for further processing. You can then use a DynamicCropBlock to crop the region of interest before running OCR.
Using a detections model then cropping detections allows you to isolate your analysis on particular regions of an image.
Type identifier¶
Use the following identifier in step "type"
field: roboflow_core/ocr_model@v1
to add the block as
as step in your workflow.
Properties¶
Name | Type | Description | Refs |
---|---|---|---|
name |
str |
Unique name of step in workflows. | ❌ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow
runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to OCR Model
in version v1
.
- inputs:
Image Convert Grayscale
,Absolute Static Crop
,Relative Static Crop
,Label Visualization
,Line Counter Visualization
,Background Color Visualization
,Stitch Images
,Camera Focus
,Image Contours
,Image Preprocessing
,Image Slicer
,Reference Path Visualization
,SIFT Comparison
,Triangle Visualization
,Grid Visualization
,Polygon Zone Visualization
,Keypoint Visualization
,Depth Estimation
,Bounding Box Visualization
,Image Blur
,Perspective Correction
,Halo Visualization
,Ellipse Visualization
,Color Visualization
,Crop Visualization
,Dot Visualization
,Pixelate Visualization
,Model Comparison Visualization
,Classification Label Visualization
,Camera Calibration
,Stability AI Image Generation
,Polygon Visualization
,Trace Visualization
,Corner Visualization
,Image Threshold
,Blur Visualization
,Image Slicer
,Stability AI Inpainting
,Mask Visualization
,SIFT
,Circle Visualization
,Dynamic Crop
- outputs:
Size Measurement
,LMM
,Roboflow Custom Metadata
,Distance Measurement
,Line Counter Visualization
,Background Color Visualization
,Reference Path Visualization
,Instance Segmentation Model
,SIFT Comparison
,Triangle Visualization
,Path Deviation
,Line Counter
,Llama 3.2 Vision
,Roboflow Dataset Upload
,Google Vision OCR
,Clip Comparison
,Perspective Correction
,Crop Visualization
,Webhook Sink
,Dot Visualization
,Email Notification
,Model Comparison Visualization
,Classification Label Visualization
,Instance Segmentation Model
,Slack Notification
,Stability AI Image Generation
,Cache Set
,Time in Zone
,Line Counter
,Time in Zone
,Corner Visualization
,Trace Visualization
,Image Threshold
,Local File Sink
,CogVLM
,Stability AI Inpainting
,Circle Visualization
,OpenAI
,Path Deviation
,Florence-2 Model
,Twilio SMS Notification
,Label Visualization
,Image Preprocessing
,Detections Stitch
,Polygon Zone Visualization
,Keypoint Visualization
,LMM For Classification
,Bounding Box Visualization
,Image Blur
,OpenAI
,CLIP Embedding Model
,Google Gemini
,Halo Visualization
,Ellipse Visualization
,Color Visualization
,Pixel Color Count
,YOLO-World Model
,Roboflow Dataset Upload
,Polygon Visualization
,Segment Anything 2 Model
,Cache Get
,Model Monitoring Inference Aggregator
,Mask Visualization
,Anthropic Claude
,Florence-2 Model
,Dynamic Crop
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
OCR Model
in version v1
has.
Bindings
-
input
images
(image
): The image to infer on..
-
output
result
(string
): String value.parent_id
(parent_id
): Identifier of parent for step output.root_parent_id
(parent_id
): Identifier of parent for step output.prediction_type
(prediction_type
): String value with type of prediction.
Example JSON definition of step OCR Model
in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/ocr_model@v1",
"images": "$inputs.image"
}