Google Vision OCR¶
Class: GoogleVisionOCRBlockV1
Source: inference.core.workflows.core_steps.models.foundation.google_vision_ocr.v1.GoogleVisionOCRBlockV1
Detect text in images using Google Vision OCR.
Supported types of text detection:
text_detection: optimized for areas of text within a larger image.ocr_text_detection: optimized for dense text documents.
Provide your Google Vision API key or set the value to rf_key:account (or
rf_key:user:<id>) to proxy requests through Roboflow's API.
Type identifier¶
Use the following identifier in step "type" field: roboflow_core/google_vision_ocr@v1to add the block as
as step in your workflow.
Properties¶
| Name | Type | Description | Refs |
|---|---|---|---|
name |
str |
Enter a unique identifier for this step.. | ❌ |
ocr_type |
str |
Type of OCR to use. | ❌ |
api_key |
str |
Your Google Vision API key. | ✅ |
language_hints |
List[str] |
Optional list of language codes to pass to the OCR API. If not provided, the API will attempt to detect the language automatically.If provided, language codes must be supported by the OCR API, visit https://cloud.google.com/vision/docs/languages for list of supported language codes.. | ❌ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to Google Vision OCR in version v1.
- inputs:
Anthropic Claude,Mask Visualization,Classification Label Visualization,Instance Segmentation Model,Webhook Sink,Multi-Label Classification Model,Email Notification,QR Code Generator,Dynamic Crop,VLM As Detector,Google Gemini,LMM,Image Blur,Corner Visualization,Image Convert Grayscale,Stability AI Outpainting,Halo Visualization,Stability AI Inpainting,Object Detection Model,Image Contours,Trace Visualization,Google Vision OCR,Morphological Transformation,Triangle Visualization,Relative Static Crop,CSV Formatter,Text Display,Stitch Images,Google Gemini,Camera Calibration,Grid Visualization,Local File Sink,Slack Notification,VLM As Classifier,Roboflow Dataset Upload,Camera Focus,Color Visualization,Dot Visualization,Image Slicer,Polygon Visualization,Anthropic Claude,LMM For Classification,Line Counter Visualization,Llama 3.2 Vision,Keypoint Detection Model,Contrast Equalization,SIFT Comparison,Camera Focus,Background Subtraction,Image Slicer,Circle Visualization,Halo Visualization,Florence-2 Model,Blur Visualization,Label Visualization,Twilio SMS/MMS Notification,Clip Comparison,Email Notification,Ellipse Visualization,OpenAI,SIFT,Image Preprocessing,Model Monitoring Inference Aggregator,Single-Label Classification Model,OpenAI,Image Threshold,Background Color Visualization,Model Comparison Visualization,Depth Estimation,OpenAI,CogVLM,Absolute Static Crop,Roboflow Custom Metadata,EasyOCR,Stitch OCR Detections,Perspective Correction,Anthropic Claude,Pixelate Visualization,Stability AI Image Generation,Reference Path Visualization,Keypoint Visualization,Polygon Visualization,Twilio SMS Notification,Bounding Box Visualization,Polygon Zone Visualization,OCR Model,Icon Visualization,Crop Visualization,Stitch OCR Detections,Google Gemini,OpenAI,Florence-2 Model,Roboflow Dataset Upload - outputs:
Mask Visualization,Classification Label Visualization,Instance Segmentation Model,Detections Consensus,Detections Merge,Webhook Sink,Email Notification,QR Code Generator,LMM,SAM 3,Detection Offset,Corner Visualization,Stability AI Outpainting,Segment Anything 2 Model,Halo Visualization,Trace Visualization,Google Vision OCR,Instance Segmentation Model,Text Display,Google Gemini,Slack Notification,Local File Sink,Roboflow Dataset Upload,PTZ Tracking (ONVIF).md),Color Visualization,Dot Visualization,Polygon Visualization,Anthropic Claude,Byte Tracker,Contrast Equalization,Detections Classes Replacement,Perception Encoder Embedding Model,Velocity,Moondream2,Halo Visualization,Florence-2 Model,Blur Visualization,Twilio SMS/MMS Notification,Label Visualization,Ellipse Visualization,OpenAI,Model Monitoring Inference Aggregator,Detections List Roll-Up,OpenAI,Image Threshold,Model Comparison Visualization,Background Color Visualization,Size Measurement,OpenAI,Twilio SMS Notification,Polygon Visualization,SAM 3,Bounding Box Visualization,Overlap Filter,Icon Visualization,Time in Zone,Google Gemini,Florence-2 Model,Roboflow Dataset Upload,Anthropic Claude,Dynamic Crop,CLIP Embedding Model,Google Gemini,Path Deviation,Image Blur,Line Counter,Byte Tracker,Cache Set,Stability AI Inpainting,Path Deviation,Morphological Transformation,Triangle Visualization,Detections Stitch,Detections Filter,Detections Stabilizer,Camera Focus,Detections Combine,Llama 3.2 Vision,Line Counter Visualization,LMM For Classification,Distance Measurement,SIFT Comparison,Time in Zone,Circle Visualization,Seg Preview,Clip Comparison,Email Notification,Byte Tracker,Image Preprocessing,SAM 3,Depth Estimation,Cache Get,Line Counter,Time in Zone,CogVLM,Roboflow Custom Metadata,Stitch OCR Detections,Perspective Correction,Anthropic Claude,Pixelate Visualization,Stability AI Image Generation,Reference Path Visualization,Keypoint Visualization,Detection Event Log,Polygon Zone Visualization,YOLO-World Model,Stitch OCR Detections,Crop Visualization,Pixel Color Count,OpenAI,Detections Transformation
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
Google Vision OCR in version v1 has.
Bindings
-
input
image(image): Image to run OCR.api_key(Union[secret,string,ROBOFLOW_MANAGED_KEY]): Your Google Vision API key.
-
output
text(string): String value.language(string): String value.predictions(object_detection_prediction): Prediction with detected bounding boxes in form of sv.Detections(...) object.
Example JSON definition of step Google Vision OCR in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/google_vision_ocr@v1",
"image": "$inputs.image",
"ocr_type": "<block_does_not_provide_example>",
"api_key": "xxx-xxx",
"language_hints": [
"en",
"fr"
]
}