Google Vision OCR
Class: GoogleVisionOCRBlockV1
Source: inference.core.workflows.core_steps.models.foundation.google_vision_ocr.v1.GoogleVisionOCRBlockV1
Detect text in images using Google Vision OCR.
Supported types of text detection:
- text_detection: optimized for areas of text within a larger image.
- ocr_text_detection: optimized for dense text documents.
Provide your Google Vision API key or set the value to rf_key:account (or
rf_key:user:<id>) to proxy requests through Roboflow's API.
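The two OCR modes and the two ways of supplying a key can be sketched in Python as a small builder for the step definition. The helper names `build_ocr_step` and `uses_roboflow_proxy` are hypothetical and not part of the inference package; only the field names, the two OCR types, and the `rf_key:` values come from this page.

```python
from typing import Any, Dict, List, Optional

# OCR modes listed on this page.
SUPPORTED_OCR_TYPES = ("text_detection", "ocr_text_detection")

USER_KEY_PREFIX = "rf_key:user:"


def uses_roboflow_proxy(api_key: str) -> bool:
    """Return True when the value asks Roboflow to proxy the request."""
    if api_key == "rf_key:account":
        return True
    # rf_key:user:<id> needs a non-empty id after the prefix.
    return api_key.startswith(USER_KEY_PREFIX) and len(api_key) > len(USER_KEY_PREFIX)


def build_ocr_step(
    name: str,
    image: str = "$inputs.image",
    ocr_type: str = "text_detection",
    api_key: str = "rf_key:account",
    language_hints: Optional[List[str]] = None,
) -> Dict[str, Any]:
    """Build a step definition dict (hypothetical helper, for illustration)."""
    if ocr_type not in SUPPORTED_OCR_TYPES:
        raise ValueError(f"ocr_type must be one of {SUPPORTED_OCR_TYPES}, got {ocr_type!r}")
    step: Dict[str, Any] = {
        "name": name,
        "type": "roboflow_core/google_vision_ocr@v1",
        "image": image,
        "ocr_type": ocr_type,
        "api_key": api_key,
    }
    # Omit language_hints so the API can detect the language automatically.
    if language_hints:
        step["language_hints"] = list(language_hints)
    return step


step = build_ocr_step("ocr", ocr_type="ocr_text_detection", language_hints=["en", "fr"])
print(step)
```

Here the default `api_key` is the proxy value `rf_key:account`; pass your own Google Vision API key instead to call Google directly.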
Type identifier
Use the following identifier in the step "type" field: roboflow_core/google_vision_ocr@v1 to add the block as
a step in your workflow.
Properties
| Name | Type | Description | Refs |
|---|---|---|---|
| name | str | Enter a unique identifier for this step. | ❌ |
| ocr_type | str | Type of OCR to use. | ❌ |
| api_key | str | Your Google Vision API key. | ✅ |
| language_hints | List[str] | Optional list of language codes to pass to the OCR API. If not provided, the API will attempt to detect the language automatically. If provided, language codes must be supported by the OCR API; see https://cloud.google.com/vision/docs/languages for the list of supported language codes. | ❌ |
The Refs column marks whether a property can be parametrised with dynamic values available
at workflow runtime. See Bindings for more info.
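As a rough illustration of the Refs column, the sketch below binds api_key (marked ✅) to a runtime input while ocr_type (marked ❌) stays a literal. The input name `google_api_key` and the `WorkflowImage` / `WorkflowParameter` input types are assumptions about the surrounding workflow definition, not something this page specifies.

```python
# Assumed workflow inputs; the parameter name "google_api_key" is hypothetical.
inputs = [
    {"type": "WorkflowImage", "name": "image"},
    {"type": "WorkflowParameter", "name": "google_api_key"},
]

step = {
    "name": "ocr",
    "type": "roboflow_core/google_vision_ocr@v1",
    "image": "$inputs.image",
    "ocr_type": "text_detection",         # Refs: no, so a literal value
    "api_key": "$inputs.google_api_key",  # Refs: yes, so a selector is allowed
}

# Collect the fields whose values are selectors resolved at runtime.
dynamic_fields = {
    key: value
    for key, value in step.items()
    if isinstance(value, str) and value.startswith("$")
}
print(dynamic_fields)
```

Binding the key this way keeps the secret out of the saved workflow definition; it is supplied with each run instead.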
Available Connections
Compatible Blocks
Check what blocks you can connect to Google Vision OCR in version v1.
- inputs:
Stability AI Outpainting,OpenAI-Compatible LLM,Morphological Transformation,Contrast Enhancement,Crop Visualization,Camera Focus,Blur Visualization,Image Preprocessing,Corner Visualization,Ellipse Visualization,Mask Visualization,Stability AI Image Generation,Qwen-VL,Object Detection Model,Heatmap Visualization,Stitch OCR Detections,Google Gemma API,Roboflow Vision Events,Image Slicer,Qwen 3.5 API,Trace Visualization,Background Color Visualization,Slack Notification,Anthropic Claude,Qwen 3.6 API,OpenAI,Webhook Sink,Email Notification,Color Visualization,Bounding Box Visualization,Keypoint Visualization,Model Comparison Visualization,Google Gemma,Relative Static Crop,CogVLM,Llama 3.2 Vision,Polygon Zone Visualization,Qwen3.5-VL,Dynamic Crop,Camera Focus,Polygon Visualization,QR Code Generator,Google Vision OCR,GLM-OCR,Stitch Images,Google Gemini,Llama 3.2 Vision,OpenRouter,Twilio SMS Notification,Image Blur,Model Monitoring Inference Aggregator,Clip Comparison,Anthropic Claude,Image Slicer,Depth Estimation,OpenAI,Multi-Label Classification Model,Google Gemini,Classification Label Visualization,Pixelate Visualization,EasyOCR,SIFT,Florence-2 Model,MoonshotAI Kimi,Contrast Equalization,Image Threshold,Instance Segmentation Model,MoonshotAI Kimi,Dot Visualization,Polygon Visualization,Background Subtraction,Roboflow Dataset Upload,Anthropic Claude,Halo Visualization,Stability AI Inpainting,Roboflow Custom Metadata,Florence-2 Model,Label Visualization,Local File Sink,Icon Visualization,Single-Label Classification Model,Image Contours,OpenAI,Absolute Static Crop,Google Gemini,Grid Visualization,VLM As Classifier,Camera Calibration,Halo Visualization,Email Notification,OpenAI,LMM,LMM For Classification,Text Display,Image Convert Grayscale,Reference Path Visualization,Circle Visualization,Line Counter Visualization,Stitch OCR Detections,OCR Model,Keypoint Detection Model,SIFT Comparison,VLM As Detector,Morphological Transformation,Twilio SMS/MMS Notification,Roboflow 
Dataset Upload,CSV Formatter,S3 Sink,Triangle Visualization,Perspective Correction
- outputs:
Stability AI Outpainting,CLIP Embedding Model,SAM 3,Camera Focus,Seg Preview,Corner Visualization,Ellipse Visualization,Image Preprocessing,Roboflow Vision Events,Heatmap Visualization,Trace Visualization,OC-SORT Tracker,Time in Zone,OpenAI,Email Notification,Byte Tracker,Keypoint Visualization,Detections Consensus,Model Comparison Visualization,YOLO-World Model,Polygon Zone Visualization,Byte Tracker,Dynamic Crop,Polygon Visualization,QR Code Generator,GLM-OCR,OpenRouter,Model Monitoring Inference Aggregator,Clip Comparison,Image Blur,Cache Get,Detections Stitch,Time in Zone,Detections List Roll-Up,Segment Anything 2 Model,Instance Segmentation Model,Google Gemini,Pixelate Visualization,Contrast Equalization,Image Threshold,Instance Segmentation Model,Polygon Visualization,Anthropic Claude,Halo Visualization,Roboflow Custom Metadata,Keypoint Detection Model,Florence-2 Model,Local File Sink,Icon Visualization,Detection Offset,OpenAI,SAM2 Video Tracker,SAM 3,Detections Filter,ByteTrack Tracker,Size Measurement,Multi-Label Classification Model,LMM,Detections Transformation,Reference Path Visualization,Stitch OCR Detections,SIFT Comparison,Roboflow Dataset Upload,S3 Sink,Cache Set,BoT-SORT Tracker,OpenAI-Compatible LLM,Morphological Transformation,Object Detection Model,Mask Visualization,Stability AI Image Generation,Qwen-VL,Crop Visualization,Blur Visualization,Stitch OCR Detections,Google Gemma API,Velocity,Qwen 3.5 API,Path Deviation,Perception Encoder Embedding Model,Background Color Visualization,Slack Notification,Anthropic Claude,Qwen 3.6 API,Webhook Sink,Color Visualization,Bounding Box Visualization,Google Gemma,Detection Event Log,CogVLM,Path Deviation,Llama 3.2 Vision,Instance Segmentation Model,Instance Segmentation Model,Google Vision OCR,Google Gemini,SAM 3,Distance Measurement,Single-Label Classification Model,Llama 3.2 Vision,SORT Tracker,Twilio SMS Notification,Detections Stabilizer,Moondream2,Anthropic Claude,OpenAI,Depth Estimation,Classification 
Label Visualization,PTZ Tracking (ONVIF),MoonshotAI Kimi,Florence-2 Model,Time in Zone,MoonshotAI Kimi,Line Counter,Dot Visualization,Roboflow Dataset Upload,Stability AI Inpainting,Per-Class Confidence Filter,Semantic Segmentation Model,Line Counter,Detections Classes Replacement,Label Visualization,Overlap Filter,Detections Merge,Google Gemini,Email Notification,Halo Visualization,OpenAI,Pixel Color Count,LMM For Classification,Text Display,Mask Area Measurement,Circle Visualization,Line Counter Visualization,Byte Tracker,Detections Combine,Overlap Analysis,Twilio SMS/MMS Notification,Morphological Transformation,Triangle Visualization,Perspective Correction
Input and Output Bindings
The available connections depend on the block's binding kinds. Check which binding kinds
Google Vision OCR in version v1 has.
Bindings
- input
  - image (image): Image to run OCR.
  - api_key (Union[ROBOFLOW_MANAGED_KEY, secret, string]): Your Google Vision API key.
- output
  - text (string): String value.
  - language (string): String value.
  - predictions (object_detection_prediction): Prediction with detected bounding boxes in form of sv.Detections(...) object.
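The three outputs listed above can be exposed from a workflow through step selectors. The following is a minimal sketch, assuming the usual `$steps.<step_name>.<output>` selector form and a `JsonField` output type; the helper `ocr_output_fields` is hypothetical.

```python
from typing import Dict, List

# Output names taken from the bindings list on this page.
OCR_OUTPUTS = ("text", "language", "predictions")


def ocr_output_fields(step_name: str) -> List[Dict[str, str]]:
    """Build one workflow output entry per OCR output (illustrative helper)."""
    return [
        {
            "type": "JsonField",  # assumed output type
            "name": f"{step_name}_{output}",
            "selector": f"$steps.{step_name}.{output}",
        }
        for output in OCR_OUTPUTS
    ]


fields = ocr_output_fields("ocr")
for field in fields:
    print(field["name"], "->", field["selector"])
```

The same selectors are what a downstream step would reference, for example passing the predictions output to a visualization block.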
Example JSON definition of step Google Vision OCR in version v1
{
  "name": "<your_step_name_here>",
  "type": "roboflow_core/google_vision_ocr@v1",
  "image": "$inputs.image",
  "ocr_type": "text_detection",
  "api_key": "xxx-xxx",
  "language_hints": [
    "en",
    "fr"
  ]
}