Google Vision OCR¶
Class: GoogleVisionOCRBlockV1
Source: inference.core.workflows.core_steps.models.foundation.google_vision_ocr.v1.GoogleVisionOCRBlockV1
Detect text in images using Google Vision OCR.
Supported types of text detection:
text_detection: optimized for areas of text within a larger image.ocr_text_detection: optimized for dense text documents.
You need to provide your Google Vision API key to use this block.
Type identifier¶
Use the following identifier in step "type" field: roboflow_core/google_vision_ocr@v1to add the block as
as step in your workflow.
Properties¶
| Name | Type | Description | Refs |
|---|---|---|---|
name |
str |
Enter a unique identifier for this step.. | ❌ |
ocr_type |
str |
Type of OCR to use. | ❌ |
language_hints |
List[str] |
Optional list of language codes to pass to the OCR API. If not provided, the API will attempt to detect the language automatically.If provided, language codes must be supported by the OCR API, visit https://cloud.google.com/vision/docs/languages for list of supported language codes.. | ❌ |
api_key |
str |
Your Google Vision API key. | ✅ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to Google Vision OCR in version v1.
- inputs:
Grid Visualization,Circle Visualization,Model Monitoring Inference Aggregator,Roboflow Dataset Upload,QR Code Generator,Image Slicer,Dot Visualization,Single-Label Classification Model,Blur Visualization,Slack Notification,Perspective Correction,Roboflow Dataset Upload,Anthropic Claude,Background Color Visualization,OpenAI,Florence-2 Model,Object Detection Model,Llama 3.2 Vision,Dynamic Crop,Crop Visualization,OCR Model,EasyOCR,Trace Visualization,Keypoint Detection Model,Image Threshold,Triangle Visualization,Reference Path Visualization,Model Comparison Visualization,Polygon Visualization,Corner Visualization,Image Slicer,Florence-2 Model,Image Blur,SIFT Comparison,Bounding Box Visualization,Stitch OCR Detections,Keypoint Visualization,Image Convert Grayscale,Clip Comparison,Line Counter Visualization,SIFT,Icon Visualization,Stability AI Inpainting,VLM as Detector,Google Vision OCR,Polygon Zone Visualization,OpenAI,Webhook Sink,Camera Calibration,Instance Segmentation Model,CogVLM,Mask Visualization,Camera Focus,Twilio SMS Notification,Stability AI Outpainting,Classification Label Visualization,Multi-Label Classification Model,LMM,Image Preprocessing,Color Visualization,Morphological Transformation,Depth Estimation,LMM For Classification,Ellipse Visualization,Stability AI Image Generation,Email Notification,Halo Visualization,Stitch Images,Local File Sink,Roboflow Custom Metadata,Absolute Static Crop,Google Gemini,VLM as Classifier,Image Contours,Pixelate Visualization,CSV Formatter,Label Visualization,OpenAI,Contrast Equalization,Relative Static Crop - outputs:
CLIP Embedding Model,Detections Stitch,Circle Visualization,Time in Zone,Model Monitoring Inference Aggregator,QR Code Generator,Dot Visualization,Overlap Filter,Slack Notification,Blur Visualization,Florence-2 Model,Llama 3.2 Vision,Crop Visualization,Velocity,Image Threshold,Triangle Visualization,Reference Path Visualization,Model Comparison Visualization,Cache Set,Corner Visualization,Image Blur,Size Measurement,SIFT Comparison,Bounding Box Visualization,Stitch OCR Detections,Detections Transformation,YOLO-World Model,Distance Measurement,Icon Visualization,Stability AI Inpainting,Google Vision OCR,Polygon Zone Visualization,Webhook Sink,CogVLM,Time in Zone,Cache Get,Stability AI Outpainting,Classification Label Visualization,Morphological Transformation,Detections Stabilizer,Instance Segmentation Model,Segment Anything 2 Model,Local File Sink,Roboflow Custom Metadata,PTZ Tracking (ONVIF).md),Pixelate Visualization,Pixel Color Count,Path Deviation,Byte Tracker,Detections Filter,Detections Classes Replacement,Path Deviation,Roboflow Dataset Upload,Perception Encoder Embedding Model,Perspective Correction,Roboflow Dataset Upload,Anthropic Claude,OpenAI,Background Color Visualization,Dynamic Crop,Trace Visualization,Detection Offset,Polygon Visualization,Florence-2 Model,Moondream2,Line Counter,Byte Tracker,Keypoint Visualization,Byte Tracker,Detections Combine,Clip Comparison,Line Counter Visualization,OpenAI,Line Counter,Instance Segmentation Model,Mask Visualization,Twilio SMS Notification,Detections Consensus,LMM,Image Preprocessing,Time in Zone,Color Visualization,LMM For Classification,Ellipse Visualization,Stability AI Image Generation,Email Notification,Halo Visualization,Google Gemini,Label Visualization,Detections Merge,OpenAI,Contrast Equalization
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
Google Vision OCR in version v1 has.
Bindings
Example JSON definition of step Google Vision OCR in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/google_vision_ocr@v1",
"image": "$inputs.image",
"ocr_type": "<block_does_not_provide_example>",
"language_hints": [
"en",
"fr"
],
"api_key": "xxx-xxx"
}