Google Vision OCR¶
Class: GoogleVisionOCRBlockV1
Source: inference.core.workflows.core_steps.models.foundation.google_vision_ocr.v1.GoogleVisionOCRBlockV1
Detect text in images using Google Vision OCR.
Supported types of text detection:
text_detection: optimized for areas of text within a larger image.ocr_text_detection: optimized for dense text documents.
You need to provide your Google Vision API key to use this block.
Type identifier¶
Use the following identifier in step "type" field: roboflow_core/google_vision_ocr@v1to add the block as
as step in your workflow.
Properties¶
| Name | Type | Description | Refs |
|---|---|---|---|
name |
str |
Enter a unique identifier for this step.. | ❌ |
ocr_type |
str |
Type of OCR to use. | ❌ |
language_hints |
List[str] |
Optional list of language codes to pass to the OCR API. If not provided, the API will attempt to detect the language automatically.If provided, language codes must be supported by the OCR API, visit https://cloud.google.com/vision/docs/languages for list of supported language codes.. | ❌ |
api_key |
str |
Your Google Vision API key. | ✅ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to Google Vision OCR in version v1.
- inputs:
Absolute Static Crop,Relative Static Crop,Keypoint Visualization,Object Detection Model,LMM For Classification,Google Vision OCR,Slack Notification,Trace Visualization,Color Visualization,Instance Segmentation Model,Polygon Zone Visualization,Camera Focus,Halo Visualization,CSV Formatter,OCR Model,Camera Calibration,VLM as Classifier,Triangle Visualization,Stability AI Inpainting,Image Threshold,Reference Path Visualization,Corner Visualization,Ellipse Visualization,OpenAI,Single-Label Classification Model,Morphological Transformation,Roboflow Custom Metadata,Grid Visualization,Image Preprocessing,CogVLM,Line Counter Visualization,OpenAI,Florence-2 Model,Roboflow Dataset Upload,Stitch OCR Detections,Label Visualization,Model Comparison Visualization,Multi-Label Classification Model,Roboflow Dataset Upload,OpenAI,Model Monitoring Inference Aggregator,Polygon Visualization,Llama 3.2 Vision,Icon Visualization,Local File Sink,Blur Visualization,Image Contours,Clip Comparison,VLM as Detector,Twilio SMS Notification,Bounding Box Visualization,SIFT,Classification Label Visualization,Background Color Visualization,Webhook Sink,Dynamic Crop,Dot Visualization,Pixelate Visualization,Email Notification,Image Slicer,Stitch Images,Crop Visualization,Mask Visualization,SIFT Comparison,QR Code Generator,Depth Estimation,Image Slicer,Google Gemini,Perspective Correction,Image Convert Grayscale,Stability AI Image Generation,Keypoint Detection Model,EasyOCR,Contrast Equalization,Anthropic Claude,Image Blur,Circle Visualization,Stability AI Outpainting,LMM,Florence-2 Model - outputs:
LMM For Classification,Trace Visualization,Color Visualization,Instance Segmentation Model,Polygon Zone Visualization,Halo Visualization,Triangle Visualization,Image Threshold,Detections Classes Replacement,Detection Offset,Morphological Transformation,Roboflow Custom Metadata,Image Preprocessing,Cache Set,Line Counter Visualization,Stitch OCR Detections,Path Deviation,PTZ Tracking (ONVIF).md),Model Comparison Visualization,Roboflow Dataset Upload,Line Counter,Path Deviation,Polygon Visualization,Detections Stabilizer,Llama 3.2 Vision,Icon Visualization,Local File Sink,Twilio SMS Notification,Classification Label Visualization,Background Color Visualization,Webhook Sink,Dynamic Crop,Pixelate Visualization,Detections Consensus,Email Notification,Crop Visualization,Mask Visualization,Detections Merge,Detections Transformation,Perspective Correction,Distance Measurement,Circle Visualization,Stability AI Outpainting,Byte Tracker,Size Measurement,Keypoint Visualization,Byte Tracker,Google Vision OCR,Slack Notification,Segment Anything 2 Model,Stability AI Inpainting,Reference Path Visualization,Corner Visualization,Ellipse Visualization,OpenAI,Time in Zone,CogVLM,OpenAI,YOLO-World Model,Florence-2 Model,Roboflow Dataset Upload,Label Visualization,OpenAI,Line Counter,Model Monitoring Inference Aggregator,Time in Zone,Velocity,Instance Segmentation Model,Time in Zone,Blur Visualization,Clip Comparison,Overlap Filter,Moondream2,Bounding Box Visualization,Dot Visualization,Byte Tracker,CLIP Embedding Model,Detections Combine,Detections Filter,Perception Encoder Embedding Model,SIFT Comparison,Pixel Color Count,QR Code Generator,Google Gemini,Stability AI Image Generation,Contrast Equalization,Anthropic Claude,Image Blur,Cache Get,LMM,Detections Stitch,Florence-2 Model
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
Google Vision OCR in version v1 has.
Bindings
Example JSON definition of step Google Vision OCR in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/google_vision_ocr@v1",
"image": "$inputs.image",
"ocr_type": "<block_does_not_provide_example>",
"language_hints": [
"en",
"fr"
],
"api_key": "xxx-xxx"
}