Google Vision OCR¶
Class: GoogleVisionOCRBlockV1
Source: inference.core.workflows.core_steps.models.foundation.google_vision_ocr.v1.GoogleVisionOCRBlockV1
Detect text in images using Google Vision OCR.
Supported types of text detection:
text_detection
: optimized for areas of text within a larger image.ocr_text_detection
: optimized for dense text documents.
You need to provide your Google Vision API key to use this block.
Type identifier¶
Use the following identifier in step "type"
field: roboflow_core/google_vision_ocr@v1
to add the block as
as step in your workflow.
Properties¶
Name | Type | Description | Refs |
---|---|---|---|
name |
str |
Enter a unique identifier for this step.. | ❌ |
ocr_type |
str |
Type of OCR to use. | ❌ |
language_hints |
List[str] |
Optional list of language codes to pass to the OCR API. If not provided, the API will attempt to detect the language automatically.If provided, language codes must be supported by the OCR API, visit https://cloud.google.com/vision/docs/languages for list of supported language codes.. | ❌ |
api_key |
str |
Your Google Vision API key. | ✅ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow
runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to Google Vision OCR
in version v1
.
- inputs:
Circle Visualization
,Background Color Visualization
,Corner Visualization
,Bounding Box Visualization
,Twilio SMS Notification
,Object Detection Model
,Line Counter Visualization
,Image Preprocessing
,Slack Notification
,VLM as Detector
,Trace Visualization
,LMM
,Label Visualization
,Polygon Zone Visualization
,Camera Focus
,Local File Sink
,Image Slicer
,Image Slicer
,Image Blur
,Anthropic Claude
,Crop Visualization
,OCR Model
,Dot Visualization
,Google Gemini
,Relative Static Crop
,Model Comparison Visualization
,Stability AI Inpainting
,Roboflow Dataset Upload
,Pixelate Visualization
,Stitch OCR Detections
,Perspective Correction
,OpenAI
,OpenAI
,Image Convert Grayscale
,Absolute Static Crop
,Mask Visualization
,Stability AI Image Generation
,Webhook Sink
,Color Visualization
,Image Threshold
,Clip Comparison
,Dynamic Crop
,Halo Visualization
,Polygon Visualization
,CSV Formatter
,Florence-2 Model
,VLM as Classifier
,Instance Segmentation Model
,Keypoint Detection Model
,Image Contours
,CogVLM
,Camera Calibration
,Email Notification
,SIFT
,Reference Path Visualization
,Florence-2 Model
,Classification Label Visualization
,Triangle Visualization
,Model Monitoring Inference Aggregator
,Single-Label Classification Model
,SIFT Comparison
,Llama 3.2 Vision
,Google Vision OCR
,Roboflow Dataset Upload
,Keypoint Visualization
,Multi-Label Classification Model
,LMM For Classification
,Roboflow Custom Metadata
,Grid Visualization
,Ellipse Visualization
,Stitch Images
,Blur Visualization
- outputs:
Circle Visualization
,Background Color Visualization
,Corner Visualization
,Twilio SMS Notification
,Slack Notification
,LMM
,Polygon Zone Visualization
,Image Blur
,Cache Set
,Dot Visualization
,Path Deviation
,Google Gemini
,Detections Merge
,Detection Offset
,Roboflow Dataset Upload
,Stability AI Inpainting
,Pixelate Visualization
,Line Counter
,OpenAI
,Detections Consensus
,Distance Measurement
,Stability AI Image Generation
,Webhook Sink
,Color Visualization
,Image Threshold
,Halo Visualization
,Polygon Visualization
,Detections Classes Replacement
,CogVLM
,Instance Segmentation Model
,Email Notification
,Classification Label Visualization
,Llama 3.2 Vision
,Google Vision OCR
,Roboflow Dataset Upload
,Byte Tracker
,Ellipse Visualization
,Size Measurement
,Pixel Color Count
,Cache Get
,Bounding Box Visualization
,Line Counter Visualization
,Image Preprocessing
,Trace Visualization
,Label Visualization
,Local File Sink
,Detections Transformation
,Anthropic Claude
,Crop Visualization
,Detections Stitch
,YOLO-World Model
,Model Comparison Visualization
,Stitch OCR Detections
,Perspective Correction
,OpenAI
,Byte Tracker
,Path Deviation
,Mask Visualization
,Time in Zone
,Detections Filter
,Clip Comparison
,Time in Zone
,Dynamic Crop
,Florence-2 Model
,Byte Tracker
,Instance Segmentation Model
,Florence-2 Model
,Reference Path Visualization
,CLIP Embedding Model
,Triangle Visualization
,Model Monitoring Inference Aggregator
,Velocity
,SIFT Comparison
,Keypoint Visualization
,LMM For Classification
,Roboflow Custom Metadata
,Line Counter
,Segment Anything 2 Model
,Detections Stabilizer
,Blur Visualization
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
Google Vision OCR
in version v1
has.
Bindings
Example JSON definition of step Google Vision OCR
in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/google_vision_ocr@v1",
"image": "$inputs.image",
"ocr_type": "<block_does_not_provide_example>",
"language_hints": [
"en",
"fr"
],
"api_key": "xxx-xxx"
}