Google Vision OCR
Class: GoogleVisionOCRBlockV1
Source: inference.core.workflows.core_steps.models.foundation.google_vision_ocr.v1.GoogleVisionOCRBlockV1
Detect text in images using Google Vision OCR.
Supported types of text detection:
- text_detection: optimized for areas of text within a larger image.
- ocr_text_detection: optimized for dense text documents.
You need to provide your Google Vision API key to use this block.
Type identifier
Use the identifier roboflow_core/google_vision_ocr@v1 in the step "type" field to add the block as a step in your workflow.
Properties
Name | Type | Description | Refs |
---|---|---|---|
name | str | Enter a unique identifier for this step. | ❌ |
ocr_type | str | Type of OCR to use. | ❌ |
language_hints | List[str] | Optional list of language codes to pass to the OCR API. If not provided, the API will attempt to detect the language automatically. If provided, language codes must be supported by the OCR API; see https://cloud.google.com/vision/docs/languages for the list of supported language codes. | ❌ |
api_key | str | Your Google Vision API key. | ✅ |
The Refs column marks whether a property can be parametrised with dynamic values available at workflow runtime. See Bindings for more info.
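As a rough illustration of the Refs column, the sketch below flags properties that are given a dynamic value even though the table marks them as not parametrisable. It assumes that a dynamic value is a string starting with `$`, as in the `$inputs.image` selector used in the example step definition; the helper names and the `$inputs.google_api_key` input are hypothetical and not part of the library.

```python
# Properties from the table above, mapped to whether the Refs column
# allows a dynamic (runtime) value.
PARAMETRISABLE = {
    "name": False,
    "ocr_type": False,
    "language_hints": False,
    "api_key": True,
}


def is_selector(value):
    """Assumption: a dynamic value is any string that starts with '$'."""
    return isinstance(value, str) and value.startswith("$")


def find_invalid_bindings(step):
    """Return the non-parametrisable properties that were given a selector."""
    return sorted(
        key
        for key, allowed in PARAMETRISABLE.items()
        if not allowed and is_selector(step.get(key))
    )


step = {
    "name": "ocr",
    "type": "roboflow_core/google_vision_ocr@v1",
    "image": "$inputs.image",
    "ocr_type": "text_detection",
    "api_key": "$inputs.google_api_key",  # hypothetical workflow input name
}

print(find_invalid_bindings(step))
```

Here only api_key is bound to a runtime input, so nothing is flagged; binding ocr_type to a selector would be reported.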
Available Connections
Compatible Blocks
Check what blocks you can connect to Google Vision OCR in version v1.
- inputs: Image Threshold, Keypoint Detection Model, CogVLM, Image Convert Grayscale, Anthropic Claude, OpenAI, Google Vision OCR, SIFT Comparison, Florence-2 Model, Absolute Static Crop, Pixelate Visualization, Single-Label Classification Model, SIFT, Line Counter Visualization, Stitch Images, OpenAI, Stability AI Image Generation, Dot Visualization, Mask Visualization, Image Slicer, Object Detection Model, Instance Segmentation Model, Background Color Visualization, Florence-2 Model, Model Monitoring Inference Aggregator, Stitch OCR Detections, Triangle Visualization, Image Slicer, VLM as Detector, Roboflow Dataset Upload, Roboflow Dataset Upload, Image Blur, Email Notification, Camera Focus, Grid Visualization, CSV Formatter, Polygon Zone Visualization, Blur Visualization, Crop Visualization, Label Visualization, Classification Label Visualization, Depth Estimation, Image Preprocessing, Model Comparison Visualization, Stability AI Inpainting, LMM, Llama 3.2 Vision, Reference Path Visualization, Multi-Label Classification Model, Ellipse Visualization, Twilio SMS Notification, Google Gemini, Bounding Box Visualization, Halo Visualization, Image Contours, Corner Visualization, Camera Calibration, Roboflow Custom Metadata, Clip Comparison, Circle Visualization, Perspective Correction, Dynamic Crop, Slack Notification, Polygon Visualization, VLM as Classifier, Relative Static Crop, Trace Visualization, Keypoint Visualization, OpenAI, Webhook Sink, Color Visualization, OCR Model, LMM For Classification, Local File Sink
- outputs: CogVLM, Anthropic Claude, Cache Set, Google Vision OCR, OpenAI, Detections Classes Replacement, Florence-2 Model, Detection Offset, Distance Measurement, Pixelate Visualization, OpenAI, Stability AI Image Generation, YOLO-World Model, Mask Visualization, Overlap Filter, Triangle Visualization, Path Deviation, Polygon Zone Visualization, Model Comparison Visualization, Crop Visualization, LMM, Classification Label Visualization, Segment Anything 2 Model, Reference Path Visualization, Twilio SMS Notification, Line Counter, Google Gemini, Bounding Box Visualization, Roboflow Custom Metadata, Byte Tracker, Size Measurement, Circle Visualization, Perspective Correction, Slack Notification, Pixel Color Count, Polygon Visualization, Detections Transformation, Instance Segmentation Model, Trace Visualization, Detections Merge, Webhook Sink, Color Visualization, LMM For Classification, Detections Consensus, Image Threshold, Detections Stitch, SIFT Comparison, Line Counter Visualization, Line Counter, Dot Visualization, Detections Filter, Instance Segmentation Model, Florence-2 Model, Background Color Visualization, Model Monitoring Inference Aggregator, Detections Stabilizer, Stitch OCR Detections, Roboflow Dataset Upload, Roboflow Dataset Upload, Image Blur, Time in Zone, Email Notification, Path Deviation, CLIP Embedding Model, Stability AI Inpainting, Label Visualization, Blur Visualization, Llama 3.2 Vision, Image Preprocessing, Byte Tracker, Ellipse Visualization, ONVIF Control, Byte Tracker, Halo Visualization, Corner Visualization, Clip Comparison, Cache Get, Time in Zone, Dynamic Crop, OpenAI, Keypoint Visualization, Velocity, Local File Sink
Input and Output Bindings
The available connections depend on the block's binding kinds. Check what binding kinds Google Vision OCR in version v1 has.
Bindings
Example JSON definition of step Google Vision OCR in version v1
{
    "name": "<your_step_name_here>",
    "type": "roboflow_core/google_vision_ocr@v1",
    "image": "$inputs.image",
    "ocr_type": "<block_does_not_provide_example>",
    "language_hints": [
        "en",
        "fr"
    ],
    "api_key": "xxx-xxx"
}
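The definition above can also be assembled programmatically. The following is a minimal sketch, not part of the library: `build_ocr_step` is a hypothetical helper that fills in the fields shown in the example and rejects any ocr_type other than the two detection types listed at the top of this page. Omitting language_hints when none are given is an assumption based on the property being described as optional.

```python
import json

# The two detection types listed at the top of this page.
SUPPORTED_OCR_TYPES = ("text_detection", "ocr_text_detection")


def build_ocr_step(name, api_key, ocr_type, language_hints=None, image="$inputs.image"):
    """Hypothetical helper: build a Google Vision OCR step definition as a dict."""
    if ocr_type not in SUPPORTED_OCR_TYPES:
        raise ValueError(
            f"ocr_type must be one of {SUPPORTED_OCR_TYPES}, got {ocr_type!r}"
        )
    step = {
        "name": name,
        "type": "roboflow_core/google_vision_ocr@v1",
        "image": image,
        "ocr_type": ocr_type,
        "api_key": api_key,
    }
    # language_hints is optional, so it is only included when provided.
    if language_hints:
        step["language_hints"] = list(language_hints)
    return step


definition = build_ocr_step(
    name="ocr",
    api_key="xxx-xxx",
    ocr_type="ocr_text_detection",
    language_hints=["en", "fr"],
)
print(json.dumps(definition, indent=4))
```

Validating ocr_type before the definition is submitted surfaces a typo early, rather than when the workflow is compiled or run.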