Google Vision OCR¶
Class: GoogleVisionOCRBlockV1
Source: inference.core.workflows.core_steps.models.foundation.google_vision_ocr.v1.GoogleVisionOCRBlockV1
Detect text in images using Google Vision OCR.
Supported types of text detection:
text_detection
: optimized for areas of text within a larger image.ocr_text_detection
: optimized for dense text documents.
You need to provide your Google Vision API key to use this block.
Type identifier¶
Use the following identifier in step "type"
field: roboflow_core/google_vision_ocr@v1
to add the block as
as step in your workflow.
Properties¶
Name | Type | Description | Refs |
---|---|---|---|
name |
str |
Enter a unique identifier for this step.. | ❌ |
ocr_type |
str |
Type of OCR to use. | ❌ |
language_hints |
List[str] |
Optional list of language codes to pass to the OCR API. If not provided, the API will attempt to detect the language automatically.If provided, language codes must be supported by the OCR API, visit https://cloud.google.com/vision/docs/languages for list of supported language codes.. | ❌ |
api_key |
str |
Your Google Vision API key. | ✅ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow
runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to Google Vision OCR
in version v1
.
- inputs:
Polygon Zone Visualization
,LMM For Classification
,VLM as Classifier
,Dot Visualization
,Morphological Transformation
,Blur Visualization
,Perspective Correction
,Corner Visualization
,LMM
,Florence-2 Model
,Grid Visualization
,Image Threshold
,Florence-2 Model
,Halo Visualization
,OpenAI
,Multi-Label Classification Model
,CogVLM
,EasyOCR
,Line Counter Visualization
,Stitch OCR Detections
,Stability AI Outpainting
,Twilio SMS Notification
,Keypoint Detection Model
,Google Vision OCR
,Camera Focus
,Roboflow Dataset Upload
,SIFT
,Email Notification
,Instance Segmentation Model
,Image Slicer
,Image Convert Grayscale
,Keypoint Visualization
,Clip Comparison
,OCR Model
,Llama 3.2 Vision
,Bounding Box Visualization
,Reference Path Visualization
,Dynamic Crop
,Roboflow Dataset Upload
,Mask Visualization
,Image Preprocessing
,Background Color Visualization
,Local File Sink
,Webhook Sink
,Camera Calibration
,OpenAI
,Depth Estimation
,Image Slicer
,QR Code Generator
,Trace Visualization
,Object Detection Model
,Contrast Equalization
,Crop Visualization
,Stability AI Image Generation
,SIFT Comparison
,Roboflow Custom Metadata
,Model Comparison Visualization
,Pixelate Visualization
,Model Monitoring Inference Aggregator
,Anthropic Claude
,Relative Static Crop
,Image Contours
,Polygon Visualization
,OpenAI
,Slack Notification
,Triangle Visualization
,Classification Label Visualization
,Circle Visualization
,Image Blur
,Label Visualization
,Google Gemini
,VLM as Detector
,Absolute Static Crop
,Stability AI Inpainting
,Icon Visualization
,Ellipse Visualization
,Color Visualization
,CSV Formatter
,Single-Label Classification Model
,Stitch Images
- outputs:
Byte Tracker
,Distance Measurement
,Time in Zone
,Dot Visualization
,Perspective Correction
,Blur Visualization
,OpenAI
,Stability AI Outpainting
,Twilio SMS Notification
,Google Vision OCR
,Roboflow Dataset Upload
,Email Notification
,Instance Segmentation Model
,Llama 3.2 Vision
,Bounding Box Visualization
,Detections Stabilizer
,Reference Path Visualization
,Image Preprocessing
,Local File Sink
,OpenAI
,Detections Stitch
,Velocity
,Stability AI Image Generation
,SIFT Comparison
,Roboflow Custom Metadata
,Cache Get
,Model Comparison Visualization
,Line Counter
,Time in Zone
,Polygon Visualization
,Slack Notification
,Triangle Visualization
,YOLO-World Model
,Detections Classes Replacement
,Circle Visualization
,Label Visualization
,Google Gemini
,Detections Merge
,Path Deviation
,Detections Consensus
,Moondream2
,Segment Anything 2 Model
,Polygon Zone Visualization
,LMM For Classification
,Morphological Transformation
,Size Measurement
,Corner Visualization
,LMM
,Pixel Color Count
,Florence-2 Model
,PTZ Tracking (ONVIF)
.md),Image Threshold
,Florence-2 Model
,Halo Visualization
,CogVLM
,Detections Combine
,Detection Offset
,Byte Tracker
,Line Counter Visualization
,Perception Encoder Embedding Model
,Stitch OCR Detections
,CLIP Embedding Model
,Clip Comparison
,Keypoint Visualization
,Line Counter
,Instance Segmentation Model
,Dynamic Crop
,Roboflow Dataset Upload
,Mask Visualization
,Background Color Visualization
,Webhook Sink
,QR Code Generator
,Trace Visualization
,Time in Zone
,Detections Transformation
,Contrast Equalization
,Byte Tracker
,Cache Set
,Crop Visualization
,Overlap Filter
,Pixelate Visualization
,Model Monitoring Inference Aggregator
,Anthropic Claude
,OpenAI
,Path Deviation
,Detections Filter
,Classification Label Visualization
,Image Blur
,Stability AI Inpainting
,Icon Visualization
,Ellipse Visualization
,Color Visualization
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
Google Vision OCR
in version v1
has.
Bindings
Example JSON definition of step Google Vision OCR
in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/google_vision_ocr@v1",
"image": "$inputs.image",
"ocr_type": "<block_does_not_provide_example>",
"language_hints": [
"en",
"fr"
],
"api_key": "xxx-xxx"
}