Google Vision OCR
Class: GoogleVisionOCRBlockV1
Source: inference.core.workflows.core_steps.models.foundation.google_vision_ocr.v1.GoogleVisionOCRBlockV1
Detect text in images using Google Vision OCR.
Supported types of text detection:
- text_detection: optimized for areas of text within a larger image.
- ocr_text_detection: optimized for dense text documents.
You need to provide your Google Vision API key to use this block.
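As a rough illustration of the two points above, the sketch below picks one of the two detection types and reads the API key from the environment. The helper names `choose_ocr_type` and `load_api_key`, and the environment variable `GOOGLE_VISION_API_KEY`, are hypothetical and not part of the block itself.

```python
import os

# The two detection types listed above, with the use case each is optimized for.
OCR_TYPES = {
    "text_detection": "areas of text within a larger image",
    "ocr_text_detection": "dense text documents",
}


def choose_ocr_type(dense_document: bool) -> str:
    """Pick the detection type based on how dense the text in the image is."""
    return "ocr_text_detection" if dense_document else "text_detection"


def load_api_key(env_var: str = "GOOGLE_VISION_API_KEY") -> str:
    """Read the API key from an environment variable (the variable name is an assumption)."""
    key = os.environ.get(env_var)
    if not key:
        raise RuntimeError(f"Set {env_var} to your Google Vision API key")
    return key


print(choose_ocr_type(dense_document=True))
```

Keeping the key in the environment rather than in the workflow definition is one common choice; any other secret store would work the same way.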
Type identifier
Use the following identifier in the step "type" field to add the block as a step in your workflow: roboflow_core/google_vision_ocr@v1
Properties
Name | Type | Description | Refs |
---|---|---|---|
name | str | Enter a unique identifier for this step. | ❌ |
ocr_type | str | Type of OCR to use. | ❌ |
language_hints | List[str] | Optional list of language codes to pass to the OCR API. If not provided, the API will attempt to detect the language automatically. If provided, language codes must be supported by the OCR API; see https://cloud.google.com/vision/docs/languages for the list of supported language codes. | ❌ |
api_key | str | Your Google Vision API key. | ✅ |
The Refs column marks whether a property can be parametrised with dynamic values available at workflow runtime. See Bindings for more info.
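The Refs column can be read as a rule: only `api_key` may hold a selector such as `$inputs.<name>`, while the other listed properties take literal values. The following is a minimal sketch of that rule, not the library's own validation. The helper names and the workflow input name `google_vision_api_key` are hypothetical.

```python
import json

# Which properties accept a selector, taken from the Refs column of the table above.
REFS = {
    "name": False,
    "ocr_type": False,
    "language_hints": False,
    "api_key": True,
}


def is_selector(value) -> bool:
    """Return True for strings that look like workflow selectors, e.g. "$inputs.x"."""
    return isinstance(value, str) and value.startswith("$")


def check_step_properties(step: dict) -> dict:
    """Raise ValueError if a property not marked in Refs holds a selector."""
    for prop, allows_selector in REFS.items():
        if prop in step and is_selector(step[prop]) and not allows_selector:
            raise ValueError(
                f"'{prop}' must be a literal value, got selector {step[prop]!r}"
            )
    return step


step = check_step_properties({
    "name": "ocr",
    "type": "roboflow_core/google_vision_ocr@v1",
    "image": "$inputs.image",
    "ocr_type": "text_detection",
    "language_hints": ["en", "fr"],
    # Bound at runtime instead of hard-coding the key (input name is hypothetical).
    "api_key": "$inputs.google_vision_api_key",
})
print(json.dumps(step, indent=2))
```

Binding `api_key` to a runtime input keeps the secret out of the stored workflow definition.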
Available Connections
Compatible Blocks
Check which blocks you can connect to Google Vision OCR in version v1.
- inputs: Blur Visualization, Camera Focus, CogVLM, Image Threshold, Polygon Zone Visualization, Stability AI Inpainting, VLM as Detector, Relative Static Crop, Image Preprocessing, Slack Notification, Keypoint Visualization, Background Color Visualization, Grid Visualization, Local File Sink, Image Convert Grayscale, Trace Visualization, Instance Segmentation Model, Absolute Static Crop, Roboflow Custom Metadata, Color Visualization, Perspective Correction, OpenAI, Twilio SMS Notification, OCR Model, Multi-Label Classification Model, Classification Label Visualization, Circle Visualization, Google Vision OCR, Camera Calibration, Pixelate Visualization, Image Slicer, Clip Comparison, Stitch OCR Detections, Label Visualization, Halo Visualization, Triangle Visualization, Reference Path Visualization, Image Slicer, OpenAI, Single-Label Classification Model, Webhook Sink, Line Counter Visualization, Google Gemini, Roboflow Dataset Upload, Llama 3.2 Vision, Image Blur, Corner Visualization, Florence-2 Model, SIFT Comparison, Email Notification, Object Detection Model, LMM, LMM For Classification, Anthropic Claude, Image Contours, Roboflow Dataset Upload, Dynamic Crop, Polygon Visualization, CSV Formatter, Depth Estimation, SIFT, Florence-2 Model, Ellipse Visualization, Mask Visualization, Model Monitoring Inference Aggregator, Keypoint Detection Model, Stitch Images, Bounding Box Visualization, Dot Visualization, Stability AI Image Generation, Model Comparison Visualization, VLM as Classifier, Crop Visualization
- outputs: Detection Offset, Line Counter, Polygon Zone Visualization, Detections Filter, Slack Notification, Time in Zone, Local File Sink, YOLO-World Model, Instance Segmentation Model, Trace Visualization, Roboflow Custom Metadata, Perspective Correction, OpenAI, Distance Measurement, Circle Visualization, Clip Comparison, OpenAI, Triangle Visualization, Halo Visualization, Line Counter, Byte Tracker, Size Measurement, Byte Tracker, Corner Visualization, Email Notification, Detections Classes Replacement, LMM, Detections Consensus, Roboflow Dataset Upload, Overlap Filter, Dynamic Crop, Cache Set, Velocity, Model Monitoring Inference Aggregator, Cache Get, Segment Anything 2 Model, Llama 3.2 Vision, Anthropic Claude, Model Comparison Visualization, Crop Visualization, Blur Visualization, CLIP Embedding Model, CogVLM, Image Threshold, Stability AI Inpainting, Image Preprocessing, Keypoint Visualization, Background Color Visualization, Path Deviation, Pixel Color Count, Color Visualization, Twilio SMS Notification, Classification Label Visualization, Google Vision OCR, Pixelate Visualization, Stitch OCR Detections, Label Visualization, Time in Zone, Reference Path Visualization, Webhook Sink, Google Gemini, Roboflow Dataset Upload, Line Counter Visualization, Byte Tracker, Image Blur, Detections Transformation, Florence-2 Model, SIFT Comparison, Detections Stabilizer, LMM For Classification, Instance Segmentation Model, Polygon Visualization, Florence-2 Model, Detections Merge, Mask Visualization, Ellipse Visualization, Detections Stitch, Bounding Box Visualization, Dot Visualization, Stability AI Image Generation, Path Deviation
Input and Output Bindings
The available connections depend on the block's binding kinds. Check which binding kinds Google Vision OCR in version v1 has.
Bindings
Example JSON definition of step Google Vision OCR in version v1
{
    "name": "<your_step_name_here>",
    "type": "roboflow_core/google_vision_ocr@v1",
    "image": "$inputs.image",
    "ocr_type": "<block_does_not_provide_example>",
    "language_hints": [
        "en",
        "fr"
    ],
    "api_key": "xxx-xxx"
}
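To show where such a step definition might sit, here is a minimal sketch that embeds it in a complete workflow specification built as a Python dictionary. Several details are assumptions not stated on this page: the top-level layout (`version`, `inputs`, `steps`, `outputs`), the input and output type names, the `text` output field, and the hypothetical names `build_workflow`, `google_ocr` and `google_vision_api_key`.

```python
import json

SUPPORTED_OCR_TYPES = ("text_detection", "ocr_text_detection")


def build_workflow(ocr_type="text_detection", language_hints=None):
    """Return a workflow specification dict containing one Google Vision OCR step."""
    if ocr_type not in SUPPORTED_OCR_TYPES:
        raise ValueError(f"ocr_type must be one of {SUPPORTED_OCR_TYPES}")

    step = {
        "name": "google_ocr",  # hypothetical step name
        "type": "roboflow_core/google_vision_ocr@v1",
        "image": "$inputs.image",
        "ocr_type": ocr_type,
        "api_key": "$inputs.google_vision_api_key",
    }
    # language_hints is optional; omit it to let the API detect the language.
    if language_hints:
        step["language_hints"] = list(language_hints)

    return {
        "version": "1.0",  # assumed specification version
        "inputs": [
            {"type": "WorkflowImage", "name": "image"},
            {"type": "WorkflowParameter", "name": "google_vision_api_key"},
        ],
        "steps": [step],
        "outputs": [
            # The "text" output field is an assumption; check the block's bindings.
            {"type": "JsonField", "name": "ocr_text", "selector": "$steps.google_ocr.text"},
        ],
    }


workflow = build_workflow(language_hints=["en", "fr"])
print(json.dumps(workflow, indent=2))
```

The printed JSON can then be compared against the bindings the block actually exposes before it is used.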