Google Vision OCR¶
Class: GoogleVisionOCRBlockV1
Source: inference.core.workflows.core_steps.models.foundation.google_vision_ocr.v1.GoogleVisionOCRBlockV1
Detect text in images using Google Vision OCR.
Supported types of text detection:
text_detection
: optimized for areas of text within a larger image.ocr_text_detection
: optimized for dense text documents.
You need to provide your Google Vision API key to use this block.
Type identifier¶
Use the following identifier in step "type"
field: roboflow_core/google_vision_ocr@v1
to add the block as
as step in your workflow.
Properties¶
Name | Type | Description | Refs |
---|---|---|---|
name |
str |
Enter a unique identifier for this step.. | ❌ |
ocr_type |
str |
Type of OCR to use. | ❌ |
language_hints |
List[str] |
Optional list of language codes to pass to the OCR API. If not provided, the API will attempt to detect the language automatically.If provided, language codes must be supported by the OCR API, visit https://cloud.google.com/vision/docs/languages for list of supported language codes.. | ❌ |
api_key |
str |
Your Google Vision API key. | ✅ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow
runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to Google Vision OCR
in version v1
.
- inputs:
Slack Notification
,Image Slicer
,Stitch OCR Detections
,Stability AI Inpainting
,Pixelate Visualization
,Perspective Correction
,Clip Comparison
,Object Detection Model
,OpenAI
,Relative Static Crop
,Roboflow Custom Metadata
,Twilio SMS Notification
,VLM as Classifier
,SIFT Comparison
,Roboflow Dataset Upload
,Google Gemini
,Grid Visualization
,Ellipse Visualization
,SIFT
,Model Comparison Visualization
,CogVLM
,Halo Visualization
,Image Contours
,VLM as Detector
,Crop Visualization
,OpenAI
,Absolute Static Crop
,Camera Focus
,Image Blur
,Trace Visualization
,Circle Visualization
,Keypoint Detection Model
,Image Preprocessing
,Multi-Label Classification Model
,Background Color Visualization
,Dot Visualization
,Google Vision OCR
,Polygon Zone Visualization
,Roboflow Dataset Upload
,Florence-2 Model
,Classification Label Visualization
,Bounding Box Visualization
,Corner Visualization
,Llama 3.2 Vision
,Image Slicer
,Florence-2 Model
,Local File Sink
,Dynamic Crop
,Reference Path Visualization
,Label Visualization
,LMM For Classification
,Mask Visualization
,Stitch Images
,Triangle Visualization
,Stability AI Image Generation
,Image Threshold
,Line Counter Visualization
,OCR Model
,LMM
,Keypoint Visualization
,Model Monitoring Inference Aggregator
,Color Visualization
,Email Notification
,Single-Label Classification Model
,Blur Visualization
,Anthropic Claude
,Webhook Sink
,CSV Formatter
,Image Convert Grayscale
,Instance Segmentation Model
,Polygon Visualization
- outputs:
Segment Anything 2 Model
,Cache Get
,Stability AI Inpainting
,Clip Comparison
,Perspective Correction
,Cache Set
,Roboflow Custom Metadata
,SIFT Comparison
,Detection Offset
,CogVLM
,Ellipse Visualization
,OpenAI
,Polygon Visualization
,Trace Visualization
,CLIP Embedding Model
,Dot Visualization
,Google Vision OCR
,Polygon Zone Visualization
,Roboflow Dataset Upload
,Llama 3.2 Vision
,Classification Label Visualization
,Corner Visualization
,Byte Tracker
,Line Counter
,Dynamic Crop
,Reference Path Visualization
,Label Visualization
,Detections Stabilizer
,Mask Visualization
,Triangle Visualization
,Line Counter Visualization
,Detections Transformation
,Model Monitoring Inference Aggregator
,Time in Zone
,Blur Visualization
,Line Counter
,Anthropic Claude
,Instance Segmentation Model
,Webhook Sink
,Time in Zone
,Instance Segmentation Model
,Slack Notification
,Detections Filter
,Stitch OCR Detections
,Pixelate Visualization
,Path Deviation
,Detections Consensus
,Twilio SMS Notification
,Roboflow Dataset Upload
,Google Gemini
,Model Comparison Visualization
,Halo Visualization
,Crop Visualization
,Byte Tracker
,Image Blur
,Distance Measurement
,Circle Visualization
,Velocity
,Image Preprocessing
,Background Color Visualization
,Pixel Color Count
,Size Measurement
,Florence-2 Model
,Bounding Box Visualization
,Florence-2 Model
,Local File Sink
,Byte Tracker
,LMM For Classification
,Stability AI Image Generation
,Image Threshold
,Detections Stitch
,LMM
,Keypoint Visualization
,Email Notification
,Color Visualization
,Path Deviation
,YOLO-World Model
,Detections Classes Replacement
,OpenAI
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
Google Vision OCR
in version v1
has.
Bindings
Example JSON definition of step Google Vision OCR
in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/google_vision_ocr@v1",
"image": "$inputs.image",
"ocr_type": "<block_does_not_provide_example>",
"language_hints": [
"en",
"fr"
],
"api_key": "xxx-xxx"
}