Google Vision OCR¶
Class: GoogleVisionOCRBlockV1
Source: inference.core.workflows.core_steps.models.foundation.google_vision_ocr.v1.GoogleVisionOCRBlockV1
Detect text in images using Google Vision OCR.
Supported types of text detection:
- text_detection: optimized for areas of text within a larger image.
- ocr_text_detection: optimized for dense text documents.
You need to provide your Google Vision API key to use this block.
Type identifier¶
Use the following identifier in the step "type" field to add the block as a step in your workflow: roboflow_core/google_vision_ocr@v1
Properties¶
Name | Type | Description | Refs |
---|---|---|---|
name | str | Enter a unique identifier for this step. | ❌ |
ocr_type | str | Type of OCR to use. | ❌ |
language_hints | List[str] | Optional list of language codes to pass to the OCR API. If not provided, the API will attempt to detect the language automatically. If provided, the language codes must be supported by the OCR API; see https://cloud.google.com/vision/docs/languages for the list of supported language codes. | ❌ |
api_key | str | Your Google Vision API key. | ✅ |
The Refs column marks whether a property can be parametrised with dynamic values available at workflow runtime. See Bindings for more info.
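As a rough illustration of what the Refs column implies, the sketch below checks a step definition and reports any property that holds a dynamic value even though the table marks it with ❌. The helper names `is_selector` and `find_disallowed_selectors` are hypothetical and not part of the inference library, and the rule that a dynamic value is a string starting with `$` is an assumption based on the `$inputs.image` selector in the example definition.

```python
# Properties from the table above and whether the Refs column marks them
# as parametrisable with dynamic values.
REFS_ALLOWED = {
    "name": False,
    "ocr_type": False,
    "language_hints": False,
    "api_key": True,
}


def is_selector(value):
    """Assumption: a dynamic reference is a string that starts with '$'."""
    return isinstance(value, str) and value.startswith("$")


def find_disallowed_selectors(step):
    """Return names of properties that use a selector although Refs is ❌."""
    problems = []
    for prop, allowed in REFS_ALLOWED.items():
        value = step.get(prop)
        # Treat list-valued properties (language_hints) element by element.
        values = value if isinstance(value, list) else [value]
        if not allowed and any(is_selector(v) for v in values):
            problems.append(prop)
    return problems
```

A definition that passes `"api_key": "$inputs.key"` would yield an empty list, while one that passes `"ocr_type": "$inputs.mode"` would be reported.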
Available Connections¶
Compatible Blocks
Check which blocks you can connect to Google Vision OCR in version v1.
- inputs:
Keypoint Detection Model
,Stability AI Inpainting
,Google Vision OCR
,Florence-2 Model
,Model Monitoring Inference Aggregator
,Label Visualization
,Depth Estimation
,Corner Visualization
,Triangle Visualization
,Florence-2 Model
,Background Color Visualization
,Image Blur
,Polygon Zone Visualization
,CogVLM
,OCR Model
,Model Comparison Visualization
,Line Counter Visualization
,Camera Focus
,Circle Visualization
,Perspective Correction
,Relative Static Crop
,Grid Visualization
,Stability AI Image Generation
,VLM as Detector
,Trace Visualization
,Image Slicer
,CSV Formatter
,OpenAI
,Clip Comparison
,Blur Visualization
,Multi-Label Classification Model
,Classification Label Visualization
,Image Convert Grayscale
,Image Preprocessing
,Slack Notification
,SIFT Comparison
,OpenAI
,Stitch Images
,Email Notification
,Reference Path Visualization
,Stability AI Outpainting
,Llama 3.2 Vision
,Anthropic Claude
,Polygon Visualization
,Webhook Sink
,Roboflow Dataset Upload
,Camera Calibration
,Mask Visualization
,SIFT
,Bounding Box Visualization
,Image Threshold
,Roboflow Custom Metadata
,Single-Label Classification Model
,Keypoint Visualization
,Ellipse Visualization
,Crop Visualization
,Color Visualization
,Pixelate Visualization
,Image Slicer
,VLM as Classifier
,Dynamic Crop
,Image Contours
,Local File Sink
,Absolute Static Crop
,Twilio SMS Notification
,Stitch OCR Detections
,Google Gemini
,Object Detection Model
,Halo Visualization
,OpenAI
,Dot Visualization
,LMM For Classification
,Instance Segmentation Model
,Roboflow Dataset Upload
,LMM
- outputs:
Florence-2 Model
,Model Monitoring Inference Aggregator
,Label Visualization
,Florence-2 Model
,CogVLM
,Triangle Visualization
,Image Blur
,Model Comparison Visualization
,Cache Set
,Detections Transformation
,Line Counter Visualization
,Circle Visualization
,Detections Stitch
,Trace Visualization
,Byte Tracker
,Path Deviation
,Detections Consensus
,Velocity
,Detection Offset
,Reference Path Visualization
,Llama 3.2 Vision
,Detections Filter
,Polygon Visualization
,Roboflow Dataset Upload
,Time in Zone
,Segment Anything 2 Model
,Detections Merge
,Roboflow Custom Metadata
,Image Threshold
,Path Deviation
,CLIP Embedding Model
,Keypoint Visualization
,Local File Sink
,Ellipse Visualization
,Crop Visualization
,Color Visualization
,Google Gemini
,Dynamic Crop
,OpenAI
,Instance Segmentation Model
,Dot Visualization
,Instance Segmentation Model
,Roboflow Dataset Upload
,Stability AI Inpainting
,Line Counter
,Google Vision OCR
,Corner Visualization
,Overlap Filter
,Background Color Visualization
,Polygon Zone Visualization
,Byte Tracker
,Stability AI Image Generation
,Perspective Correction
,Cache Get
,Line Counter
,OpenAI
,Clip Comparison
,Blur Visualization
,Classification Label Visualization
,Time in Zone
,Image Preprocessing
,Slack Notification
,SIFT Comparison
,Byte Tracker
,OpenAI
,Pixel Color Count
,Detections Stabilizer
,YOLO-World Model
,Stability AI Outpainting
,Perception Encoder Embedding Model
,Anthropic Claude
,Size Measurement
,Webhook Sink
,Mask Visualization
,Bounding Box Visualization
,Distance Measurement
,Pixelate Visualization
,Twilio SMS Notification
,Email Notification
,PTZ Tracking (ONVIF)
,Stitch OCR Detections
,Detections Classes Replacement
,Halo Visualization
,LMM For Classification
,LMM
Input and Output Bindings¶
The available connections depend on the block's binding kinds. Check which binding kinds Google Vision OCR in version v1 has.
Example JSON definition of step Google Vision OCR in version v1:
{
    "name": "<your_step_name_here>",
    "type": "roboflow_core/google_vision_ocr@v1",
    "image": "$inputs.image",
    "ocr_type": "<block_does_not_provide_example>",
    "language_hints": [
        "en",
        "fr"
    ],
    "api_key": "xxx-xxx"
}
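The definition above leaves `ocr_type` as a placeholder. A minimal sketch of building the same step in Python might look like the following. It assumes the two detection types listed at the top of this page (`text_detection` and `ocr_text_detection`) are the accepted values for `ocr_type`. The function `build_google_vision_ocr_step` is a hypothetical helper, not part of the inference library, and `$inputs.google_api_key` is an illustrative input name.

```python
import json

# The two detection modes listed on this page; assumed to be the accepted
# values of the "ocr_type" field.
OCR_TYPES = ("text_detection", "ocr_text_detection")


def build_google_vision_ocr_step(name, ocr_type, api_key,
                                 image="$inputs.image", language_hints=None):
    """Build a step definition dict shaped like the JSON example above."""
    if ocr_type not in OCR_TYPES:
        raise ValueError(f"ocr_type must be one of {OCR_TYPES}, got {ocr_type!r}")
    step = {
        "name": name,
        "type": "roboflow_core/google_vision_ocr@v1",
        "image": image,
        "ocr_type": ocr_type,
        "api_key": api_key,
    }
    # language_hints is optional; when omitted the API detects the language.
    if language_hints:
        step["language_hints"] = list(language_hints)
    return step


if __name__ == "__main__":
    step = build_google_vision_ocr_step(
        name="ocr",
        ocr_type="text_detection",
        api_key="$inputs.google_api_key",  # api_key accepts dynamic values (Refs ✅)
        language_hints=["en", "fr"],
    )
    print(json.dumps(step, indent=2))
```

Validating `ocr_type` up front surfaces a typo before the workflow is submitted. Passing the key as a workflow input rather than a literal keeps it out of the stored definition.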