Google Vision OCR
Class: GoogleVisionOCRBlockV1
Source: inference.core.workflows.core_steps.models.foundation.google_vision_ocr.v1.GoogleVisionOCRBlockV1
Detect text in images using Google Vision OCR.
Supported types of text detection:
- text_detection: optimized for areas of text within a larger image.
- ocr_text_detection: optimized for dense text documents.
You need to provide your Google Vision API key to use this block.
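As a minimal sketch of how the two detection types above might be selected in client code, the helper below maps a simple "is this a dense document?" decision to the corresponding ocr_type value. The names OCR_TYPES, choose_ocr_type, and validate_ocr_type are hypothetical and are not part of the block's API; only the two string values come from this page.

```python
# Hypothetical helpers, not part of the block's API.
# The two keys are the ocr_type values listed on this page.
OCR_TYPES = {
    "text_detection": "areas of text within a larger image",
    "ocr_text_detection": "dense text documents",
}


def choose_ocr_type(dense_document: bool) -> str:
    """Pick the ocr_type suited to the kind of image being processed."""
    return "ocr_text_detection" if dense_document else "text_detection"


def validate_ocr_type(value: str) -> str:
    """Return the value unchanged if it is a known ocr_type, else raise."""
    if value not in OCR_TYPES:
        known = ", ".join(sorted(OCR_TYPES))
        raise ValueError(f"Unknown ocr_type {value!r}; expected one of: {known}")
    return value


scanned_invoice = choose_ocr_type(dense_document=True)
street_sign = choose_ocr_type(dense_document=False)
```

Validating the value before it is placed in a step definition surfaces a typo early rather than at workflow run time.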
Type identifier
Use the following identifier in the step "type" field to add the block as a step in your workflow: roboflow_core/google_vision_ocr@v1
Properties
Name | Type | Description | Refs |
---|---|---|---|
name | str | Enter a unique identifier for this step. | ❌ |
ocr_type | str | Type of OCR to use. | ❌ |
language_hints | List[str] | Optional list of language codes to pass to the OCR API. If not provided, the API will attempt to detect the language automatically. If provided, language codes must be supported by the OCR API; see https://cloud.google.com/vision/docs/languages for the list of supported language codes. | ❌ |
api_key | str | Your Google Vision API key. | ✅ |
The Refs column indicates whether a property can be parametrised with dynamic values available at workflow runtime. See Bindings for more info.
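The sketch below illustrates what the Refs column implies for a step definition: api_key (marked ✅) may be given as a dynamic reference, while name, ocr_type, and language_hints (marked ❌) are kept as literals. It assumes that dynamic references are strings beginning with "$", following the "$inputs.image" pattern used in this page's JSON example. The function names and the runtime input name google_api_key are hypothetical.

```python
from typing import Dict, List, Optional

BLOCK_TYPE = "roboflow_core/google_vision_ocr@v1"


def is_selector(value: object) -> bool:
    """Assumed convention: dynamic references are strings starting with '$'."""
    return isinstance(value, str) and value.startswith("$")


def build_step(
    name: str,
    ocr_type: str,
    api_key: str,
    language_hints: Optional[List[str]] = None,
    image: str = "$inputs.image",
) -> Dict[str, object]:
    """Build a step dict, rejecting dynamic references on properties marked ❌."""
    static_values = [name, ocr_type] + list(language_hints or [])
    for value in static_values:
        if is_selector(value):
            raise ValueError(f"{value!r} must be a literal, not a dynamic reference")
    step: Dict[str, object] = {
        "name": name,
        "type": BLOCK_TYPE,
        "image": image,
        "ocr_type": ocr_type,
        "api_key": api_key,  # may be a literal key or a dynamic reference
    }
    if language_hints is not None:
        step["language_hints"] = list(language_hints)
    return step


# "google_api_key" is a hypothetical runtime input name.
step = build_step(
    name="ocr",
    ocr_type="text_detection",
    api_key="$inputs.google_api_key",
    language_hints=["en", "fr"],
)
```

Passing the key as a runtime reference keeps the secret out of the stored workflow definition.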
Available Connections
Compatible Blocks
Check what blocks you can connect to Google Vision OCR in version v1.
- inputs: OpenAI, LMM, Roboflow Custom Metadata, Image Convert Grayscale, Florence-2 Model, Absolute Static Crop, Multi-Label Classification Model, Twilio SMS Notification, Relative Static Crop, Label Visualization, Line Counter Visualization, Background Color Visualization, Stitch Images, OCR Model, Camera Focus, Image Contours, Image Preprocessing, Image Slicer, Reference Path Visualization, Keypoint Detection Model, Instance Segmentation Model, SIFT Comparison, Object Detection Model, Triangle Visualization, Grid Visualization, Polygon Zone Visualization, Keypoint Visualization, LMM For Classification, Depth Estimation, Stitch OCR Detections, Google Vision OCR, Roboflow Dataset Upload, Llama 3.2 Vision, Bounding Box Visualization, Image Blur, Perspective Correction, Halo Visualization, Clip Comparison, Ellipse Visualization, Color Visualization, Crop Visualization, OpenAI, Webhook Sink, Google Gemini, Dot Visualization, Pixelate Visualization, Model Comparison Visualization, Email Notification, Classification Label Visualization, Camera Calibration, Slack Notification, VLM as Detector, Stability AI Image Generation, Roboflow Dataset Upload, Polygon Visualization, Trace Visualization, Corner Visualization, Image Threshold, Single-Label Classification Model, Blur Visualization, VLM as Classifier, CSV Formatter, Image Slicer, Local File Sink, Model Monitoring Inference Aggregator, CogVLM, Stability AI Inpainting, Mask Visualization, SIFT, Anthropic Claude, Florence-2 Model, Circle Visualization, Dynamic Crop
- outputs: Size Measurement, LMM, Roboflow Custom Metadata, Detection Offset, Distance Measurement, Line Counter Visualization, Detections Classes Replacement, Background Color Visualization, Reference Path Visualization, Instance Segmentation Model, SIFT Comparison, Triangle Visualization, Path Deviation, Line Counter, Detections Stabilizer, Detections Transformation, Byte Tracker, Llama 3.2 Vision, Roboflow Dataset Upload, Google Vision OCR, Clip Comparison, Perspective Correction, Crop Visualization, Webhook Sink, Dot Visualization, Detections Filter, Email Notification, Model Comparison Visualization, Classification Label Visualization, Instance Segmentation Model, Slack Notification, Stability AI Image Generation, Cache Set, Detections Merge, Time in Zone, Line Counter, Time in Zone, Corner Visualization, Trace Visualization, Image Threshold, Local File Sink, Blur Visualization, CogVLM, Stability AI Inpainting, Circle Visualization, Overlap Filter, OpenAI, Path Deviation, Florence-2 Model, Twilio SMS Notification, Label Visualization, Image Preprocessing, Detections Stitch, Byte Tracker, Polygon Zone Visualization, Keypoint Visualization, LMM For Classification, Stitch OCR Detections, Bounding Box Visualization, Image Blur, OpenAI, CLIP Embedding Model, Google Gemini, Halo Visualization, Ellipse Visualization, Color Visualization, Pixelate Visualization, Pixel Color Count, YOLO-World Model, Velocity, Roboflow Dataset Upload, Polygon Visualization, Segment Anything 2 Model, Cache Get, Model Monitoring Inference Aggregator, Mask Visualization, Anthropic Claude, Florence-2 Model, Byte Tracker, Dynamic Crop, Detections Consensus
Input and Output Bindings
The available connections depend on the block's binding kinds. Check what binding kinds Google Vision OCR in version v1 has.
Bindings
Example JSON definition of step Google Vision OCR in version v1
{
    "name": "<your_step_name_here>",
    "type": "roboflow_core/google_vision_ocr@v1",
    "image": "$inputs.image",
    "ocr_type": "<block_does_not_provide_example>",
    "language_hints": [
        "en",
        "fr"
    ],
    "api_key": "xxx-xxx"
}
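For context, here is a hedged Python sketch of how a step like the one above might sit inside a complete workflow definition. Only the step fields come from this page. The surrounding envelope (the "version", "inputs", "steps", and "outputs" keys, the "WorkflowImage" and "JsonField" type names, and the "$steps.ocr.text" selector) is an assumption about the workflow format and should be checked against the Workflows documentation. The step name "ocr", the output name "ocr_text", and the GOOGLE_VISION_API_KEY environment variable are hypothetical.

```python
import json
import os

# Hypothetical environment variable; falls back to the page's placeholder key.
api_key = os.environ.get("GOOGLE_VISION_API_KEY", "xxx-xxx")

# Step fields mirror the JSON example on this page, with ocr_type filled in
# using one of the two supported values.
ocr_step = {
    "name": "ocr",
    "type": "roboflow_core/google_vision_ocr@v1",
    "image": "$inputs.image",
    "ocr_type": "text_detection",
    "language_hints": ["en", "fr"],
    "api_key": api_key,
}

# ASSUMPTION: this envelope and the output selector are not confirmed by
# this page; treat them as illustrative only.
workflow = {
    "version": "1.0",
    "inputs": [{"type": "WorkflowImage", "name": "image"}],
    "steps": [ocr_step],
    "outputs": [
        {"type": "JsonField", "name": "ocr_text", "selector": "$steps.ocr.text"}
    ],
}

# Serialise to JSON, the format used for step definitions on this page.
serialized = json.dumps(workflow, indent=4)
```

Building the definition as a Python dict and serialising it with json.dumps avoids hand-editing JSON and keeps the API key out of source files.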