Google Vision OCR¶
Class: GoogleVisionOCRBlockV1
Source: inference.core.workflows.core_steps.models.foundation.google_vision_ocr.v1.GoogleVisionOCRBlockV1
Detect text in images using Google Vision OCR.
Supported types of text detection:
text_detection
: optimized for areas of text within a larger image.ocr_text_detection
: optimized for dense text documents.
You need to provide your Google Vision API key to use this block.
Type identifier¶
Use the following identifier in step "type"
field: roboflow_core/google_vision_ocr@v1
to add the block as
as step in your workflow.
Properties¶
Name | Type | Description | Refs |
---|---|---|---|
name |
str |
Enter a unique identifier for this step.. | ❌ |
ocr_type |
str |
Type of OCR to use. | ❌ |
language_hints |
List[str] |
Optional list of language codes to pass to the OCR API. If not provided, the API will attempt to detect the language automatically.If provided, language codes must be supported by the OCR API, visit https://cloud.google.com/vision/docs/languages for list of supported language codes.. | ❌ |
api_key |
str |
Your Google Vision API key. | ✅ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow
runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to Google Vision OCR
in version v1
.
- inputs:
Reference Path Visualization
,Blur Visualization
,Pixelate Visualization
,Anthropic Claude
,Multi-Label Classification Model
,Single-Label Classification Model
,LMM
,Email Notification
,Classification Label Visualization
,Llama 3.2 Vision
,CSV Formatter
,Background Color Visualization
,VLM as Detector
,Dynamic Crop
,Keypoint Visualization
,Camera Focus
,Mask Visualization
,Webhook Sink
,Clip Comparison
,Image Slicer
,Google Vision OCR
,Twilio SMS Notification
,Absolute Static Crop
,Model Monitoring Inference Aggregator
,Stability AI Image Generation
,Florence-2 Model
,Image Blur
,LMM For Classification
,Florence-2 Model
,Roboflow Dataset Upload
,CogVLM
,Circle Visualization
,OCR Model
,Grid Visualization
,Crop Visualization
,Image Convert Grayscale
,Image Threshold
,Trace Visualization
,Polygon Visualization
,Triangle Visualization
,Stability AI Inpainting
,OpenAI
,Stitch OCR Detections
,Halo Visualization
,Dot Visualization
,Polygon Zone Visualization
,Google Gemini
,Local File Sink
,Instance Segmentation Model
,Roboflow Custom Metadata
,OpenAI
,VLM as Classifier
,Camera Calibration
,Slack Notification
,Object Detection Model
,SIFT
,Corner Visualization
,Image Contours
,Model Comparison Visualization
,Stitch Images
,Bounding Box Visualization
,Line Counter Visualization
,Image Slicer
,Roboflow Dataset Upload
,Keypoint Detection Model
,Perspective Correction
,Image Preprocessing
,SIFT Comparison
,Label Visualization
,Relative Static Crop
,Color Visualization
,Ellipse Visualization
- outputs:
Classification Label Visualization
,Webhook Sink
,Background Color Visualization
,Dynamic Crop
,Cache Get
,Mask Visualization
,Clip Comparison
,Twilio SMS Notification
,Google Vision OCR
,Segment Anything 2 Model
,Detection Offset
,Model Monitoring Inference Aggregator
,Stability AI Image Generation
,LMM For Classification
,Florence-2 Model
,Image Blur
,Cache Set
,Roboflow Dataset Upload
,CogVLM
,Circle Visualization
,Crop Visualization
,Path Deviation
,OpenAI
,Stitch OCR Detections
,Velocity
,Detections Stitch
,OpenAI
,Pixel Color Count
,Label Visualization
,Detections Stabilizer
,Path Deviation
,Line Counter
,Time in Zone
,Model Comparison Visualization
,Bounding Box Visualization
,Perspective Correction
,SIFT Comparison
,Slack Notification
,Color Visualization
,Ellipse Visualization
,Reference Path Visualization
,Anthropic Claude
,Blur Visualization
,Pixelate Visualization
,Email Notification
,LMM
,Llama 3.2 Vision
,Instance Segmentation Model
,Keypoint Visualization
,Time in Zone
,Byte Tracker
,Florence-2 Model
,Detections Filter
,Detections Transformation
,YOLO-World Model
,Trace Visualization
,Image Threshold
,Triangle Visualization
,Polygon Visualization
,Stability AI Inpainting
,Detections Consensus
,Halo Visualization
,Dot Visualization
,Google Gemini
,Polygon Zone Visualization
,Detections Merge
,CLIP Embedding Model
,Local File Sink
,Size Measurement
,Instance Segmentation Model
,Roboflow Custom Metadata
,Detections Classes Replacement
,Corner Visualization
,Roboflow Dataset Upload
,Line Counter Visualization
,Byte Tracker
,Image Preprocessing
,Byte Tracker
,Line Counter
,Distance Measurement
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
Google Vision OCR
in version v1
has.
Bindings
Example JSON definition of step Google Vision OCR
in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/google_vision_ocr@v1",
"image": "$inputs.image",
"ocr_type": "<block_does_not_provide_example>",
"language_hints": [
"en",
"fr"
],
"api_key": "xxx-xxx"
}