Perception Encoder Embedding Model¶
Class: PerceptionEncoderModelBlockV1
Use the Meta Perception Encoder model to create semantic embeddings of text and images.
This block accepts an image or string and returns an embedding. The embedding can be used to compare similarity between different images or between images and text.
Type identifier¶
Use the following identifier in step "type"
field: roboflow_core/perception_encoder@v1
to add the block as
as step in your workflow.
Properties¶
Name | Type | Description | Refs |
---|---|---|---|
name |
str |
Unique name of step in workflows. | ❌ |
data |
str |
The string or image to generate an embedding for.. | ✅ |
version |
str |
Variant of Perception Encoder model. | ✅ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow
runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to Perception Encoder Embedding Model
in version v1
.
- inputs:
Keypoint Visualization
,Google Gemini
,Keypoint Detection Model
,Image Contours
,Circle Visualization
,Image Threshold
,Absolute Static Crop
,Perspective Correction
,Color Visualization
,Instance Segmentation Model
,Reference Path Visualization
,Stitch Images
,Image Blur
,Florence-2 Model
,Blur Visualization
,Local File Sink
,Relative Static Crop
,Halo Visualization
,Clip Comparison
,Stability AI Inpainting
,SIFT Comparison
,Icon Visualization
,Roboflow Custom Metadata
,Polygon Zone Visualization
,Depth Estimation
,Stability AI Image Generation
,Dynamic Crop
,Grid Visualization
,Crop Visualization
,Stitch OCR Detections
,Camera Calibration
,VLM as Classifier
,QR Code Generator
,SIFT
,Camera Focus
,Model Comparison Visualization
,Twilio SMS Notification
,Llama 3.2 Vision
,Triangle Visualization
,Line Counter Visualization
,Email Notification
,LMM
,Roboflow Dataset Upload
,CSV Formatter
,Image Slicer
,Mask Visualization
,Single-Label Classification Model
,OCR Model
,Pixelate Visualization
,Webhook Sink
,Object Detection Model
,Slack Notification
,Dot Visualization
,Image Slicer
,Roboflow Dataset Upload
,Classification Label Visualization
,OpenAI
,Model Monitoring Inference Aggregator
,Polygon Visualization
,OpenAI
,LMM For Classification
,Stability AI Outpainting
,Trace Visualization
,Bounding Box Visualization
,Image Preprocessing
,Multi-Label Classification Model
,Image Convert Grayscale
,Google Vision OCR
,Label Visualization
,CogVLM
,Corner Visualization
,Background Color Visualization
,Florence-2 Model
,VLM as Detector
,Ellipse Visualization
,OpenAI
,Anthropic Claude
- outputs:
Identify Outliers
,Identify Changes
,Cosine Similarity
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
Perception Encoder Embedding Model
in version v1
has.
Bindings
Example JSON definition of step Perception Encoder Embedding Model
in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/perception_encoder@v1",
"data": "$inputs.image",
"version": "PE-Core-B16-224"
}