Perception Encoder Embedding Model
Class: PerceptionEncoderModelBlockV1
Use the Meta Perception Encoder model to create semantic embeddings of text and images.
This block accepts an image or string and returns an embedding. The embedding can be used to compare similarity between different images or between images and text.
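As a rough illustration of how two embeddings can be compared, the sketch below computes cosine similarity in plain Python. The vectors are made-up 4-dimensional stand-ins, not real Perception Encoder output, and the helper name `cosine_similarity` is chosen here for illustration only.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimensionality")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Toy vectors standing in for real embeddings (values are invented).
image_embedding = [0.2, 0.1, 0.9, 0.3]
text_embedding_match = [0.25, 0.05, 0.85, 0.35]
text_embedding_other = [0.9, 0.4, 0.1, 0.0]

score_match = cosine_similarity(image_embedding, text_embedding_match)
score_other = cosine_similarity(image_embedding, text_embedding_other)

# A higher score means the two vectors point in a more similar direction.
print(f"match: {score_match:.3f}, other: {score_other:.3f}")
```

In a workflow, the same comparison would typically be delegated to a downstream block such as Cosine Similarity, which is listed among the compatible outputs.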
Type identifier
Use the following identifier in the step "type" field: roboflow_core/perception_encoder@v1 to add the block as a step in your workflow.
Properties
Name | Type | Description | Refs |
---|---|---|---|
name | str | Unique name of step in workflows. | ❌ |
data | str | The string or image to generate an embedding for. | ✅ |
version | str | Variant of Perception Encoder model. | ✅ |
The Refs column indicates whether a property can be parametrised with dynamic values available at workflow runtime. See Bindings for more info.
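To make the distinction concrete, the sketch below contrasts a step whose version is a fixed literal with one whose version is supplied at runtime. The input name `pe_version`, the step name `embed_image`, and the helper functions are hypothetical; the assumption that dynamic values are written as selectors beginning with `$inputs.` or `$steps.` follows the `$inputs.image` form used in the example step definition.

```python
def is_selector(value):
    """Return True if value looks like a selector such as "$inputs.image".

    Assumption: dynamic references start with "$inputs." or "$steps.".
    """
    return isinstance(value, str) and value.startswith(("$inputs.", "$steps."))


def dynamic_properties(step):
    """List the property names in a step definition that hold selectors."""
    return [key for key, value in step.items() if is_selector(value)]


# Version fixed when the workflow is written.
static_step = {
    "name": "embed_image",
    "type": "roboflow_core/perception_encoder@v1",
    "data": "$inputs.image",
    "version": "PE-Core-B16-224",
}

# Version resolved at runtime from a hypothetical workflow input.
dynamic_step = dict(static_step, version="$inputs.pe_version")

print(dynamic_properties(static_step))
print(dynamic_properties(dynamic_step))
```

Only properties marked ✅ in the Refs column (data and version) are candidates for this kind of substitution; name must remain a literal string.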
Available Connections
Compatible Blocks
Check what blocks you can connect to Perception Encoder Embedding Model in version v1.
- inputs: Anthropic Claude, Crop Visualization, Stitch OCR Detections, SIFT, LMM For Classification, Blur Visualization, Line Counter Visualization, Color Visualization, Image Contours, Camera Focus, Mask Visualization, Image Convert Grayscale, Google Gemini, Circle Visualization, Absolute Static Crop, VLM as Classifier, Object Detection Model, Stitch Images, Trace Visualization, Image Preprocessing, Roboflow Custom Metadata, OCR Model, LMM, Polygon Zone Visualization, QR Code Generator, Halo Visualization, Florence-2 Model, Perspective Correction, Stability AI Inpainting, Webhook Sink, Label Visualization, VLM as Detector, Stability AI Image Generation, Keypoint Detection Model, Triangle Visualization, Background Color Visualization, Slack Notification, Relative Static Crop, Corner Visualization, Multi-Label Classification Model, Icon Visualization, Pixelate Visualization, Image Blur, Model Comparison Visualization, CSV Formatter, Instance Segmentation Model, Llama 3.2 Vision, Image Threshold, Google Vision OCR, Reference Path Visualization, Image Slicer, Roboflow Dataset Upload, CogVLM, Depth Estimation, Roboflow Dataset Upload, Single-Label Classification Model, OpenAI, Classification Label Visualization, Polygon Visualization, Stability AI Outpainting, Keypoint Visualization, Dot Visualization, Email Notification, Grid Visualization, Local File Sink, OpenAI, Bounding Box Visualization, Camera Calibration, Ellipse Visualization, OpenAI, Florence-2 Model, Model Monitoring Inference Aggregator, Twilio SMS Notification, Image Slicer, SIFT Comparison, Clip Comparison, Dynamic Crop
- outputs: Cosine Similarity, Identify Changes, Identify Outliers
Input and Output Bindings
The available connections depend on the block's binding kinds. Check which binding kinds Perception Encoder Embedding Model in version v1 has.
Example JSON definition of step Perception Encoder Embedding Model in version v1
{
    "name": "<your_step_name_here>",
    "type": "roboflow_core/perception_encoder@v1",
    "data": "$inputs.image",
    "version": "PE-Core-B16-224"
}
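For context, the sketch below shows one way the step might sit inside a complete workflow definition, built as a Python dictionary and serialised to JSON. Everything outside the step itself is an assumption not stated on this page: the top-level keys (`version`, `inputs`, `steps`, `outputs`), the `WorkflowImage` and `JsonField` type names, the step name `pe_embedding`, and the output field name `embedding`. The helper `undeclared_inputs` is hypothetical and only checks that every `$inputs.` selector refers to a declared input.

```python
import json

# The step definition from this page, with a concrete (hypothetical) name.
PE_STEP = {
    "name": "pe_embedding",
    "type": "roboflow_core/perception_encoder@v1",
    "data": "$inputs.image",
    "version": "PE-Core-B16-224",
}

# Assumed surrounding structure; verify against your workflow schema.
workflow = {
    "version": "1.0",
    "inputs": [{"type": "WorkflowImage", "name": "image"}],
    "steps": [PE_STEP],
    "outputs": [
        {
            "type": "JsonField",
            "name": "embedding",
            # Assumes the block exposes an output field called "embedding".
            "selector": "$steps.pe_embedding.embedding",
        }
    ],
}


def undeclared_inputs(spec):
    """Return names referenced via "$inputs.<name>" in steps but not declared."""
    declared = {item["name"] for item in spec["inputs"]}
    referenced = set()
    for step in spec["steps"]:
        for value in step.values():
            if isinstance(value, str) and value.startswith("$inputs."):
                referenced.add(value[len("$inputs."):])
    return sorted(referenced - declared)


workflow_json = json.dumps(workflow, indent=2)
print(workflow_json)
print("undeclared inputs:", undeclared_inputs(workflow))
```

Running a check like this before submitting a definition can catch a mistyped input name early, without contacting any server.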