Perception Encoder Embedding Model¶
Class: PerceptionEncoderModelBlockV1
Use the Meta Perception Encoder model to create semantic embeddings of text and images.
This block accepts an image or string and returns an embedding. The embedding can be used to compare similarity between different images or between images and text.
Type identifier¶
Use the following identifier in step "type" field: roboflow_core/perception_encoder@v1to add the block as
as step in your workflow.
Properties¶
| Name | Type | Description | Refs |
|---|---|---|---|
name |
str |
Unique name of step in workflows. | ❌ |
data |
str |
The string or image to generate an embedding for.. | ✅ |
version |
str |
Variant of Perception Encoder model. | ✅ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to Perception Encoder Embedding Model in version v1.
- inputs:
LMM,Background Color Visualization,Stitch Images,Image Slicer,Model Monitoring Inference Aggregator,Corner Visualization,Camera Calibration,Mask Visualization,Object Detection Model,Local File Sink,Model Comparison Visualization,Email Notification,Pixelate Visualization,Anthropic Claude,Relative Static Crop,Google Gemini,Florence-2 Model,Multi-Label Classification Model,Ellipse Visualization,Triangle Visualization,Camera Focus,OCR Model,QR Code Generator,Label Visualization,Roboflow Custom Metadata,Florence-2 Model,LMM For Classification,Blur Visualization,Dot Visualization,Stability AI Image Generation,Perspective Correction,Google Vision OCR,Llama 3.2 Vision,EasyOCR,Absolute Static Crop,Slack Notification,Morphological Transformation,Image Blur,Image Threshold,Clip Comparison,Stitch OCR Detections,Depth Estimation,Stability AI Outpainting,Halo Visualization,Stability AI Inpainting,Polygon Visualization,OpenAI,Grid Visualization,Roboflow Dataset Upload,CogVLM,Classification Label Visualization,Email Notification,VLM as Detector,Instance Segmentation Model,Bounding Box Visualization,Image Convert Grayscale,Polygon Zone Visualization,OpenAI,Keypoint Detection Model,Crop Visualization,Image Slicer,Icon Visualization,Color Visualization,Roboflow Dataset Upload,Keypoint Visualization,Contrast Equalization,Image Contours,Circle Visualization,OpenAI,VLM as Classifier,CSV Formatter,Reference Path Visualization,Twilio SMS Notification,Dynamic Crop,Webhook Sink,Single-Label Classification Model,SIFT,Line Counter Visualization,Image Preprocessing,Trace Visualization,SIFT Comparison - outputs:
Identify Changes,Identify Outliers,Cosine Similarity
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
Perception Encoder Embedding Model in version v1 has.
Bindings
Example JSON definition of step Perception Encoder Embedding Model in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/perception_encoder@v1",
"data": "$inputs.image",
"version": "PE-Core-B16-224"
}