Perception Encoder Embedding Model
Class: PerceptionEncoderModelBlockV1
Use the Meta Perception Encoder model to create semantic embeddings of text and images.
This block accepts an image or string and returns an embedding. The embedding can be used to compare similarity between different images or between images and text.
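As a rough illustration of how two embeddings can be compared, the sketch below computes cosine similarity in plain Python. The vectors are made-up toy values, not real Perception Encoder output, and `cosine_similarity` is a hypothetical helper, not part of any Roboflow API.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimensionality")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Toy vectors standing in for an image embedding and a text embedding.
image_embedding = [0.1, 0.3, 0.5]
text_embedding = [0.2, 0.6, 1.0]  # same direction as image_embedding

score = cosine_similarity(image_embedding, text_embedding)
print(f"similarity: {score:.4f}")
```

Scores near 1 indicate vectors pointing in nearly the same direction, while scores near 0 indicate unrelated directions.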
Type identifier
Use the following identifier in the step "type" field: roboflow_core/perception_encoder@v1 to add the block as
a step in your workflow.
Properties
| Name | Type | Description | Refs |
|---|---|---|---|
| name | str | Unique name of step in workflows. | ❌ |
| data | str | The string or image to generate an embedding for. | ✅ |
| version | str | Variant of Perception Encoder model. | ✅ |
The Refs column indicates whether a property can be parametrised with dynamic values available
at workflow runtime. See Bindings for more info.
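To make the Refs column concrete, here is a minimal sketch of a step definition in which both Refs-enabled properties (`data` and `version`) are bound to runtime values instead of literals. The input names `prompt` and `pe_version` and the helper `dynamic_fields` are hypothetical and exist only for this illustration.

```python
from typing import Any, Dict, List

# Step definition with both Refs-enabled properties bound to runtime values.
# "$inputs.prompt" and "$inputs.pe_version" are hypothetical input names.
step: Dict[str, Any] = {
    "name": "pe_embedding",  # Refs: ❌, so this must be a literal string
    "type": "roboflow_core/perception_encoder@v1",
    "data": "$inputs.prompt",
    "version": "$inputs.pe_version",
}


def dynamic_fields(step_definition: Dict[str, Any]) -> List[str]:
    """List the properties whose values are selectors (strings starting with '$')."""
    return sorted(
        key
        for key, value in step_definition.items()
        if isinstance(value, str) and value.startswith("$")
    )


print(dynamic_fields(step))
```

A property set to a literal such as "PE-Core-B16-224" is fixed in the definition, whereas a selector is resolved each time the workflow runs.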
Available Connections
Compatible Blocks
Check which blocks you can connect to Perception Encoder Embedding Model in version v1.
- inputs: Email Notification, Contrast Equalization, Instance Segmentation Model, Google Vision OCR, Grid Visualization, S3 Sink, Stability AI Image Generation, Model Comparison Visualization, Absolute Static Crop, Keypoint Visualization, SIFT, Trace Visualization, Roboflow Dataset Upload, Twilio SMS/MMS Notification, QR Code Generator, Model Monitoring Inference Aggregator, GLM-OCR, Reference Path Visualization, Halo Visualization, OCR Model, SIFT Comparison, VLM As Classifier, Image Preprocessing, Crop Visualization, OpenAI, OpenAI, Label Visualization, Classification Label Visualization, Pixelate Visualization, Local File Sink, Twilio SMS Notification, Email Notification, Qwen3.5-VL, Stitch OCR Detections, Corner Visualization, Stitch Images, Background Subtraction, Stitch OCR Detections, LMM For Classification, EasyOCR, Morphological Transformation, CSV Formatter, OpenAI, Clip Comparison, Image Threshold, Background Color Visualization, Anthropic Claude, Google Gemini, Camera Calibration, Halo Visualization, Stability AI Outpainting, Roboflow Custom Metadata, CogVLM, OpenAI, Single-Label Classification Model, Ellipse Visualization, Heatmap Visualization, Image Convert Grayscale, Triangle Visualization, Image Blur, Depth Estimation, Color Visualization, Camera Focus, Text Display, Anthropic Claude, Dot Visualization, Image Slicer, Keypoint Detection Model, Polygon Visualization, Florence-2 Model, Circle Visualization, Blur Visualization, Multi-Label Classification Model, Google Gemini, LMM, Slack Notification, Icon Visualization, Camera Focus, Stability AI Inpainting, Polygon Visualization, Webhook Sink, Polygon Zone Visualization, Perspective Correction, Florence-2 Model, Anthropic Claude, Mask Visualization, Google Gemini, Image Contours, Dynamic Crop, Roboflow Dataset Upload, Llama 3.2 Vision, VLM As Detector, Object Detection Model, Image Slicer, Line Counter Visualization, Relative Static Crop, Bounding Box Visualization
- outputs: Identify Changes, Cosine Similarity, Identify Outliers
Input and Output Bindings
The available connections depend on the block's binding kinds. Check which binding kinds
Perception Encoder Embedding Model in version v1 has.
Bindings
Example JSON definition of step Perception Encoder Embedding Model in version v1
{
    "name": "<your_step_name_here>",
    "type": "roboflow_core/perception_encoder@v1",
    "data": "$inputs.image",
    "version": "PE-Core-B16-224"
}
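The step above can be placed in a complete workflow definition. The sketch below builds one as a Python dictionary that embeds an image and a text prompt, then feeds both embeddings to a Cosine Similarity step (listed above as a compatible output block). Only the Perception Encoder step fields come from this page. The surrounding schema ("version", "inputs", "outputs"), the input types, the Cosine Similarity type identifier and its `vector_1`/`vector_2` fields, and the output field names `embedding` and `similarity` are assumptions that should be checked against the Workflows documentation.

```python
import json

PE_STEP_TYPE = "roboflow_core/perception_encoder@v1"
PE_VERSION = "PE-Core-B16-224"

workflow = {
    "version": "1.0",  # assumed workflow schema version
    "inputs": [
        {"type": "WorkflowImage", "name": "image"},       # assumed input type
        {"type": "WorkflowParameter", "name": "prompt"},  # assumed input type
    ],
    "steps": [
        {
            "name": "image_embedding",
            "type": PE_STEP_TYPE,
            "data": "$inputs.image",
            "version": PE_VERSION,
        },
        {
            "name": "text_embedding",
            "type": PE_STEP_TYPE,
            "data": "$inputs.prompt",
            "version": PE_VERSION,
        },
        {
            # Assumed identifier and field names for the Cosine Similarity block.
            "name": "similarity",
            "type": "roboflow_core/cosine_similarity@v1",
            "vector_1": "$steps.image_embedding.embedding",
            "vector_2": "$steps.text_embedding.embedding",
        },
    ],
    "outputs": [
        {
            "type": "JsonField",  # assumed output type
            "name": "similarity",
            "selector": "$steps.similarity.similarity",
        },
    ],
}

# Serialise to JSON, the format in which workflow definitions are written.
workflow_json = json.dumps(workflow, indent=2)
print(workflow_json)
```

Using the same model variant for both steps keeps the image and text embeddings in the same vector space, which is what makes the similarity score meaningful.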