Perception Encoder Embedding Model¶
Class: PerceptionEncoderModelBlockV1
Use the Meta Perception Encoder model to create semantic embeddings of text and images.
This block accepts an image or string and returns an embedding. The embedding can be used to compare similarity between different images or between images and text.
Type identifier¶
Use the following identifier in step "type" field: roboflow_core/perception_encoder@v1to add the block as
as step in your workflow.
Properties¶
| Name | Type | Description | Refs |
|---|---|---|---|
name |
str |
Unique name of step in workflows. | โ |
data |
str |
The string or image to generate an embedding for.. | โ |
version |
str |
Variant of Perception Encoder model. | โ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to Perception Encoder Embedding Model in version v1.
- inputs:
Roboflow Dataset Upload,Line Counter Visualization,OCR Model,Image Slicer,Instance Segmentation Model,Color Visualization,Ellipse Visualization,Polygon Visualization,Relative Static Crop,Webhook Sink,Trace Visualization,Stitch OCR Detections,Camera Focus,Qwen 3.5 API,OpenAI,Image Threshold,Heatmap Visualization,Florence-2 Model,Halo Visualization,GLM-OCR,Dot Visualization,S3 Sink,Twilio SMS Notification,Model Monitoring Inference Aggregator,Google Gemini,Roboflow Dataset Upload,Pixelate Visualization,Twilio SMS/MMS Notification,Polygon Zone Visualization,Blur Visualization,Background Subtraction,Text Display,CSV Formatter,Stability AI Image Generation,Perspective Correction,Anthropic Claude,Bounding Box Visualization,Depth Estimation,Stability AI Inpainting,Polygon Visualization,SIFT,Roboflow Vision Events,Google Gemini,Label Visualization,Grid Visualization,Qwen3.5-VL,Contrast Equalization,Triangle Visualization,Halo Visualization,Circle Visualization,Mask Visualization,OpenAI,MoonshotAI Kimi,Llama 3.2 Vision,Email Notification,Slack Notification,Object Detection Model,Stability AI Outpainting,Email Notification,Google Gemma API,Google Vision OCR,Image Preprocessing,Google Gemini,EasyOCR,OpenAI,Anthropic Claude,Model Comparison Visualization,Roboflow Custom Metadata,Single-Label Classification Model,VLM As Classifier,Stitch Images,Qwen 3.6 API,SIFT Comparison,Morphological Transformation,CogVLM,Crop Visualization,Camera Calibration,Florence-2 Model,Icon Visualization,Local File Sink,Image Contours,Reference Path Visualization,Anthropic Claude,Clip Comparison,VLM As Detector,LMM,Classification Label Visualization,Image Slicer,Absolute Static Crop,Image Blur,Multi-Label Classification Model,Image Convert Grayscale,OpenAI,Corner Visualization,Dynamic Crop,Keypoint Visualization,QR Code Generator,Camera Focus,LMM For Classification,Morphological Transformation,Keypoint Detection Model,Contrast Enhancement,Background Color Visualization,Stitch OCR Detections - outputs:
Cosine Similarity,Identify Outliers,Identify Changes
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
Perception Encoder Embedding Model in version v1 has.
Bindings
Example JSON definition of step Perception Encoder Embedding Model in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/perception_encoder@v1",
"data": "$inputs.image",
"version": "PE-Core-B16-224"
}