Perception Encoder Embedding Model¶
Class: PerceptionEncoderModelBlockV1
Use the Meta Perception Encoder model to create semantic embeddings of text and images.
This block accepts an image or string and returns an embedding. The embedding can be used to compare similarity between different images or between images and text.
Type identifier¶
Use the following identifier in step "type" field: roboflow_core/perception_encoder@v1to add the block as
as step in your workflow.
Properties¶
| Name | Type | Description | Refs |
|---|---|---|---|
name |
str |
Unique name of step in workflows. | โ |
data |
str |
The string or image to generate an embedding for.. | โ |
version |
str |
Variant of Perception Encoder model. | โ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to Perception Encoder Embedding Model in version v1.
- inputs:
S3 Sink,Email Notification,Clip Comparison,Morphological Transformation,VLM As Detector,Qwen-VL,Twilio SMS/MMS Notification,MoonshotAI Kimi,Polygon Zone Visualization,Stitch OCR Detections,OpenAI-Compatible LLM,OpenAI,Heatmap Visualization,Keypoint Visualization,Email Notification,Llama 3.2 Vision,Anthropic Claude,Stability AI Image Generation,Google Vision OCR,Camera Focus,Label Visualization,Instance Segmentation Model,Local File Sink,Google Gemini,Background Color Visualization,Qwen 3.5 API,Google Gemini,Polygon Visualization,SIFT Comparison,Grid Visualization,Florence-2 Model,OCR Model,VLM As Classifier,LMM For Classification,Keypoint Detection Model,Image Preprocessing,SIFT,Roboflow Dataset Upload,Corner Visualization,Stability AI Outpainting,Halo Visualization,Multi-Label Classification Model,Qwen3.5-VL,Blur Visualization,Morphological Transformation,Trace Visualization,Stitch OCR Detections,Reference Path Visualization,Halo Visualization,Model Comparison Visualization,Dot Visualization,Background Subtraction,Text Display,Absolute Static Crop,CSV Formatter,Florence-2 Model,Icon Visualization,Perspective Correction,Stability AI Inpainting,Image Convert Grayscale,QR Code Generator,OpenRouter,Model Monitoring Inference Aggregator,OpenAI,Llama 3.2 Vision,Image Threshold,Anthropic Claude,Dynamic Crop,Contrast Enhancement,Bounding Box Visualization,Depth Estimation,Image Contours,EasyOCR,Relative Static Crop,Polygon Visualization,Google Gemma API,Qwen 3.6 API,Image Blur,Anthropic Claude,Triangle Visualization,Object Detection Model,Roboflow Custom Metadata,OpenAI,Slack Notification,Pixelate Visualization,Stitch Images,Single-Label Classification Model,OpenAI,Image Slicer,Line Counter Visualization,Image Slicer,LMM,Roboflow Dataset Upload,Color Visualization,Google Gemini,Classification Label Visualization,Camera Focus,Camera Calibration,Ellipse Visualization,Mask Visualization,GLM-OCR,Crop Visualization,Circle Visualization,CogVLM,Contrast Equalization,Roboflow Vision Events,Webhook Sink,Twilio SMS Notification,MoonshotAI Kimi,Google Gemma - outputs:
Cosine Similarity,Identify Outliers,Identify Changes
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
Perception Encoder Embedding Model in version v1 has.
Bindings
Example JSON definition of step Perception Encoder Embedding Model in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/perception_encoder@v1",
"data": "$inputs.image",
"version": "PE-Core-B16-224"
}