Perception Encoder Embedding Model
Class: PerceptionEncoderModelBlockV1
Use the Meta Perception Encoder model to create semantic embeddings of text and images.
This block accepts an image or string and returns an embedding. The embedding can be used to compare similarity between different images or between images and text.
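As an illustration of the similarity comparison mentioned above, here is a minimal sketch of cosine similarity between two embeddings in plain Python. The three-element vectors are made-up stand-ins, not real Perception Encoder outputs, and the helper name `cosine_similarity` is hypothetical rather than part of the block's API.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same dimensionality")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Toy vectors standing in for embeddings (assumed values, for illustration only).
image_embedding = [0.1, 0.3, 0.5]
matching_text_embedding = [0.2, 0.6, 1.0]   # points in the same direction as the image
unrelated_text_embedding = [0.5, -0.3, 0.0]  # points in a different direction

match_score = cosine_similarity(image_embedding, matching_text_embedding)
other_score = cosine_similarity(image_embedding, unrelated_text_embedding)

# A higher score means the two embeddings are more similar.
print(match_score > other_score)
```

Within a workflow, the Cosine Similarity block listed among this block's outputs would normally perform this comparison; the sketch only shows the underlying arithmetic.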
Type identifier
Use the following identifier in the step "type" field: roboflow_core/perception_encoder@v1 to add the block as
a step in your workflow.
Properties
| Name | Type | Description | Refs |
|---|---|---|---|
| name | str | Unique name of step in workflows. | โ |
| data | str | The string or image to generate an embedding for. | โ |
| version | str | Variant of Perception Encoder model. | โ |
The Refs column indicates whether a property can be parametrised with dynamic values available
at workflow runtime. See Bindings for more info.
Available Connections
Compatible Blocks
Check what blocks you can connect to Perception Encoder Embedding Model in version v1.
- inputs: Anthropic Claude, Instance Segmentation Model, Google Vision OCR, Circle Visualization, Image Slicer, Google Gemini, Image Contours, Qwen 3.6 API, Single-Label Classification Model, Roboflow Vision Events, Depth Estimation, Line Counter Visualization, Stitch Images, Morphological Transformation, LMM, Model Comparison Visualization, MoonshotAI Kimi, Grid Visualization, Twilio SMS/MMS Notification, OpenAI, Twilio SMS Notification, Qwen-VL, S3 Sink, Halo Visualization, Camera Focus, SIFT Comparison, Keypoint Detection Model, Local File Sink, Mask Visualization, SIFT, Anthropic Claude, MoonshotAI Kimi, Roboflow Dataset Upload, Text Display, Image Slicer, Multi-Label Classification Model, Absolute Static Crop, Llama 3.2 Vision, GLM-OCR, Object Detection Model, Roboflow Custom Metadata, Email Notification, Dynamic Crop, OpenRouter, Model Monitoring Inference Aggregator, Contrast Enhancement, Webhook Sink, Google Gemma API, Stability AI Image Generation, Color Visualization, Heatmap Visualization, Contrast Equalization, Google Gemini, Roboflow Dataset Upload, Slack Notification, Anthropic Claude, QR Code Generator, Clip Comparison, OpenAI, Bounding Box Visualization, Florence-2 Model, VLM As Classifier, Image Blur, Dot Visualization, Polygon Zone Visualization, Label Visualization, Icon Visualization, Image Threshold, LMM For Classification, Blur Visualization, Relative Static Crop, Trace Visualization, Qwen3.5-VL, Florence-2 Model, Camera Focus, CogVLM, Pixelate Visualization, Llama 3.2 Vision, Image Convert Grayscale, Keypoint Visualization, Polygon Visualization, Google Gemma, Classification Label Visualization, Morphological Transformation, Camera Calibration, Google Gemini, Email Notification, Image Preprocessing, Corner Visualization, Stitch OCR Detections, Halo Visualization, Roboflow Asset Library Attributes, Reference Path Visualization, OpenAI, Background Color Visualization, Ellipse Visualization, CSV Formatter, Stability AI Outpainting, VLM As Detector, EasyOCR, Triangle Visualization, OCR Model, Crop Visualization, Perspective Correction, Qwen 3.5 API, Stitch OCR Detections, OpenAI-Compatible LLM, Polygon Visualization, Background Subtraction, OpenAI, Stability AI Inpainting
- outputs: Identify Outliers, Cosine Similarity, Identify Changes
Input and Output Bindings
The available connections depend on the block's binding kinds. Check which binding kinds
Perception Encoder Embedding Model in version v1 has.
Bindings
Example JSON definition of step Perception Encoder Embedding Model in version v1
{
    "name": "<your_step_name_here>",
    "type": "roboflow_core/perception_encoder@v1",
    "data": "$inputs.image",
    "version": "PE-Core-B16-224"
}
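The step definition above can also be assembled programmatically. The following is a rough sketch in Python: the helper `make_perception_encoder_step` is hypothetical, and the surrounding workflow envelope (the `version`, `inputs`, `steps`, and `outputs` keys, the `WorkflowImage` and `JsonField` types, and an output field named `embedding`) is an assumption about the workflow definition format that this page does not confirm.

```python
import json

# Type identifier taken from this page.
STEP_TYPE = "roboflow_core/perception_encoder@v1"


def make_perception_encoder_step(name, data, version="PE-Core-B16-224"):
    """Build the step dictionary using the three properties documented above."""
    if not name:
        raise ValueError("step name must be a non-empty string")
    return {"name": name, "type": STEP_TYPE, "data": data, "version": version}


# "$inputs.image" refers to a workflow input named "image", as in the example above.
step = make_perception_encoder_step("image_embedding", "$inputs.image")

# Assumed envelope: the keys and types below are not documented on this page.
workflow_definition = {
    "version": "1.0",
    "inputs": [{"type": "WorkflowImage", "name": "image"}],
    "steps": [step],
    "outputs": [
        {
            "type": "JsonField",
            "name": "embedding",
            # Assumes the step exposes an output called "embedding".
            "selector": "$steps.image_embedding.embedding",
        }
    ],
}

print(json.dumps(workflow_definition, indent=2))
```

Building the step from a function keeps the type identifier in one place, which may help when a workflow contains several embedding steps, for example one for an image and one for a text prompt.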