Perception Encoder Embedding Model
Class: PerceptionEncoderModelBlockV1
Use the Meta Perception Encoder model to create semantic embeddings of text and images.
This block accepts an image or string and returns an embedding. The embedding can be used to compare similarity between different images or between images and text.
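As a rough illustration of what "comparing similarity" between embeddings means, the sketch below computes cosine similarity between plain Python lists. The three short vectors are made-up stand-ins for real embeddings (which have far more dimensions), and `cosine_similarity` is a hypothetical helper written for this example, not part of the block's API.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("embeddings must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


# Toy 3-dimensional vectors standing in for real embeddings (assumed values).
image_embedding = [0.2, 0.9, 0.1]
text_embedding = [0.25, 0.8, 0.05]
unrelated_embedding = [0.9, -0.1, 0.4]

matching_score = cosine_similarity(image_embedding, text_embedding)
unrelated_score = cosine_similarity(image_embedding, unrelated_embedding)
print(f"matching: {matching_score:.3f}, unrelated: {unrelated_score:.3f}")
```

A score close to 1 suggests the two inputs point in nearly the same direction in embedding space, while a score near 0 suggests they are unrelated.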
Type identifier
Use the following identifier in the step "type" field: roboflow_core/perception_encoder@v1 to add the block as
a step in your workflow.
Properties
| Name | Type | Description | Refs |
|---|---|---|---|
| name | str | Unique name of step in workflows. | ❌ |
| data | str | The string or image to generate an embedding for. | ✅ |
| version | str | Variant of Perception Encoder model. | ✅ |
The Refs column indicates whether a property can be parametrised with dynamic values available
at workflow runtime. See Bindings for more info.
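The difference between a static and a dynamic property value can be sketched as follows. This is a minimal illustration that assumes a workflow input named `pe_version` exists (a hypothetical name); `is_selector` is a helper invented for this example and only checks for the `$` prefix used by selectors such as `$inputs.image`.

```python
# Static definition: the model variant is fixed in the step itself.
static_step = {
    "name": "pe_embedding",
    "type": "roboflow_core/perception_encoder@v1",
    "data": "$inputs.image",
    "version": "PE-Core-B16-224",
}

# Dynamic definition: the variant is resolved at runtime from a workflow
# input. "pe_version" is a hypothetical input name used for illustration.
dynamic_step = {**static_step, "version": "$inputs.pe_version"}


def is_selector(value: object) -> bool:
    """Return True if a property value looks like a runtime selector."""
    return isinstance(value, str) and value.startswith("$")


# List which properties of each step are bound to runtime values.
static_bound = sorted(k for k, v in static_step.items() if is_selector(v))
dynamic_bound = sorted(k for k, v in dynamic_step.items() if is_selector(v))
print(static_bound)
print(dynamic_bound)
```

Only properties marked ✅ in the Refs column (here `data` and `version`) accept selectors; `name` must stay a literal string.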
Available Connections
Compatible Blocks
Check which blocks you can connect to Perception Encoder Embedding Model in version v1.
- inputs: Llama 3.2 Vision, Contrast Equalization, Clip Comparison, Anthropic Claude, VLM as Detector, Local File Sink, Polygon Visualization, QR Code Generator, Image Blur, SIFT Comparison, Email Notification, Roboflow Dataset Upload, Text Display, Model Comparison Visualization, Camera Focus, SIFT, LMM, Google Vision OCR, Mask Visualization, Anthropic Claude, Relative Static Crop, Circle Visualization, EasyOCR, Pixelate Visualization, Stability AI Inpainting, Reference Path Visualization, Instance Segmentation Model, Perspective Correction, Ellipse Visualization, Crop Visualization, Halo Visualization, Image Threshold, Keypoint Detection Model, CSV Formatter, Florence-2 Model, Twilio SMS Notification, Image Convert Grayscale, Corner Visualization, Image Preprocessing, Icon Visualization, Background Subtraction, Image Contours, Image Slicer, Depth Estimation, Multi-Label Classification Model, Stitch Images, Dynamic Crop, VLM as Classifier, Model Monitoring Inference Aggregator, Bounding Box Visualization, Line Counter Visualization, Blur Visualization, Morphological Transformation, Single-Label Classification Model, Camera Calibration, Polygon Zone Visualization, Email Notification, OCR Model, Keypoint Visualization, Roboflow Custom Metadata, Google Gemini, OpenAI, OpenAI, CogVLM, Camera Focus, Trace Visualization, Color Visualization, Absolute Static Crop, Image Slicer, Dot Visualization, Label Visualization, Slack Notification, Florence-2 Model, Google Gemini, Google Gemini, Grid Visualization, Object Detection Model, LMM For Classification, OpenAI, Stitch OCR Detections, OpenAI, Stitch OCR Detections, Classification Label Visualization, Roboflow Dataset Upload, Background Color Visualization, Stability AI Outpainting, Twilio SMS/MMS Notification, Anthropic Claude, Triangle Visualization, Stability AI Image Generation, Webhook Sink
- outputs: Cosine Similarity, Identify Changes, Identify Outliers
Input and Output Bindings
The connections available to a block depend on its binding kinds. Check which binding kinds
Perception Encoder Embedding Model in version v1 has.
Bindings
Example JSON definition of step Perception Encoder Embedding Model in version v1
{
  "name": "<your_step_name_here>",
  "type": "roboflow_core/perception_encoder@v1",
  "data": "$inputs.image",
  "version": "PE-Core-B16-224"
}
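To show how this step might sit inside a complete workflow, the sketch below builds a definition in Python that embeds an image and a text prompt and feeds both into the Cosine Similarity block listed under outputs. Several names here are assumptions that should be checked against the relevant block pages: the `roboflow_core/cosine_similarity@v1` identifier, its `embedding_1` and `embedding_2` properties, the `embedding` and `similarity` output names, the `WorkflowImage`, `WorkflowParameter`, and `JsonField` type names, and the input name `prompt`. `build_similarity_workflow` is a hypothetical helper.

```python
import json

# Values taken from the example step definition on this page.
PE_BLOCK_TYPE = "roboflow_core/perception_encoder@v1"
PE_VERSION = "PE-Core-B16-224"

# Assumption: identifier of the Cosine Similarity block; verify on its page.
COSINE_BLOCK_TYPE = "roboflow_core/cosine_similarity@v1"


def build_similarity_workflow() -> dict:
    """Build a workflow definition comparing an image with a text prompt."""
    return {
        "version": "1.0",
        "inputs": [
            {"type": "WorkflowImage", "name": "image"},
            {"type": "WorkflowParameter", "name": "prompt"},  # assumed name
        ],
        "steps": [
            {
                "name": "image_embedding",
                "type": PE_BLOCK_TYPE,
                "data": "$inputs.image",
                "version": PE_VERSION,
            },
            {
                "name": "text_embedding",
                "type": PE_BLOCK_TYPE,
                "data": "$inputs.prompt",
                "version": PE_VERSION,
            },
            {
                "name": "similarity",
                "type": COSINE_BLOCK_TYPE,
                # Assumed property and output names, not confirmed here.
                "embedding_1": "$steps.image_embedding.embedding",
                "embedding_2": "$steps.text_embedding.embedding",
            },
        ],
        "outputs": [
            {
                "type": "JsonField",
                "name": "similarity",
                "selector": "$steps.similarity.similarity",
            },
        ],
    }


workflow = build_similarity_workflow()
print(json.dumps(workflow, indent=2))
```

Both embedding steps use the same model variant so that the image and text vectors live in the same embedding space and can be compared meaningfully.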