CLIP Embedding Model¶
Class: ClipModelBlockV1
Source: inference.core.workflows.core_steps.models.foundation.clip.v1.ClipModelBlockV1
Use a CLIP model to create semantic embeddings of text and images.
This block accepts an image or string and returns an embedding. The embedding can be used to compare the similarity between different images or between images and text.
Type identifier¶
Use the following identifier in step "type" field: roboflow_core/clip@v1to add the block as
as step in your workflow.
Properties¶
| Name | Type | Description | Refs |
|---|---|---|---|
name |
str |
Unique name of step in workflows. | ❌ |
data |
str |
The string or image to generate an embedding for.. | ✅ |
version |
str |
Variant of CLIP model. | ✅ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to CLIP Embedding Model in version v1.
- inputs:
VLM as Detector,Google Vision OCR,Classification Label Visualization,Circle Visualization,Image Contours,Relative Static Crop,Image Preprocessing,LMM For Classification,VLM as Classifier,Ellipse Visualization,Stitch Images,Triangle Visualization,Stability AI Inpainting,QR Code Generator,Image Slicer,Background Color Visualization,Model Monitoring Inference Aggregator,OCR Model,Dot Visualization,Florence-2 Model,SIFT,Morphological Transformation,EasyOCR,Reference Path Visualization,Halo Visualization,SIFT Comparison,Polygon Visualization,Florence-2 Model,Image Slicer,Slack Notification,Clip Comparison,Image Convert Grayscale,Instance Segmentation Model,OpenAI,Color Visualization,Keypoint Detection Model,Google Gemini,Label Visualization,Email Notification,Llama 3.2 Vision,Trace Visualization,Email Notification,Corner Visualization,Mask Visualization,CogVLM,OpenAI,Roboflow Custom Metadata,Stability AI Outpainting,Stitch OCR Detections,Blur Visualization,CSV Formatter,Crop Visualization,OpenAI,Grid Visualization,Twilio SMS Notification,Perspective Correction,Absolute Static Crop,Single-Label Classification Model,Roboflow Dataset Upload,Contrast Equalization,Roboflow Dataset Upload,Polygon Zone Visualization,Stability AI Image Generation,Webhook Sink,Depth Estimation,Bounding Box Visualization,Camera Focus,Line Counter Visualization,Multi-Label Classification Model,Icon Visualization,Image Blur,Pixelate Visualization,Image Threshold,Anthropic Claude,LMM,Google Gemini,Dynamic Crop,Model Comparison Visualization,Camera Calibration,Local File Sink,Keypoint Visualization,Object Detection Model - outputs:
Cosine Similarity,Identify Outliers,Identify Changes
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
CLIP Embedding Model in version v1 has.
Bindings
Example JSON definition of step CLIP Embedding Model in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/clip@v1",
"data": "$inputs.image",
"version": "ViT-B-16"
}