YOLO-World Model¶
Class: YoloWorldModelBlockV1
Source: inference.core.workflows.core_steps.models.foundation.yolo_world.v1.YoloWorldModelBlockV1
Run YOLO-World, a zero-shot object detection model, on an image.
YOLO-World accepts one or more text classes you want to identify in an image. The model returns the location of objects that meet the specified class, if YOLO-World is able to identify objects of that class.
We recommend experimenting with YOLO-World to evaluate the model on your use case before using this block in production. For example on how to effectively prompt YOLO-World, refer to the Roboflow YOLO-World prompting guide.
Type identifier¶
Use the following identifier in step "type" field: roboflow_core/yolo_world_model@v1to add the block as
as step in your workflow.
Properties¶
| Name | Type | Description | Refs |
|---|---|---|---|
name |
str |
Enter a unique identifier for this step.. | ❌ |
class_names |
List[str] |
One or more classes that you want YOLO-World to detect. The model accepts any string as an input, though does best with short descriptions of common objects.. | ✅ |
version |
str |
Variant of YoloWorld model. | ✅ |
confidence |
float |
Confidence threshold for detections. | ✅ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to YOLO-World Model in version v1.
- inputs:
Dynamic Crop,Motion Detection,OCR Model,Email Notification,Image Blur,Background Subtraction,SIFT Comparison,OpenAI,Google Gemini,Google Vision OCR,Image Preprocessing,Instance Segmentation Model,Local File Sink,Single-Label Classification Model,Bounding Box Visualization,Anthropic Claude,Model Monitoring Inference Aggregator,Multi-Label Classification Model,Keypoint Detection Model,Email Notification,Slack Notification,Camera Focus,Twilio SMS/MMS Notification,Identify Outliers,Dot Visualization,Florence-2 Model,Roboflow Dataset Upload,CSV Formatter,Camera Focus,Stitch OCR Detections,Depth Estimation,Polygon Visualization,Perspective Correction,OpenAI,Camera Calibration,Corner Visualization,Icon Visualization,Image Slicer,Detections List Roll-Up,Line Counter Visualization,Heatmap Visualization,Morphological Transformation,Stability AI Image Generation,Google Gemini,Keypoint Visualization,VLM As Detector,Halo Visualization,Background Color Visualization,Label Visualization,Polygon Visualization,Pixelate Visualization,LMM,CogVLM,Contrast Equalization,Triangle Visualization,Stability AI Outpainting,Dimension Collapse,Mask Visualization,VLM As Classifier,Color Visualization,Text Display,Relative Static Crop,Reference Path Visualization,Stitch OCR Detections,Llama 3.2 Vision,OpenAI,Clip Comparison,Image Threshold,Clip Comparison,Classification Label Visualization,Webhook Sink,Circle Visualization,Polygon Zone Visualization,Image Contours,Image Convert Grayscale,Grid Visualization,Florence-2 Model,Buffer,Roboflow Custom Metadata,Dynamic Zone,LMM For Classification,SIFT,Halo Visualization,Object Detection Model,Detections Consensus,Anthropic Claude,Google Gemini,Model Comparison Visualization,Blur Visualization,QR Code Generator,EasyOCR,Absolute Static Crop,Image Slicer,S3 Sink,Anthropic Claude,Stability AI Inpainting,Ellipse Visualization,Identify Changes,Crop Visualization,Qwen3.5-VL,Trace Visualization,Twilio SMS Notification,Stitch Images,Size Measurement,OpenAI,Roboflow Dataset Upload - outputs:
Dynamic Crop,Time in Zone,Circle Visualization,Time in Zone,Mask Area Measurement,Byte Tracker,Model Monitoring Inference Aggregator,Bounding Box Visualization,Florence-2 Model,Detections Stitch,Roboflow Custom Metadata,Dot Visualization,Florence-2 Model,Roboflow Dataset Upload,Camera Focus,Stitch OCR Detections,Velocity,Perspective Correction,PTZ Tracking (ONVIF),Line Counter,Icon Visualization,Corner Visualization,Overlap Filter,Byte Tracker,Detections Consensus,Detections List Roll-Up,Heatmap Visualization,Distance Measurement,Byte Tracker,Model Comparison Visualization,Blur Visualization,Detection Offset,Background Color Visualization,Path Deviation,Label Visualization,Detection Event Log,Pixelate Visualization,Time in Zone,Triangle Visualization,Detections Transformation,Ellipse Visualization,Path Deviation,Crop Visualization,Trace Visualization,Color Visualization,Segment Anything 2 Model,Detections Combine,Size Measurement,Detections Stabilizer,Detections Filter,Detections Classes Replacement,Stitch OCR Detections,Detections Merge,Roboflow Dataset Upload,Line Counter
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
YOLO-World Model in version v1 has.
Bindings
-
input
images(image): The image to infer on..class_names(list_of_values): One or more classes that you want YOLO-World to detect. The model accepts any string as an input, though does best with short descriptions of common objects..version(string): Variant of YoloWorld model.confidence(float_zero_to_one): Confidence threshold for detections.
-
output
predictions(object_detection_prediction): Prediction with detected bounding boxes in form of sv.Detections(...) object.
Example JSON definition of step YOLO-World Model in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/yolo_world_model@v1",
"images": "$inputs.image",
"class_names": [
"person",
"car",
"license plate"
],
"version": "v2-s",
"confidence": 0.005
}