YOLO-World Model¶
Class: YoloWorldModelBlockV1
Source: inference.core.workflows.core_steps.models.foundation.yolo_world.v1.YoloWorldModelBlockV1
Run YOLO-World, a zero-shot object detection model, on an image.
YOLO-World accepts one or more text classes you want to identify in an image. The model returns the location of objects that meet the specified class, if YOLO-World is able to identify objects of that class.
We recommend experimenting with YOLO-World to evaluate the model on your use case before using this block in production. For example on how to effectively prompt YOLO-World, refer to the Roboflow YOLO-World prompting guide.
Type identifier¶
Use the following identifier in step "type" field: roboflow_core/yolo_world_model@v1to add the block as
as step in your workflow.
Properties¶
| Name | Type | Description | Refs |
|---|---|---|---|
name |
str |
Enter a unique identifier for this step.. | ❌ |
class_names |
List[str] |
One or more classes that you want YOLO-World to detect. The model accepts any string as an input, though does best with short descriptions of common objects.. | ✅ |
version |
str |
Variant of YoloWorld model. | ✅ |
confidence |
float |
Confidence threshold for detections. | ✅ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to YOLO-World Model in version v1.
- inputs:
Heatmap Visualization,Pixelate Visualization,Roboflow Custom Metadata,Halo Visualization,Anthropic Claude,OpenAI,Halo Visualization,Contrast Equalization,Trace Visualization,Grid Visualization,Ellipse Visualization,Model Monitoring Inference Aggregator,Local File Sink,Webhook Sink,Dynamic Crop,Image Convert Grayscale,Corner Visualization,Polygon Zone Visualization,Circle Visualization,Model Comparison Visualization,Florence-2 Model,Image Slicer,Stability AI Outpainting,Dynamic Zone,Keypoint Visualization,Text Display,Google Vision OCR,EasyOCR,OCR Model,Anthropic Claude,Size Measurement,Clip Comparison,Twilio SMS Notification,Stability AI Inpainting,Roboflow Dataset Upload,SIFT Comparison,Reference Path Visualization,Image Blur,Stitch OCR Detections,VLM As Classifier,QR Code Generator,CogVLM,LMM For Classification,Image Slicer,Icon Visualization,Background Color Visualization,Absolute Static Crop,Google Gemini,Label Visualization,Image Preprocessing,Classification Label Visualization,SIFT,OpenAI,Mask Visualization,Stability AI Image Generation,Single-Label Classification Model,Detections Consensus,Bounding Box Visualization,OpenAI,Slack Notification,Roboflow Dataset Upload,Triangle Visualization,Dimension Collapse,Detections List Roll-Up,Dot Visualization,Keypoint Detection Model,Email Notification,Twilio SMS/MMS Notification,Image Contours,Anthropic Claude,Instance Segmentation Model,Stitch OCR Detections,Email Notification,LMM,Florence-2 Model,Background Subtraction,Camera Calibration,Image Threshold,VLM As Detector,Clip Comparison,Relative Static Crop,Identify Changes,Crop Visualization,Stitch Images,Blur Visualization,Perspective Correction,Motion Detection,CSV Formatter,Camera Focus,Camera Focus,Line Counter Visualization,Color Visualization,Morphological Transformation,Llama 3.2 Vision,Google Gemini,Buffer,Multi-Label Classification Model,Identify Outliers,OpenAI,Qwen3.5-VL,Google Gemini,Depth Estimation,Polygon Visualization,Object Detection Model,Polygon Visualization - outputs:
Path Deviation,Roboflow Custom Metadata,Heatmap Visualization,Line Counter,Detections Stitch,Pixelate Visualization,Time in Zone,Byte Tracker,Bounding Box Visualization,Detections Consensus,Distance Measurement,PTZ Tracking (ONVIF),Mask Area Measurement,Detections Filter,Detection Offset,Triangle Visualization,Trace Visualization,Detections List Roll-Up,Dot Visualization,Ellipse Visualization,Model Monitoring Inference Aggregator,Segment Anything 2 Model,Dynamic Crop,Detection Event Log,Byte Tracker,Corner Visualization,Stitch OCR Detections,Circle Visualization,Model Comparison Visualization,Florence-2 Model,Florence-2 Model,Overlap Filter,Detections Merge,Detections Combine,Byte Tracker,Crop Visualization,Size Measurement,Detections Stabilizer,Detections Classes Replacement,Velocity,Roboflow Dataset Upload,Blur Visualization,Perspective Correction,Path Deviation,Stitch OCR Detections,Camera Focus,Detections Transformation,Color Visualization,Line Counter,Time in Zone,Icon Visualization,Background Color Visualization,Label Visualization,Time in Zone,Roboflow Dataset Upload
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
YOLO-World Model in version v1 has.
Bindings
-
input
images(image): The image to infer on..class_names(list_of_values): One or more classes that you want YOLO-World to detect. The model accepts any string as an input, though does best with short descriptions of common objects..version(string): Variant of YoloWorld model.confidence(float_zero_to_one): Confidence threshold for detections.
-
output
predictions(object_detection_prediction): Prediction with detected bounding boxes in form of sv.Detections(...) object.
Example JSON definition of step YOLO-World Model in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/yolo_world_model@v1",
"images": "$inputs.image",
"class_names": [
"person",
"car",
"license plate"
],
"version": "v2-s",
"confidence": 0.005
}