YOLO-World Model¶
Class: YoloWorldModelBlockV1
Source: inference.core.workflows.core_steps.models.foundation.yolo_world.v1.YoloWorldModelBlockV1
Run YOLO-World, a zero-shot object detection model, on an image.
YOLO-World accepts one or more text classes you want to identify in an image. The model returns the location of objects that meet the specified class, if YOLO-World is able to identify objects of that class.
We recommend experimenting with YOLO-World to evaluate the model on your use case before using this block in production. For example on how to effectively prompt YOLO-World, refer to the Roboflow YOLO-World prompting guide.
Type identifier¶
Use the following identifier in step "type" field: roboflow_core/yolo_world_model@v1to add the block as
as step in your workflow.
Properties¶
| Name | Type | Description | Refs |
|---|---|---|---|
name |
str |
Enter a unique identifier for this step.. | ❌ |
class_names |
List[str] |
One or more classes that you want YOLO-World to detect. The model accepts any string as an input, though does best with short descriptions of common objects.. | ✅ |
version |
str |
Variant of YoloWorld model. | ✅ |
confidence |
float |
Confidence threshold for detections. | ✅ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to YOLO-World Model in version v1.
- inputs:
Stability AI Outpainting,OpenAI-Compatible LLM,Morphological Transformation,Motion Detection,Contrast Enhancement,Identify Outliers,Crop Visualization,Camera Focus,Blur Visualization,Image Preprocessing,Corner Visualization,Ellipse Visualization,Mask Visualization,Stability AI Image Generation,Qwen-VL,Google Gemma API,Heatmap Visualization,Object Detection Model,Stitch OCR Detections,Roboflow Vision Events,Image Slicer,Qwen 3.5 API,Trace Visualization,Background Color Visualization,Slack Notification,Anthropic Claude,Dimension Collapse,Qwen 3.6 API,OpenAI,Webhook Sink,Email Notification,Color Visualization,Bounding Box Visualization,Keypoint Visualization,Detections Consensus,Model Comparison Visualization,Google Gemma,Relative Static Crop,Llama 3.2 Vision,CogVLM,Polygon Zone Visualization,Qwen3.5-VL,Dynamic Crop,Camera Focus,Polygon Visualization,QR Code Generator,Google Vision OCR,GLM-OCR,Stitch Images,Google Gemini,Llama 3.2 Vision,OpenRouter,Twilio SMS Notification,Image Blur,Clip Comparison,Dynamic Zone,Model Monitoring Inference Aggregator,Anthropic Claude,Image Slicer,Depth Estimation,OpenAI,Detections List Roll-Up,Multi-Label Classification Model,Buffer,Google Gemini,Classification Label Visualization,Pixelate Visualization,EasyOCR,SIFT,Florence-2 Model,MoonshotAI Kimi,Contrast Equalization,Image Threshold,Instance Segmentation Model,MoonshotAI Kimi,Dot Visualization,Polygon Visualization,Background Subtraction,Roboflow Dataset Upload,Anthropic Claude,Halo Visualization,Stability AI Inpainting,Roboflow Custom Metadata,Florence-2 Model,Label Visualization,Local File Sink,Icon Visualization,Single-Label Classification Model,Image Contours,OpenAI,Absolute Static Crop,Google Gemini,Grid Visualization,VLM As Classifier,Camera Calibration,Halo Visualization,Email Notification,Size Measurement,OpenAI,Clip Comparison,LMM,LMM For Classification,Text Display,Image Convert Grayscale,Reference Path Visualization,Circle Visualization,Line Counter Visualization,Stitch OCR Detections,OCR Model,Keypoint Detection Model,SIFT Comparison,VLM As Detector,Image Stack,Identify Changes,Morphological Transformation,Twilio SMS/MMS Notification,Roboflow Dataset Upload,CSV Formatter,S3 Sink,Triangle Visualization,Perspective Correction - outputs:
BoT-SORT Tracker,Time in Zone,Detections List Roll-Up,Segment Anything 2 Model,Detections Stitch,Pixelate Visualization,PTZ Tracking (ONVIF),Florence-2 Model,Crop Visualization,Camera Focus,Blur Visualization,Time in Zone,Corner Visualization,Ellipse Visualization,Stitch OCR Detections,Roboflow Vision Events,Velocity,Line Counter,Heatmap Visualization,Dot Visualization,Roboflow Dataset Upload,Trace Visualization,Path Deviation,OC-SORT Tracker,Per-Class Confidence Filter,Background Color Visualization,Roboflow Custom Metadata,Line Counter,Detections Classes Replacement,Florence-2 Model,Time in Zone,Label Visualization,Icon Visualization,Detection Offset,Overlap Filter,Detections Merge,Color Visualization,Byte Tracker,Bounding Box Visualization,Detections Filter,Detections Consensus,Model Comparison Visualization,ByteTrack Tracker,SAM2 Video Tracker,Detection Event Log,Path Deviation,Byte Tracker,Size Measurement,Detections Transformation,Mask Area Measurement,Circle Visualization,Dynamic Crop,Byte Tracker,Stitch OCR Detections,Detections Combine,Distance Measurement,Overlap Analysis,SORT Tracker,Roboflow Dataset Upload,Detections Stabilizer,Model Monitoring Inference Aggregator,Triangle Visualization,Perspective Correction
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
YOLO-World Model in version v1 has.
Bindings
-
input
images(image): The image to infer on..class_names(list_of_values): One or more classes that you want YOLO-World to detect. The model accepts any string as an input, though does best with short descriptions of common objects..version(string): Variant of YoloWorld model.confidence(float_zero_to_one): Confidence threshold for detections.
-
output
predictions(object_detection_prediction): Prediction with detected bounding boxes in form of sv.Detections(...) object.
Example JSON definition of step YOLO-World Model in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/yolo_world_model@v1",
"images": "$inputs.image",
"class_names": [
"person",
"car",
"license plate"
],
"version": "v2-s",
"confidence": 0.005
}