YOLO-World Model¶
Class: YoloWorldModelBlockV1
Source: inference.core.workflows.core_steps.models.foundation.yolo_world.v1.YoloWorldModelBlockV1
Run YOLO-World, a zero-shot object detection model, on an image.
YOLO-World accepts one or more text classes you want to identify in an image. The model returns the location of objects that meet the specified class, if YOLO-World is able to identify objects of that class.
We recommend experimenting with YOLO-World to evaluate the model on your use case before using this block in production. For example on how to effectively prompt YOLO-World, refer to the Roboflow YOLO-World prompting guide.
Type identifier¶
Use the following identifier in step "type" field: roboflow_core/yolo_world_model@v1to add the block as
as step in your workflow.
Properties¶
| Name | Type | Description | Refs |
|---|---|---|---|
name |
str |
Enter a unique identifier for this step.. | ❌ |
class_names |
List[str] |
One or more classes that you want YOLO-World to detect. The model accepts any string as an input, though does best with short descriptions of common objects.. | ✅ |
version |
str |
Variant of YoloWorld model. | ✅ |
confidence |
float |
Confidence threshold for detections. | ✅ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to YOLO-World Model in version v1.
- inputs:
Perspective Correction,S3 Sink,Stability AI Inpainting,Image Convert Grayscale,Clip Comparison,Morphological Transformation,Email Notification,Qwen-VL,VLM As Detector,QR Code Generator,Twilio SMS/MMS Notification,OpenRouter,Model Monitoring Inference Aggregator,OpenAI,Llama 3.2 Vision,MoonshotAI Kimi,Polygon Zone Visualization,Image Threshold,Stitch OCR Detections,Anthropic Claude,OpenAI-Compatible LLM,OpenAI,Dynamic Crop,Size Measurement,Detections Consensus,Heatmap Visualization,Keypoint Visualization,Email Notification,Llama 3.2 Vision,Anthropic Claude,Stability AI Image Generation,Clip Comparison,Google Vision OCR,Camera Focus,Label Visualization,Instance Segmentation Model,Identify Outliers,Contrast Enhancement,Bounding Box Visualization,Local File Sink,Depth Estimation,Google Gemini,Image Contours,EasyOCR,Relative Static Crop,Motion Detection,Polygon Visualization,Google Gemma API,Background Color Visualization,Qwen 3.6 API,Qwen 3.5 API,Image Blur,Polygon Visualization,Google Gemini,SIFT Comparison,Grid Visualization,Anthropic Claude,Florence-2 Model,Triangle Visualization,Object Detection Model,OCR Model,Roboflow Custom Metadata,OpenAI,Slack Notification,VLM As Classifier,Image Stack,Pixelate Visualization,Stitch Images,Single-Label Classification Model,OpenAI,Buffer,Image Slicer,LMM For Classification,Keypoint Detection Model,Image Preprocessing,SIFT,Line Counter Visualization,Roboflow Dataset Upload,Image Slicer,Dynamic Zone,Corner Visualization,Stability AI Outpainting,Halo Visualization,Multi-Label Classification Model,LMM,Roboflow Dataset Upload,Qwen3.5-VL,Color Visualization,Detections List Roll-Up,Blur Visualization,Google Gemini,Classification Label Visualization,Camera Focus,Camera Calibration,Morphological Transformation,Trace Visualization,Stitch OCR Detections,Reference Path Visualization,Halo Visualization,Ellipse Visualization,Model Comparison Visualization,Dot Visualization,Identify Changes,Mask Visualization,GLM-OCR,Crop Visualization,Background Subtraction,Circle Visualization,CogVLM,Text Display,Dimension Collapse,Absolute Static Crop,CSV Formatter,Florence-2 Model,Contrast Equalization,Roboflow Vision Events,Webhook Sink,Icon Visualization,Twilio SMS Notification,MoonshotAI Kimi,Google Gemma - outputs:
Perspective Correction,Roboflow Custom Metadata,BoT-SORT Tracker,Detections Filter,Path Deviation,Detections Merge,Pixelate Visualization,Line Counter,Detections Stabilizer,Model Monitoring Inference Aggregator,Roboflow Dataset Upload,Time in Zone,Line Counter,Icon Visualization,Stitch OCR Detections,OC-SORT Tracker,Detections Classes Replacement,Segment Anything 2 Model,Dynamic Crop,Corner Visualization,Size Measurement,Mask Area Measurement,Heatmap Visualization,Detections Consensus,Roboflow Dataset Upload,Detections Transformation,Time in Zone,Color Visualization,Blur Visualization,Detections List Roll-Up,Label Visualization,Camera Focus,Distance Measurement,Path Deviation,Trace Visualization,Per-Class Confidence Filter,Detections Stitch,Bounding Box Visualization,Overlap Filter,Stitch OCR Detections,Detection Offset,Byte Tracker,Ellipse Visualization,Model Comparison Visualization,Dot Visualization,PTZ Tracking (ONVIF),SORT Tracker,Crop Visualization,Circle Visualization,Detections Combine,Byte Tracker,Background Color Visualization,ByteTrack Tracker,SAM2 Video Tracker,Florence-2 Model,Roboflow Vision Events,Byte Tracker,Velocity,Detection Event Log,Florence-2 Model,Triangle Visualization,Time in Zone
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
YOLO-World Model in version v1 has.
Bindings
-
input
images(image): The image to infer on..class_names(list_of_values): One or more classes that you want YOLO-World to detect. The model accepts any string as an input, though does best with short descriptions of common objects..version(string): Variant of YoloWorld model.confidence(float_zero_to_one): Confidence threshold for detections.
-
output
predictions(object_detection_prediction): Prediction with detected bounding boxes in form of sv.Detections(...) object.
Example JSON definition of step YOLO-World Model in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/yolo_world_model@v1",
"images": "$inputs.image",
"class_names": [
"person",
"car",
"license plate"
],
"version": "v2-s",
"confidence": 0.005
}