Moondream2¶
Class: Moondream2BlockV1
Source: inference.core.workflows.core_steps.models.foundation.moondream2.v1.Moondream2BlockV1
This workflow block runs Moondream2, a multimodal vision-language model. You can use this block to run zero-shot object detection.
Type identifier¶
Use the following identifier in step "type" field: roboflow_core/moondream2@v1to add the block as
as step in your workflow.
Properties¶
| Name | Type | Description | Refs |
|---|---|---|---|
name |
str |
Enter a unique identifier for this step.. | ❌ |
prompt |
str |
Optional text prompt to provide additional context to Moondream2.. | ✅ |
model_version |
str |
The Moondream2 model to be used for inference.. | ✅ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to Moondream2 in version v1.
- inputs:
Email Notification,Contrast Equalization,Instance Segmentation Model,Google Vision OCR,Grid Visualization,S3 Sink,Stability AI Image Generation,Model Comparison Visualization,Absolute Static Crop,Keypoint Visualization,SIFT,Trace Visualization,Roboflow Dataset Upload,Twilio SMS/MMS Notification,QR Code Generator,Model Monitoring Inference Aggregator,GLM-OCR,Reference Path Visualization,Halo Visualization,OCR Model,SIFT Comparison,VLM As Classifier,Image Preprocessing,Crop Visualization,OpenAI,OpenAI,Label Visualization,Classification Label Visualization,Pixelate Visualization,Local File Sink,Twilio SMS Notification,Email Notification,Qwen3.5-VL,Stitch OCR Detections,Corner Visualization,Stitch Images,Background Subtraction,Stitch OCR Detections,LMM For Classification,EasyOCR,Morphological Transformation,CSV Formatter,OpenAI,Clip Comparison,Image Threshold,Background Color Visualization,Anthropic Claude,Single-Label Classification Model,Google Gemini,Camera Calibration,Halo Visualization,Stability AI Outpainting,Roboflow Custom Metadata,CogVLM,OpenAI,Single-Label Classification Model,Ellipse Visualization,Heatmap Visualization,Image Convert Grayscale,Triangle Visualization,Semantic Segmentation Model,Image Blur,Depth Estimation,Color Visualization,Camera Focus,Text Display,Anthropic Claude,Dot Visualization,Image Slicer,Keypoint Detection Model,Polygon Visualization,Florence-2 Model,Circle Visualization,Blur Visualization,Multi-Label Classification Model,Google Gemini,LMM,Slack Notification,Icon Visualization,Camera Focus,Stability AI Inpainting,Polygon Visualization,Webhook Sink,Polygon Zone Visualization,Perspective Correction,Florence-2 Model,Instance Segmentation Model,Anthropic Claude,Mask Visualization,Google Gemini,Image Contours,Dynamic Crop,Roboflow Dataset Upload,Llama 3.2 Vision,Keypoint Detection Model,Object Detection Model,VLM As Detector,Object Detection Model,Image Slicer,Line Counter Visualization,Relative Static Crop,Multi-Label Classification Model,Bounding Box Visualization - outputs:
Detections Classes Replacement,Velocity,Byte Tracker,Detections Consensus,Triangle Visualization,PTZ Tracking (ONVIF),ByteTrack Tracker,Path Deviation,Size Measurement,Detections Filter,Detections Transformation,Color Visualization,Model Comparison Visualization,Overlap Filter,Roboflow Dataset Upload,Trace Visualization,Dot Visualization,Detections Combine,Model Monitoring Inference Aggregator,Florence-2 Model,Circle Visualization,Blur Visualization,Time in Zone,Detections Stabilizer,SORT Tracker,Detections Merge,Crop Visualization,Icon Visualization,Pixelate Visualization,Camera Focus,Label Visualization,Detections List Roll-Up,Detection Offset,Detection Event Log,Stitch OCR Detections,Corner Visualization,Florence-2 Model,Mask Area Measurement,Time in Zone,OC-SORT Tracker,Time in Zone,Perspective Correction,Stitch OCR Detections,Roboflow Dataset Upload,Distance Measurement,Byte Tracker,Path Deviation,Line Counter,Background Color Visualization,Segment Anything 2 Model,Roboflow Custom Metadata,Byte Tracker,Ellipse Visualization,Detections Stitch,Dynamic Crop,Line Counter,Bounding Box Visualization,Heatmap Visualization
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
Moondream2 in version v1 has.
Bindings
-
input
images(image): The image to infer on..prompt(string): Optional text prompt to provide additional context to Moondream2..model_version(roboflow_model_id): The Moondream2 model to be used for inference..
-
output
predictions(object_detection_prediction): Prediction with detected bounding boxes in form of sv.Detections(...) object.
Example JSON definition of step Moondream2 in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/moondream2@v1",
"images": "$inputs.image",
"prompt": "my prompt",
"model_version": "moondream2/moondream2_2b_jul24"
}