Moondream2
Class: Moondream2BlockV1
Source: inference.core.workflows.core_steps.models.foundation.moondream2.v1.Moondream2BlockV1
This workflow block runs Moondream2, a multimodal vision-language model. You can use this block to run zero-shot object detection.
Type identifier
Use the following identifier in the step "type" field: roboflow_core/moondream2@v1 to add the block as
a step in your workflow.
Properties
| Name | Type | Description | Refs |
|---|---|---|---|
| name | str | Enter a unique identifier for this step. | ❌ |
| prompt | str | Optional text prompt to provide additional context to Moondream2. | ✅ |
| model_version | str | The Moondream2 model to be used for inference. | ✅ |
The Refs column indicates whether a property can be parametrised with dynamic values available
at workflow runtime. See Bindings for more info.
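A minimal sketch of what parametrising a property can look like, assuming the `$inputs.<name>` selector form used in the example step definition on this page. The workflow input named `prompt`, the step name `moondream`, the `$steps.` prefix, and the helper `is_selector` are hypothetical and only serve the illustration.

```python
# Step definition in which `prompt` is bound to a runtime value instead of a literal.
# ASSUMPTION: the workflow declares an input called "prompt" (hypothetical name).
step = {
    "name": "moondream",
    "type": "roboflow_core/moondream2@v1",
    "images": "$inputs.image",
    "prompt": "$inputs.prompt",
    "model_version": "moondream2/moondream2_2b_jul24",
}


def is_selector(value):
    """Return True if the value looks like a runtime selector rather than a literal.

    ASSUMPTION: selectors start with "$inputs." or "$steps."; this is a
    simplified check, not the validation performed by the workflow engine.
    """
    return isinstance(value, str) and value.startswith(("$inputs.", "$steps."))


# Split the step's fields into dynamically bound and literal ones.
dynamic_fields = sorted(key for key, value in step.items() if is_selector(value))
literal_fields = sorted(key for key, value in step.items() if not is_selector(value))

print("dynamic:", dynamic_fields)
print("literal:", literal_fields)
```

In this sketch `images` and `prompt` are resolved at runtime, while `name`, `type`, and `model_version` are fixed when the workflow is defined.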
Available Connections
Compatible Blocks
Check what blocks you can connect to Moondream2 in version v1.
- inputs:
Roboflow Dataset Upload, Line Counter Visualization, Stability AI Outpainting, Object Detection Model, Email Notification, Google Gemma API, Image Slicer, OCR Model, Google Vision OCR, Image Preprocessing, Google Gemini, Instance Segmentation Model, EasyOCR, Color Visualization, Object Detection Model, Multi-Label Classification Model, OpenAI, Ellipse Visualization, Polygon Visualization, Anthropic Claude, Single-Label Classification Model, Relative Static Crop, Webhook Sink, Model Comparison Visualization, Trace Visualization, Stitch OCR Detections, Camera Focus, Roboflow Custom Metadata, Qwen 3.5 API, Object Detection Model, OpenAI, Instance Segmentation Model, Single-Label Classification Model, Semantic Segmentation Model, VLM As Classifier, Image Threshold, Stitch Images, Heatmap Visualization, Qwen 3.6 API, SIFT Comparison, Morphological Transformation, Florence-2 Model, Halo Visualization, Instance Segmentation Model, CogVLM, Crop Visualization, Camera Calibration, Florence-2 Model, Multi-Label Classification Model, GLM-OCR, Dot Visualization, S3 Sink, Semantic Segmentation Model, Twilio SMS Notification, Icon Visualization, Model Monitoring Inference Aggregator, Local File Sink, Google Gemini, Roboflow Dataset Upload, Image Contours, Pixelate Visualization, Keypoint Detection Model, Twilio SMS/MMS Notification, Polygon Zone Visualization, Reference Path Visualization, Blur Visualization, Anthropic Claude, Background Subtraction, Text Display, Clip Comparison, CSV Formatter, VLM As Detector, LMM, Stability AI Image Generation, Perspective Correction, Anthropic Claude, Bounding Box Visualization, Depth Estimation, Classification Label Visualization, Image Slicer, Absolute Static Crop, Image Blur, Stability AI Inpainting, Multi-Label Classification Model, Polygon Visualization, Image Convert Grayscale, SIFT, Single-Label Classification Model, Roboflow Vision Events, OpenAI, Google Gemini, Label Visualization, Corner Visualization, Grid Visualization, Dynamic Crop, Contrast Equalization, Keypoint Visualization, Triangle Visualization, Qwen3.5-VL, Keypoint Detection Model, QR Code Generator, Halo Visualization, Circle Visualization, Camera Focus, Mask Visualization, LMM For Classification, Morphological Transformation, OpenAI, Contrast Enhancement, Keypoint Detection Model, MoonshotAI Kimi, Llama 3.2 Vision, Background Color Visualization, Email Notification, Slack Notification, Stitch OCR Detections
- outputs:
Detections Stabilizer, Detections Stitch, Model Monitoring Inference Aggregator, Detections Filter, Roboflow Dataset Upload, Roboflow Dataset Upload, Icon Visualization, Pixelate Visualization, Line Counter, Time in Zone, Distance Measurement, Color Visualization, Detections Combine, Blur Visualization, SAM2 Video Tracker, Detection Event Log, Ellipse Visualization, ByteTrack Tracker, Byte Tracker, Byte Tracker, Time in Zone, Detections Merge, Detections Classes Replacement, Detections Consensus, Perspective Correction, Overlap Filter, Model Comparison Visualization, Velocity, Stitch OCR Detections, Trace Visualization, Camera Focus, Roboflow Custom Metadata, Byte Tracker, Bounding Box Visualization, Line Counter, Detection Offset, Roboflow Vision Events, Detections List Roll-Up, Size Measurement, Label Visualization, Corner Visualization, Mask Area Measurement, Heatmap Visualization, Dynamic Crop, Per-Class Confidence Filter, SORT Tracker, Triangle Visualization, Circle Visualization, Florence-2 Model, Segment Anything 2 Model, Detections Transformation, Crop Visualization, Florence-2 Model, Path Deviation, Background Color Visualization, Time in Zone, Dot Visualization, OC-SORT Tracker, PTZ Tracking (ONVIF), Path Deviation, Stitch OCR Detections
Input and Output Bindings
The connections available to a block depend on its binding kinds. Check which binding kinds
Moondream2 in version v1 has.
Bindings
- input
    - images (image): The image to infer on.
    - prompt (string): Optional text prompt to provide additional context to Moondream2.
    - model_version (roboflow_model_id): The Moondream2 model to be used for inference.
- output
    - predictions (object_detection_prediction): Prediction with detected bounding boxes in the form of an sv.Detections(...) object.
Example JSON definition of step Moondream2 in version v1
{
    "name": "<your_step_name_here>",
    "type": "roboflow_core/moondream2@v1",
    "images": "$inputs.image",
    "prompt": "my prompt",
    "model_version": "moondream2/moondream2_2b_jul24"
}
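The step definition above is only one entry in a workflow. As a hedged sketch, the Python snippet below wraps it in a complete workflow definition and serialises it to JSON. The wrapper keys (`version`, `inputs`, `steps`, `outputs`), the `WorkflowImage` and `JsonField` types, and the `$steps.<step_name>.predictions` selector are assumptions about the workflow definition format that this page does not show; the function `build_workflow` and the step name `moondream` are hypothetical.

```python
import json


def build_workflow(step_name="moondream", prompt="my prompt"):
    """Build a workflow definition dict containing a single Moondream2 step.

    ASSUMPTION: the surrounding structure follows the usual workflow
    definition layout; only the step fields come from this page.
    """
    step = {
        "name": step_name,
        "type": "roboflow_core/moondream2@v1",
        "images": "$inputs.image",
        "prompt": prompt,
        "model_version": "moondream2/moondream2_2b_jul24",
    }
    return {
        "version": "1.0",
        # Declares the runtime input referenced by "$inputs.image".
        "inputs": [{"type": "WorkflowImage", "name": "image"}],
        "steps": [step],
        # Exposes the block's `predictions` output as a workflow output.
        "outputs": [
            {
                "type": "JsonField",
                "name": "predictions",
                "selector": f"$steps.{step_name}.predictions",
            }
        ],
    }


workflow = build_workflow()
print(json.dumps(workflow, indent=2))
```

Building the definition in code keeps the step name and the output selector in sync, since both derive from the same `step_name` argument.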