Moondream2
Class: Moondream2BlockV1
Source: inference.core.workflows.core_steps.models.foundation.moondream2.v1.Moondream2BlockV1
This workflow block runs Moondream2, a multimodal vision-language model. You can use this block to run zero-shot object detection.
Type identifier
Use the following identifier in the step "type" field: roboflow_core/moondream2@v1 to add the block as a step in your workflow.
Properties
| Name | Type | Description | Refs |
|---|---|---|---|
| name | str | Enter a unique identifier for this step. | ❌ |
| prompt | str | Optional text prompt to provide additional context to Moondream2. | ✅ |
| model_version | str | The Moondream2 model to be used for inference. | ✅ |
The Refs column indicates whether a property can be parametrised with dynamic values available
at workflow runtime. See Bindings for more info.
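As a rough illustration of what the Refs column means in practice, the sketch below builds a step definition in which `prompt` is bound to a runtime value instead of a literal. The helper name `moondream2_step` and the `$inputs.prompt` selector (which presumes a workflow input called `prompt`) are assumptions for this example, not part of the block's documented API.

```python
import json


def moondream2_step(name, prompt="$inputs.prompt",
                    model_version="moondream2/moondream2_2b_jul24"):
    """Build a Moondream2 step definition as a plain dict (hypothetical helper)."""
    # `name` is marked with a cross in the Refs column, so it has to be a
    # literal string rather than a selector such as "$inputs.something".
    if name.startswith("$"):
        raise ValueError("step name must be a literal, not a selector")
    return {
        "name": name,
        "type": "roboflow_core/moondream2@v1",
        "images": "$inputs.image",
        # Assumed: a workflow input named "prompt" supplies the text at runtime.
        "prompt": prompt,
        "model_version": model_version,
    }


step = moondream2_step("moondream2")
print(json.dumps(step, indent=2))
```

Passing a plain string such as `prompt="person"` instead of a selector would give a static prompt, as in the example JSON definition on this page.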
Available Connections
Compatible Blocks
Check what blocks you can connect to Moondream2 in version v1.
- inputs:
VLM as Detector, Google Vision OCR, Classification Label Visualization, Circle Visualization, Image Contours, Relative Static Crop, Image Preprocessing, LMM For Classification, VLM as Classifier, Ellipse Visualization, Stitch Images, Triangle Visualization, Stability AI Inpainting, QR Code Generator, Image Slicer, Background Color Visualization, Model Monitoring Inference Aggregator, OCR Model, Dot Visualization, Florence-2 Model, SIFT, Morphological Transformation, EasyOCR, Reference Path Visualization, Halo Visualization, SIFT Comparison, Polygon Visualization, Image Slicer, Florence-2 Model, Slack Notification, Clip Comparison, Image Convert Grayscale, Instance Segmentation Model, OpenAI, Color Visualization, Keypoint Detection Model, Object Detection Model, Google Gemini, Label Visualization, Email Notification, Llama 3.2 Vision, Trace Visualization, Email Notification, Corner Visualization, Mask Visualization, CogVLM, Stability AI Outpainting, OpenAI, Roboflow Custom Metadata, Stitch OCR Detections, Blur Visualization, CSV Formatter, Crop Visualization, Single-Label Classification Model, OpenAI, Grid Visualization, Perspective Correction, Twilio SMS Notification, Absolute Static Crop, Single-Label Classification Model, Contrast Equalization, Roboflow Dataset Upload, Roboflow Dataset Upload, Polygon Zone Visualization, Stability AI Image Generation, Webhook Sink, Depth Estimation, Bounding Box Visualization, Camera Focus, Line Counter Visualization, Instance Segmentation Model, Multi-Label Classification Model, Icon Visualization, Image Blur, Pixelate Visualization, Image Threshold, Keypoint Detection Model, Anthropic Claude, LMM, Google Gemini, Multi-Label Classification Model, Dynamic Crop, Model Comparison Visualization, Camera Calibration, Local File Sink, Keypoint Visualization, Object Detection Model
- outputs:
Byte Tracker, Overlap Filter, Blur Visualization, Time in Zone, Circle Visualization, Detections Stabilizer, Crop Visualization, Detections Filter, Detections Classes Replacement, Perspective Correction, Ellipse Visualization, Triangle Visualization, Roboflow Dataset Upload, Detections Combine, Roboflow Dataset Upload, Stitch OCR Detections, Background Color Visualization, Model Monitoring Inference Aggregator, Segment Anything 2 Model, Velocity, Distance Measurement, Dot Visualization, Florence-2 Model, Bounding Box Visualization, Detections Transformation, Icon Visualization, Florence-2 Model, Time in Zone, Detection Offset, Pixelate Visualization, Path Deviation, Byte Tracker, PTZ Tracking (ONVIF), Color Visualization, Line Counter, Detections Merge, Label Visualization, Byte Tracker, Trace Visualization, Dynamic Crop, Path Deviation, Line Counter, Detections Consensus, Model Comparison Visualization, Size Measurement, Corner Visualization, Time in Zone, Roboflow Custom Metadata, Detections Stitch
Input and Output Bindings
The available connections depend on the block's binding kinds. Check which binding kinds
Moondream2 in version v1 has.
Bindings
- input
    - images (image): The image to infer on.
    - prompt (string): Optional text prompt to provide additional context to Moondream2.
    - model_version (roboflow_model_id): The Moondream2 model to be used for inference.
- output
    - predictions (object_detection_prediction): Prediction with detected bounding boxes in the form of an sv.Detections(...) object.
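To make the output binding concrete, here is a minimal sketch of feeding `predictions` into a downstream block from the compatible outputs list, Bounding Box Visualization. The step names `detector` and `boxes`, the `$steps.<name>.predictions` selector form, and the visualization block's type identifier and field names are assumptions for illustration; check that block's own page before relying on them.

```python
# Moondream2 step; its name is what downstream selectors refer to.
detector = {
    "name": "detector",
    "type": "roboflow_core/moondream2@v1",
    "images": "$inputs.image",
    "prompt": "my prompt",
    "model_version": "moondream2/moondream2_2b_jul24",
}

# Downstream step consuming the object_detection_prediction output.
# Assumed: identifier and field names of the Bounding Box Visualization block.
visualizer = {
    "name": "boxes",
    "type": "roboflow_core/bounding_box_visualization@v1",
    "image": "$inputs.image",
    "predictions": f"$steps.{detector['name']}.predictions",
}

steps = [detector, visualizer]
print(visualizer["predictions"])
```

A connection like this is only expected to be accepted when the downstream property takes the `object_detection_prediction` kind.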
Example JSON definition of step Moondream2 in version v1
{
    "name": "<your_step_name_here>",
    "type": "roboflow_core/moondream2@v1",
    "images": "$inputs.image",
    "prompt": "my prompt",
    "model_version": "moondream2/moondream2_2b_jul24"
}
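The step definition above is only one entry in a workflow. The sketch below shows how it might sit inside a complete definition and be serialised to JSON. The envelope fields (`version`, `inputs`, `outputs`, the `WorkflowImage` and `JsonField` types) and the helper `undeclared_step_refs` are assumptions about the surrounding workflow schema rather than something this page specifies.

```python
import json

workflow = {
    "version": "1.0",  # assumed schema version
    "inputs": [{"type": "WorkflowImage", "name": "image"}],
    "steps": [
        {
            "name": "moondream2",
            "type": "roboflow_core/moondream2@v1",
            "images": "$inputs.image",
            "prompt": "my prompt",
            "model_version": "moondream2/moondream2_2b_jul24",
        }
    ],
    "outputs": [
        {
            "type": "JsonField",
            "name": "predictions",
            "selector": "$steps.moondream2.predictions",
        }
    ],
}


def undeclared_step_refs(definition):
    """Return output selectors that point at steps missing from the definition."""
    declared = {step["name"] for step in definition["steps"]}
    selectors = [out["selector"] for out in definition["outputs"]]
    return [
        sel for sel in selectors
        if sel.startswith("$steps.") and sel.split(".")[1] not in declared
    ]


# Hypothetical sanity check before sending the definition anywhere.
assert undeclared_step_refs(workflow) == []
serialized = json.dumps(workflow, indent=2)
print(serialized)
```

A local check like this only catches misspelled step names; it does not validate binding kinds or model availability.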