Moondream2
Class: Moondream2BlockV1
Source: inference.core.workflows.core_steps.models.foundation.moondream2.v1.Moondream2BlockV1
This workflow block runs Moondream2, a multimodal vision-language model. You can use this block to run zero-shot object detection.
Type identifier
Use the identifier roboflow_core/moondream2@v1 in the step "type" field to add the block as a step in your workflow.
Properties
| Name | Type | Description | Refs |
|---|---|---|---|
| name | str | Enter a unique identifier for this step. | ❌ |
| prompt | str | Optional text prompt to provide additional context to Moondream2. | ✅ |
| model_version | str | The Moondream2 model to be used for inference. | ✅ |
The Refs column indicates whether a property can be parametrised with dynamic values available
at workflow runtime. See Bindings for more info.
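As a rough illustration of what a ✅ in the Refs column allows, the sketch below binds prompt to a workflow input instead of a literal string. The input name "prompt", its default value, and the helper is_selector are assumptions made for this example; only the step fields and the type identifier come from this page.

```python
import json

# Workflow-level input declaration. The name "prompt" and the default
# value are illustrative assumptions, not values taken from this page.
prompt_input = {
    "type": "WorkflowParameter",
    "name": "prompt",
    "default_value": "person",
}

# Step definition: "prompt" is marked with a check mark in the Refs column,
# so it can hold a selector; "name" is not, so it stays a literal string.
step = {
    "type": "roboflow_core/moondream2@v1",
    "name": "moondream2",
    "images": "$inputs.image",
    "prompt": "$inputs.prompt",
    "model_version": "moondream2/moondream2_2b_jul24",
}


def is_selector(value):
    """Hypothetical helper: treat strings starting with '$' as selectors."""
    return isinstance(value, str) and value.startswith("$")


print(json.dumps(step, indent=2))
```

With this arrangement the prompt can change per request without editing the step definition itself.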
Available Connections
Compatible Blocks
Check which blocks you can connect to Moondream2 in version v1.
- inputs: Google Vision OCR, Label Visualization, LMM For Classification, Blur Visualization, Background Color Visualization, Contrast Equalization, Bounding Box Visualization, Keypoint Visualization, Stability AI Outpainting, Reference Path Visualization, Image Slicer, Pixelate Visualization, Single-Label Classification Model, CSV Formatter, Image Preprocessing, Color Visualization, SIFT Comparison, Object Detection Model, Email Notification, Anthropic Claude, Circle Visualization, Image Contours, Object Detection Model, Polygon Zone Visualization, Ellipse Visualization, Clip Comparison, Email Notification, VLM as Classifier, Model Monitoring Inference Aggregator, OCR Model, Absolute Static Crop, Depth Estimation, LMM, Morphological Transformation, Roboflow Dataset Upload, Crop Visualization, OpenAI, Image Convert Grayscale, Florence-2 Model, CogVLM, Roboflow Custom Metadata, VLM as Detector, Multi-Label Classification Model, Classification Label Visualization, Stitch OCR Detections, Keypoint Detection Model, Keypoint Detection Model, Camera Calibration, Polygon Visualization, Icon Visualization, Triangle Visualization, Roboflow Dataset Upload, Anthropic Claude, Model Comparison Visualization, Corner Visualization, Florence-2 Model, Google Gemini, Google Gemini, EasyOCR, Line Counter Visualization, Grid Visualization, Halo Visualization, Stability AI Image Generation, QR Code Generator, Twilio SMS Notification, Relative Static Crop, Dot Visualization, Llama 3.2 Vision, Image Blur, Slack Notification, OpenAI, Local File Sink, Instance Segmentation Model, Multi-Label Classification Model, Image Slicer, OpenAI, Stability AI Inpainting, Dynamic Crop, Single-Label Classification Model, Camera Focus, Webhook Sink, Image Threshold, Instance Segmentation Model, Perspective Correction, Mask Visualization, Trace Visualization, OpenAI, Stitch Images, SIFT
- outputs: Label Visualization, Time in Zone, Line Counter, Blur Visualization, Background Color Visualization, Bounding Box Visualization, Detections Filter, PTZ Tracking (ONVIF), Detection Offset, Pixelate Visualization, Detections Classes Replacement, Icon Visualization, Detections Transformation, Triangle Visualization, Roboflow Dataset Upload, Model Comparison Visualization, Byte Tracker, Overlap Filter, Distance Measurement, Corner Visualization, Florence-2 Model, Color Visualization, Path Deviation, Detections Combine, Size Measurement, Circle Visualization, Time in Zone, Dot Visualization, Detections Stitch, Line Counter, Ellipse Visualization, Velocity, Model Monitoring Inference Aggregator, Byte Tracker, Path Deviation, Time in Zone, Roboflow Dataset Upload, Dynamic Crop, Detections Stabilizer, Detections Consensus, Crop Visualization, Detections Merge, Perspective Correction, Florence-2 Model, Roboflow Custom Metadata, Trace Visualization, Byte Tracker, Stitch OCR Detections, Segment Anything 2 Model
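To make the idea of connecting blocks more concrete, here is a minimal sketch that feeds the Moondream2 predictions into Bounding Box Visualization, one of the blocks listed under outputs. The identifier roboflow_core/bounding_box_visualization@v1, its image and predictions fields, the step names, and the helper step_output are assumptions for illustration; check that block's own page before relying on them.

```python
import json


def step_output(step, field):
    """Hypothetical helper: build a selector for one output of a step."""
    return f"$steps.{step['name']}.{field}"


# Moondream2 step, following the fields documented on this page.
moondream_step = {
    "type": "roboflow_core/moondream2@v1",
    "name": "moondream2",
    "images": "$inputs.image",
    "prompt": "my prompt",
    "model_version": "moondream2/moondream2_2b_jul24",
}

# Downstream step. Its type identifier and field names are assumptions.
visualization_step = {
    "type": "roboflow_core/bounding_box_visualization@v1",
    "name": "bbox_visualization",
    "image": "$inputs.image",
    "predictions": step_output(moondream_step, "predictions"),
}

steps = [moondream_step, visualization_step]
print(json.dumps(steps, indent=2))
```

The connection is expressed purely through the selector string, so the downstream step refers to Moondream2 by its step name rather than by its position in the list.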
Input and Output Bindings
The connections available to a block depend on its binding kinds. Check which binding kinds
Moondream2 in version v1 has.
Bindings
- input
    - images (image): The image to infer on.
    - prompt (string): Optional text prompt to provide additional context to Moondream2.
    - model_version (roboflow_model_id): The Moondream2 model to be used for inference.
- output
    - predictions (object_detection_prediction): Prediction with detected bounding boxes in the form of an sv.Detections(...) object.
Example JSON definition of step Moondream2 in version v1
{
    "name": "<your_step_name_here>",
    "type": "roboflow_core/moondream2@v1",
    "images": "$inputs.image",
    "prompt": "my prompt",
    "model_version": "moondream2/moondream2_2b_jul24"
}
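The step definition only does something once it sits inside a complete workflow definition. The sketch below wraps it with one image input and one output, assuming the usual top-level keys of a workflow definition (version, inputs, steps, outputs). The function build_workflow, the step name moondream2, the output name predictions, and the prompt "forklift" are all illustrative choices, not part of this page.

```python
import json


def build_workflow(prompt):
    """Hypothetical helper: wrap the Moondream2 step in a workflow definition."""
    return {
        "version": "1.0",
        # One image input, referenced by the step as $inputs.image.
        "inputs": [{"type": "WorkflowImage", "name": "image"}],
        "steps": [
            {
                "name": "moondream2",
                "type": "roboflow_core/moondream2@v1",
                "images": "$inputs.image",
                "prompt": prompt,
                "model_version": "moondream2/moondream2_2b_jul24",
            }
        ],
        # Expose the step's predictions as a workflow output.
        "outputs": [
            {
                "type": "JsonField",
                "name": "predictions",
                "selector": "$steps.moondream2.predictions",
            }
        ],
    }


workflow = build_workflow("forklift")
print(json.dumps(workflow, indent=2))
```

If you use the inference_sdk package, a definition like this can typically be passed to InferenceHTTPClient.run_workflow through its specification argument; treat that as an assumption and confirm it against the version you have installed.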