Depth Estimation¶
Class: DepthEstimationBlockV1
Source: inference.core.workflows.core_steps.models.foundation.depth_estimation.v1.DepthEstimationBlockV1
π― This workflow block performs depth estimation on images using Apple's DepthPro model. It analyzes the spatial relationships
and depth information in images to create a depth map where:
π Each pixel's value represents its relative distance from the camera
π Lower values (darker colors) indicate closer objects
π Higher values (lighter colors) indicate further objects
The model outputs:
1. πΊοΈ A depth map showing the relative distances of objects in the scene
2. π The camera's field of view (in degrees)
3. π¬ The camera's focal length
This is particularly useful for:
- ποΈ Understanding 3D structure from 2D images
- π¨ Creating depth-aware visualizations
- π Analyzing spatial relationships in scenes
- πΆοΈ Applications in augmented reality and 3D reconstruction
β‘ The model runs efficiently on Apple Silicon (M1-M4) using Metal Performance Shaders (MPS) for accelerated inference.
Type identifier¶
Use the following identifier in step "type" field: roboflow_core/depth_estimation@v1to add the block as
as step in your workflow.
Properties¶
| Name | Type | Description | Refs |
|---|---|---|---|
name |
str |
Enter a unique identifier for this step.. | β |
model_version |
str |
The Depth Estimation model to be used for inference.. | β |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to Depth Estimation in version v1.
- inputs:
Florence-2 Model,Trace Visualization,Roboflow Dataset Upload,Classification Label Visualization,Stitch Images,Image Slicer,Ellipse Visualization,Crop Visualization,Grid Visualization,Morphological Transformation,Triangle Visualization,Reference Path Visualization,Roboflow Dataset Upload,Google Gemini,LMM,Stitch OCR Detections,Twilio SMS/MMS Notification,Image Slicer,Local File Sink,VLM As Classifier,Icon Visualization,QR Code Generator,Stability AI Outpainting,OpenAI,Florence-2 Model,Google Vision OCR,Camera Focus,Pixelate Visualization,Model Comparison Visualization,Image Preprocessing,Background Color Visualization,Clip Comparison,Color Visualization,Twilio SMS Notification,Polygon Zone Visualization,OpenAI,Halo Visualization,Background Subtraction,Keypoint Detection Model,Keypoint Visualization,Instance Segmentation Model,Contrast Equalization,EasyOCR,Image Blur,Polygon Visualization,Anthropic Claude,SIFT,Google Gemini,Webhook Sink,Perspective Correction,Object Detection Model,Circle Visualization,Blur Visualization,Dot Visualization,Camera Calibration,Heatmap Visualization,Image Threshold,Multi-Label Classification Model,Relative Static Crop,Google Gemini,Text Display,Email Notification,OpenAI,Single-Label Classification Model,Anthropic Claude,Depth Estimation,Mask Visualization,CSV Formatter,Stability AI Image Generation,Halo Visualization,Absolute Static Crop,OCR Model,Label Visualization,Stability AI Inpainting,Anthropic Claude,Corner Visualization,Image Convert Grayscale,Stitch OCR Detections,Roboflow Custom Metadata,SIFT Comparison,Polygon Visualization,CogVLM,VLM As Detector,Line Counter Visualization,Bounding Box Visualization,Llama 3.2 Vision,Camera Focus,Email Notification,Slack Notification,Dynamic Crop,Image Contours,Model Monitoring Inference Aggregator,LMM For Classification,OpenAI - outputs:
Florence-2 Model,Roboflow Dataset Upload,Trace Visualization,Image Contours,Seg Preview,Segment Anything 2 Model,Classification Label Visualization,Stitch Images,Single-Label Classification Model,Qwen3-VL,Clip Comparison,Ellipse Visualization,Image Slicer,Byte Tracker,SAM 3,Detections Stabilizer,Crop Visualization,Triangle Visualization,Morphological Transformation,Roboflow Dataset Upload,LMM,Twilio SMS/MMS Notification,Multi-Label Classification Model,Reference Path Visualization,SmolVLM2,Google Gemini,Image Slicer,Barcode Detection,VLM As Classifier,Icon Visualization,Stability AI Outpainting,OpenAI,Moondream2,Keypoint Detection Model,Florence-2 Model,Google Vision OCR,Pixel Color Count,Camera Focus,Pixelate Visualization,Model Comparison Visualization,Object Detection Model,Gaze Detection,Template Matching,Image Preprocessing,Background Color Visualization,Clip Comparison,Color Visualization,Polygon Zone Visualization,OpenAI,Background Subtraction,Halo Visualization,Keypoint Detection Model,Keypoint Visualization,Perception Encoder Embedding Model,Instance Segmentation Model,Contrast Equalization,EasyOCR,Image Blur,Anthropic Claude,Polygon Visualization,SIFT,Google Gemini,Perspective Correction,Object Detection Model,Circle Visualization,Blur Visualization,Dominant Color,Dot Visualization,YOLO-World Model,Multi-Label Classification Model,Heatmap Visualization,Image Threshold,Camera Calibration,Relative Static Crop,Google Gemini,Text Display,Email Notification,OpenAI,Instance Segmentation Model,Qwen2.5-VL,Single-Label Classification Model,Anthropic Claude,Depth Estimation,VLM As Detector,Mask Visualization,Stability AI Image Generation,Buffer,Halo Visualization,Absolute Static Crop,Detections Stitch,OCR Model,Label Visualization,Stability AI Inpainting,Motion Detection,Anthropic Claude,Corner Visualization,Image Convert Grayscale,QR Code Detection,SIFT Comparison,Polygon Visualization,CogVLM,SIFT Comparison,SAM 3,VLM As Detector,Line Counter Visualization,Bounding Box Visualization,CLIP Embedding Model,Llama 3.2 Vision,Camera Focus,SAM 3,VLM As Classifier,Dynamic Crop,Time in Zone,LMM For Classification,OpenAI
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
Depth Estimation in version v1 has.
Bindings
-
input
-
output
image(image): Image in workflows.normalized_depth(numpy_array): Numpy array.
Example JSON definition of step Depth Estimation in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/depth_estimation@v1",
"images": "$inputs.image",
"model_version": "depth-anything-v2/small"
}