Depth Estimation
Class: DepthEstimationBlockV1
Source: inference.core.workflows.core_steps.models.foundation.depth_estimation.v1.DepthEstimationBlockV1
This workflow block performs depth estimation on images using Apple's DepthPro model. It analyzes the spatial relationships
and depth information in images to create a depth map where:
- Each pixel's value represents its relative distance from the camera
- Lower values (darker colors) indicate closer objects
- Higher values (lighter colors) indicate further objects
The model outputs:
1. A depth map showing the relative distances of objects in the scene
2. The camera's field of view (in degrees)
3. The camera's focal length
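The field of view and focal length listed above are linked by the standard pinhole camera relation. The sketch below is illustrative only: it assumes a horizontal field of view and a focal length expressed in pixels, neither of which this page states, and the function names are hypothetical helpers rather than part of the block.

```python
import math


def focal_length_px(image_width_px: float, fov_degrees: float) -> float:
    """Pinhole relation: f = (W / 2) / tan(FOV / 2), assuming a horizontal FOV."""
    if not 0.0 < fov_degrees < 180.0:
        raise ValueError("fov_degrees must lie strictly between 0 and 180")
    return (image_width_px / 2.0) / math.tan(math.radians(fov_degrees) / 2.0)


def fov_degrees_from_focal(image_width_px: float, focal_px: float) -> float:
    """Inverse relation: FOV = 2 * atan((W / 2) / f), returned in degrees."""
    if focal_px <= 0.0:
        raise ValueError("focal_px must be positive")
    return math.degrees(2.0 * math.atan((image_width_px / 2.0) / focal_px))


# Hypothetical values: a 1000-pixel-wide image with a 90-degree field of view.
f_px = focal_length_px(1000, 90.0)
recovered_fov = fov_degrees_from_focal(1000, f_px)
```

If the block reports a vertical or diagonal field of view instead, the same formula applies with the image height or diagonal in place of the width.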
Depth estimation is particularly useful for:
- Understanding 3D structure from 2D images
- Creating depth-aware visualizations
- Analyzing spatial relationships in scenes
- Applications in augmented reality and 3D reconstruction
The model runs efficiently on Apple Silicon (M1-M4) using Metal Performance Shaders (MPS) for accelerated inference.
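To make the depth map convention described above concrete (lower values are closer and darker, higher values are further and lighter), here is a minimal NumPy sketch that rescales a depth array to an 8-bit grayscale image. The helper name `depth_to_grayscale` and the sample array are hypothetical, and min-max rescaling is an assumption about how one might visualize the array, not a description of what the block does internally.

```python
import numpy as np


def depth_to_grayscale(depth: np.ndarray) -> np.ndarray:
    """Rescale a depth array to uint8 so smaller (closer) values render darker."""
    depth = np.asarray(depth, dtype=np.float32)
    lo, hi = float(depth.min()), float(depth.max())
    if hi - lo < 1e-12:
        # A constant depth map carries no contrast; return an all-black image.
        return np.zeros(depth.shape, dtype=np.uint8)
    scaled = (depth - lo) / (hi - lo)  # now in [0, 1]
    return np.round(scaled * 255.0).astype(np.uint8)


# Hypothetical 2x2 depth map: top-left pixel is closest, bottom-right is furthest.
sample_depth = np.array([[0.1, 0.4], [0.7, 0.9]], dtype=np.float32)
gray = depth_to_grayscale(sample_depth)
```

The resulting array can be saved or displayed with any image library that accepts single-channel uint8 data.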
Type identifier
Use the following identifier in the step "type" field: roboflow_core/depth_estimation@v1 to add the block as
a step in your workflow.
Properties
| Name | Type | Description | Refs |
|---|---|---|---|
| name | str | Enter a unique identifier for this step. | ❌ |
| model_version | str | The Depth Estimation model to be used for inference. | ✅ |
The Refs column indicates whether the property can be parametrised with dynamic values available
at workflow runtime. See Bindings for more info.
Available Connections
Compatible Blocks
Check what blocks you can connect to Depth Estimation in version v1.
- inputs:
Triangle Visualization, Morphological Transformation, Roboflow Dataset Upload, Ellipse Visualization, LMM, Florence-2 Model, Blur Visualization, Halo Visualization, Anthropic Claude, Google Gemini, Camera Focus, Llama 3.2 Vision, Model Comparison Visualization, Keypoint Visualization, Pixelate Visualization, Image Slicer, Stitch OCR Detections, Roboflow Dataset Upload, Line Counter Visualization, Label Visualization, SIFT Comparison, QR Code Generator, Email Notification, Slack Notification, Corner Visualization, Image Slicer, Florence-2 Model, CSV Formatter, EasyOCR, Object Detection Model, Anthropic Claude, OpenAI, Google Gemini, Bounding Box Visualization, Keypoint Detection Model, Anthropic Claude, Background Subtraction, Background Color Visualization, Image Convert Grayscale, Camera Calibration, Polygon Visualization, Image Blur, VLM As Classifier, Relative Static Crop, Clip Comparison, Heatmap Visualization, CogVLM, Mask Visualization, Image Preprocessing, Twilio SMS Notification, VLM As Detector, OpenAI, OCR Model, SIFT, Stitch Images, Stability AI Outpainting, Stitch OCR Detections, Dynamic Crop, Model Monitoring Inference Aggregator, Circle Visualization, Color Visualization, Trace Visualization, OpenAI, Icon Visualization, Dot Visualization, Email Notification, Instance Segmentation Model, Camera Focus, Twilio SMS/MMS Notification, Depth Estimation, Contrast Equalization, LMM For Classification, Roboflow Custom Metadata, Grid Visualization, Text Display, Reference Path Visualization, Image Threshold, Perspective Correction, Image Contours, Polygon Zone Visualization, Multi-Label Classification Model, Polygon Visualization, Local File Sink, Halo Visualization, Google Vision OCR, Stability AI Inpainting, Crop Visualization, Google Gemini, Webhook Sink, Absolute Static Crop, Classification Label Visualization, OpenAI, Single-Label Classification Model, Stability AI Image Generation
- outputs:
Triangle Visualization, Detections Stitch, Roboflow Dataset Upload, Ellipse Visualization, Morphological Transformation, LMM, Florence-2 Model, Blur Visualization, CLIP Embedding Model, Anthropic Claude, Halo Visualization, Google Gemini, Camera Focus, Llama 3.2 Vision, Motion Detection, Model Comparison Visualization, VLM As Detector, Keypoint Visualization, Qwen3-VL, Pixelate Visualization, Image Slicer, SIFT Comparison, Roboflow Dataset Upload, Line Counter Visualization, Keypoint Detection Model, SmolVLM2, Label Visualization, SIFT Comparison, Clip Comparison, Buffer, SAM 3, Detections Stabilizer, Object Detection Model, EasyOCR, Image Slicer, Corner Visualization, Florence-2 Model, Object Detection Model, SAM 3, QR Code Detection, Anthropic Claude, OpenAI, Google Gemini, Perception Encoder Embedding Model, Bounding Box Visualization, Keypoint Detection Model, Anthropic Claude, Pixel Color Count, Background Subtraction, Background Color Visualization, Image Convert Grayscale, Camera Calibration, Polygon Visualization, Image Blur, VLM As Classifier, Relative Static Crop, Clip Comparison, Heatmap Visualization, CogVLM, Mask Visualization, Image Preprocessing, VLM As Detector, Instance Segmentation Model, OpenAI, OCR Model, SIFT, Stitch Images, Single-Label Classification Model, Moondream2, Stability AI Outpainting, Dynamic Crop, Circle Visualization, Byte Tracker, OpenAI, Trace Visualization, Color Visualization, Qwen2.5-VL, Icon Visualization, YOLO-World Model, Dot Visualization, Time in Zone, Email Notification, Instance Segmentation Model, Camera Focus, Twilio SMS/MMS Notification, Contrast Equalization, Depth Estimation, Segment Anything 2 Model, LMM For Classification, Text Display, Dominant Color, Reference Path Visualization, Image Threshold, Perspective Correction, Multi-Label Classification Model, Image Contours, Polygon Zone Visualization, Polygon Visualization, Halo Visualization, Google Vision OCR, SAM 3, Stability AI Inpainting, Crop Visualization, Template Matching, Google Gemini, Barcode Detection, VLM As Classifier, Classification Label Visualization, Seg Preview, OpenAI, Absolute Static Crop, Multi-Label Classification Model, Single-Label Classification Model, Gaze Detection, Stability AI Image Generation
Input and Output Bindings
The available connections depend on the block's binding kinds. Check which binding kinds
Depth Estimation in version v1 has.
Bindings
- input
- output
    - image (image): Image in workflows.
    - normalized_depth (numpy_array): Numpy array.
Example JSON definition of step Depth Estimation in version v1
{
    "name": "<your_step_name_here>",
    "type": "roboflow_core/depth_estimation@v1",
    "images": "$inputs.image",
    "model_version": "depth-anything-v2/small"
}
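The step definition above is only one entry in a workflow. As a rough sketch of how it might sit inside a complete definition, the Python snippet below wraps it in a surrounding envelope and serializes it to JSON. The envelope keys (`version`, `inputs`, `outputs`), the `WorkflowImage` and `JsonField` type names, and the helper `build_depth_workflow` are assumptions not taken from this page; only the step fields and the `normalized_depth` output name come from the sections above.

```python
import json


def build_depth_workflow(
    step_name: str = "depth",
    model_version: str = "depth-anything-v2/small",
) -> dict:
    """Assemble a workflow definition containing a single Depth Estimation step."""
    # Step fields mirror the example JSON definition on this page.
    step = {
        "name": step_name,
        "type": "roboflow_core/depth_estimation@v1",
        "images": "$inputs.image",
        "model_version": model_version,
    }
    # Everything outside "steps" is an assumed envelope, shown for illustration.
    return {
        "version": "1.0",
        "inputs": [{"type": "WorkflowImage", "name": "image"}],
        "steps": [step],
        "outputs": [
            {
                "type": "JsonField",
                "name": "normalized_depth",
                "selector": f"$steps.{step_name}.normalized_depth",
            }
        ],
    }


workflow = build_depth_workflow()
workflow_json = json.dumps(workflow, indent=4)
print(workflow_json)
```

Building the definition in code keeps the step name and the output selector in sync, since both are derived from the same `step_name` argument.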