Depth Estimation¶
Class: DepthEstimationBlockV1
Source: inference.core.workflows.core_steps.models.foundation.depth_estimation.v1.DepthEstimationBlockV1
π― This workflow block performs depth estimation on images using Apple's DepthPro model. It analyzes the spatial relationships
and depth information in images to create a depth map where:
π Each pixel's value represents its relative distance from the camera
π Lower values (darker colors) indicate closer objects
π Higher values (lighter colors) indicate further objects
The model outputs:
1. πΊοΈ A depth map showing the relative distances of objects in the scene
2. π The camera's field of view (in degrees)
3. π¬ The camera's focal length
This is particularly useful for:
- ποΈ Understanding 3D structure from 2D images
- π¨ Creating depth-aware visualizations
- π Analyzing spatial relationships in scenes
- πΆοΈ Applications in augmented reality and 3D reconstruction
β‘ The model runs efficiently on Apple Silicon (M1-M4) using Metal Performance Shaders (MPS) for accelerated inference.
Type identifier¶
Use the following identifier in step "type" field: roboflow_core/depth_estimation@v1to add the block as
as step in your workflow.
Properties¶
| Name | Type | Description | Refs |
|---|---|---|---|
name |
str |
Enter a unique identifier for this step.. | β |
model_version |
str |
The Depth Estimation model to be used for inference.. | β |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to Depth Estimation in version v1.
- inputs:
Contrast Equalization,Llama 3.2 Vision,Clip Comparison,Anthropic Claude,VLM as Detector,Local File Sink,Polygon Visualization,QR Code Generator,Image Blur,SIFT Comparison,Email Notification,Roboflow Dataset Upload,Text Display,Model Comparison Visualization,Camera Focus,SIFT,LMM,Google Vision OCR,Mask Visualization,Anthropic Claude,Relative Static Crop,Circle Visualization,EasyOCR,Pixelate Visualization,Stability AI Inpainting,Reference Path Visualization,Instance Segmentation Model,Perspective Correction,Ellipse Visualization,Crop Visualization,Halo Visualization,Image Threshold,Keypoint Detection Model,CSV Formatter,Florence-2 Model,Twilio SMS Notification,Image Convert Grayscale,Corner Visualization,Image Preprocessing,Icon Visualization,Background Subtraction,Image Contours,Image Slicer,Depth Estimation,Multi-Label Classification Model,Stitch Images,Dynamic Crop,Bounding Box Visualization,VLM as Classifier,Model Monitoring Inference Aggregator,Line Counter Visualization,Blur Visualization,Morphological Transformation,Camera Calibration,Polygon Zone Visualization,Single-Label Classification Model,Email Notification,Keypoint Visualization,OCR Model,Roboflow Custom Metadata,Google Gemini,OpenAI,Camera Focus,Trace Visualization,OpenAI,CogVLM,Color Visualization,Absolute Static Crop,Image Slicer,Dot Visualization,Label Visualization,Slack Notification,Florence-2 Model,Google Gemini,Google Gemini,Grid Visualization,Object Detection Model,LMM For Classification,OpenAI,Stitch OCR Detections,OpenAI,Classification Label Visualization,Background Color Visualization,Stability AI Outpainting,Stitch OCR Detections,Roboflow Dataset Upload,Twilio SMS/MMS Notification,Anthropic Claude,Triangle Visualization,Stability AI Image Generation,Webhook Sink - outputs:
Contrast Equalization,Llama 3.2 Vision,Clip Comparison,SIFT Comparison,Anthropic Claude,VLM as Detector,Polygon Visualization,Image Blur,SIFT Comparison,SmolVLM2,CLIP Embedding Model,Roboflow Dataset Upload,Text Display,Motion Detection,SIFT,Model Comparison Visualization,Camera Focus,Moondream2,LMM,Qwen3-VL,Single-Label Classification Model,Google Vision OCR,SAM 3,Anthropic Claude,Relative Static Crop,Mask Visualization,Object Detection Model,Keypoint Detection Model,Circle Visualization,Seg Preview,EasyOCR,Pixelate Visualization,Stability AI Inpainting,Multi-Label Classification Model,Time in Zone,VLM as Classifier,Reference Path Visualization,Instance Segmentation Model,Perspective Correction,Halo Visualization,Image Threshold,Ellipse Visualization,Crop Visualization,Keypoint Detection Model,Florence-2 Model,Detections Stabilizer,Image Convert Grayscale,Perception Encoder Embedding Model,Corner Visualization,Image Preprocessing,Barcode Detection,Icon Visualization,SAM 3,Background Subtraction,Segment Anything 2 Model,Qwen2.5-VL,Image Slicer,Image Contours,Depth Estimation,Multi-Label Classification Model,Pixel Color Count,Detections Stitch,Stitch Images,QR Code Detection,Dynamic Crop,Bounding Box Visualization,Anthropic Claude,VLM as Classifier,YOLO-World Model,Instance Segmentation Model,Line Counter Visualization,Blur Visualization,Morphological Transformation,Camera Calibration,Polygon Zone Visualization,Single-Label Classification Model,Email Notification,Stability AI Image Generation,Dominant Color,OCR Model,Keypoint Visualization,Google Gemini,OpenAI,Camera Focus,Trace Visualization,CogVLM,OpenAI,Image Slicer,Absolute Static Crop,Color Visualization,Dot Visualization,Label Visualization,Buffer,Florence-2 Model,Google Gemini,Google Gemini,Object Detection Model,LMM For Classification,Template Matching,OpenAI,OpenAI,Classification Label Visualization,Background Color Visualization,Stability AI Outpainting,Byte Tracker,SAM 3,Twilio SMS/MMS Notification,Roboflow Dataset Upload,Gaze Detection,Clip Comparison,Triangle Visualization,VLM as Detector
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
Depth Estimation in version v1 has.
Bindings
-
input
-
output
image(image): Image in workflows.normalized_depth(numpy_array): Numpy array.
Example JSON definition of step Depth Estimation in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/depth_estimation@v1",
"images": "$inputs.image",
"model_version": "depth-anything-v2/small"
}