Crop Visualization¶
Class: CropVisualizationBlockV1
Source: inference.core.workflows.core_steps.visualizations.crop.v1.CropVisualizationBlockV1
Display scaled-up, zoomed-in views of detected objects overlaid on the original image, allowing detailed inspection of small or distant objects while maintaining context with the full scene.
How This Block Works¶
This block takes an image and detection predictions and creates scaled-up, zoomed-in crops of each detected object, then displays these enlarged crops on the original image. The block:
- Takes an image and predictions as input
- Identifies detected regions from bounding boxes or segmentation masks
- Extracts the image region for each detected object (crops the object from the original image)
- Scales up each crop by the specified scale factor (e.g., 2x makes objects twice as large)
- Applies color styling to the crop border based on the selected color palette, with colors assigned by class, index, or track ID
- Positions the scaled crop on the image at the specified anchor point relative to the original detection location using Supervision's CropAnnotator
- Draws a colored border around the scaled crop with the specified thickness
- Returns an annotated image with scaled-up object crops overlaid on the original image
The block works with both object detection predictions (using bounding boxes) and instance segmentation predictions (using masks). When masks are available, it crops the exact shape of detected objects; otherwise, it crops rectangular bounding box regions. The scale factor allows you to zoom in on objects, making small or distant objects more visible and easier to inspect. The scaled crops are positioned relative to their original detection locations, allowing you to see both the zoomed-in detail and the object's position in the full scene context.
Common Use Cases¶
- Small Object Inspection: Zoom in on small detected objects (e.g., defects, small products, distant objects) to make them more visible and easier to inspect while maintaining scene context
- Detail Visualization: Display enlarged views of detected objects for detailed analysis, quality control, or inspection workflows where fine details need to be visible
- Multi-Scale Object Display: Show both the full scene and zoomed-in object details simultaneously, useful for applications where context and detail are both important
- Quality Control and Inspection: Inspect detected defects, products, or components at higher magnification while keeping the original detection location visible for reference
- Presentation and Reporting: Create visualizations that highlight detected objects with zoomed-in views for reports, documentation, or presentations where both overview and detail are needed
- User Interface Enhancement: Provide zoomed-in object views in user interfaces, dashboards, or interactive applications where users need to see object details without losing scene context
Connecting to Other Blocks¶
The annotated image from this block can be connected to:
- Other visualization blocks (e.g., Label Visualization, Bounding Box Visualization, Polygon Visualization) to combine scaled crops with additional annotations for comprehensive visualization
- Data storage blocks (e.g., Local File Sink, CSV Formatter, Roboflow Dataset Upload) to save images with scaled crops for documentation, reporting, or analysis
- Webhook blocks to send visualized results with scaled crops to external systems, APIs, or web applications for display in dashboards or monitoring tools
- Notification blocks (e.g., Email Notification, Slack Notification) to send annotated images with scaled crops as visual evidence in alerts or reports
- Video output blocks to create annotated video streams or recordings with scaled crops for live monitoring, detailed inspection, or post-processing analysis
Type identifier¶
Use the following identifier in step "type" field: roboflow_core/crop_visualization@v1to add the block as
as step in your workflow.
Properties¶
| Name | Type | Description | Refs |
|---|---|---|---|
name |
str |
Enter a unique identifier for this step.. | ❌ |
copy_image |
bool |
Enable this option to create a copy of the input image for visualization, preserving the original. Use this when stacking multiple visualizations.. | ✅ |
color_palette |
str |
Select a color palette for the visualised elements.. | ✅ |
palette_size |
int |
Specify the number of colors in the palette. This applies when using custom or Matplotlib palettes.. | ✅ |
custom_colors |
List[str] |
Define a list of custom colors for bounding boxes in HEX format.. | ✅ |
color_axis |
str |
Choose how bounding box colors are assigned.. | ✅ |
position |
str |
Anchor position for placing the scaled crop relative to the original detection's bounding box. Options include: CENTER (center of box), corners (TOP_LEFT, TOP_RIGHT, BOTTOM_LEFT, BOTTOM_RIGHT), edge midpoints (TOP_CENTER, CENTER_LEFT, CENTER_RIGHT, BOTTOM_CENTER), or CENTER_OF_MASS (center of mass of the object). The scaled crop will be positioned at this anchor point relative to the original detection location.. | ✅ |
scale_factor |
float |
Factor by which to scale (zoom) the cropped object region. A factor of 2.0 doubles the size of the crop, making objects twice as large. A factor of 1.0 shows the crop at original size. Higher values (e.g., 3.0, 4.0) create more zoomed-in views, useful for inspecting small or distant objects. Lower values (e.g., 1.5) provide subtle magnification.. | ✅ |
border_thickness |
int |
Thickness of the border outline around the scaled crop in pixels. Higher values create thicker, more visible borders that help distinguish the scaled crop from the background.. | ✅ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to Crop Visualization in version v1.
- inputs:
Stability AI Outpainting,SAM 3,Motion Detection,Contrast Enhancement,Camera Focus,Image Preprocessing,Corner Visualization,Ellipse Visualization,Seg Preview,Roboflow Vision Events,Object Detection Model,Cosine Similarity,Heatmap Visualization,Trace Visualization,OC-SORT Tracker,VLM As Classifier,Time in Zone,OpenAI,Email Notification,Byte Tracker,Keypoint Visualization,Detections Consensus,Model Comparison Visualization,YOLO-World Model,JSON Parser,Polygon Zone Visualization,Byte Tracker,Dynamic Crop,Polygon Visualization,QR Code Generator,GLM-OCR,Stitch Images,OpenRouter,Image Blur,Model Monitoring Inference Aggregator,Dynamic Zone,Clip Comparison,Detections Stitch,Time in Zone,Detections List Roll-Up,Segment Anything 2 Model,Instance Segmentation Model,Google Gemini,Buffer,Pixelate Visualization,EasyOCR,SIFT,Contrast Equalization,Image Threshold,Instance Segmentation Model,Polygon Visualization,Anthropic Claude,Halo Visualization,Roboflow Custom Metadata,Keypoint Detection Model,Local File Sink,Florence-2 Model,Icon Visualization,Detection Offset,Image Contours,SAM2 Video Tracker,Single-Label Classification Model,OpenAI,Detections Filter,SAM 3,Grid Visualization,ByteTrack Tracker,VLM As Detector,Size Measurement,Object Detection Model,Detections Transformation,LMM,Image Convert Grayscale,Reference Path Visualization,Stitch OCR Detections,Keypoint Detection Model,SIFT Comparison,Identify Changes,Roboflow Dataset Upload,CSV Formatter,S3 Sink,BoT-SORT Tracker,SIFT Comparison,OpenAI-Compatible LLM,Morphological Transformation,Object Detection Model,Identify Outliers,Crop Visualization,Blur Visualization,Mask Visualization,Stability AI Image Generation,Qwen-VL,Stitch OCR Detections,Velocity,Google Gemma API,Image Slicer,Qwen 3.5 API,Path Deviation,Background Color Visualization,Slack Notification,Anthropic Claude,Dimension Collapse,Qwen 3.6 API,Webhook Sink,Color Visualization,Bounding Box Visualization,Google Gemma,Relative Static Crop,Detection Event Log,Bounding Rectangle,Path Deviation,CogVLM,Llama 3.2 Vision,Instance Segmentation Model,Qwen3.5-VL,Camera Focus,Instance Segmentation Model,Google Vision OCR,Google Gemini,SAM 3,Llama 3.2 Vision,Distance Measurement,SORT Tracker,Twilio SMS Notification,Detections Stabilizer,Moondream2,Anthropic Claude,Image Slicer,Depth Estimation,OpenAI,Multi-Label Classification Model,Gaze Detection,Template Matching,Classification Label Visualization,PTZ Tracking (ONVIF),Florence-2 Model,MoonshotAI Kimi,Time in Zone,MoonshotAI Kimi,Line Counter,Dot Visualization,Background Subtraction,Keypoint Detection Model,Roboflow Dataset Upload,Stability AI Inpainting,Per-Class Confidence Filter,Line Counter,Detections Classes Replacement,Label Visualization,Overlap Filter,Detections Merge,Absolute Static Crop,Google Gemini,VLM As Classifier,Camera Calibration,Halo Visualization,Email Notification,OpenAI,Pixel Color Count,Clip Comparison,LMM For Classification,Text Display,Mask Area Measurement,Circle Visualization,Line Counter Visualization,Byte Tracker,OCR Model,Detections Combine,VLM As Detector,Image Stack,Morphological Transformation,Twilio SMS/MMS Notification,Mask Edge Snap,Triangle Visualization,Perspective Correction - outputs:
Stability AI Outpainting,Multi-Label Classification Model,CLIP Embedding Model,SAM 3,Motion Detection,Contrast Enhancement,Camera Focus,Image Preprocessing,Seg Preview,Ellipse Visualization,Corner Visualization,Roboflow Vision Events,Object Detection Model,Heatmap Visualization,Trace Visualization,OC-SORT Tracker,VLM As Classifier,Time in Zone,OpenAI,Byte Tracker,Keypoint Visualization,Model Comparison Visualization,YOLO-World Model,Polygon Zone Visualization,Single-Label Classification Model,Dynamic Crop,Polygon Visualization,GLM-OCR,Stitch Images,OpenRouter,Semantic Segmentation Model,Image Blur,Clip Comparison,Detections Stitch,Segment Anything 2 Model,Instance Segmentation Model,Google Gemini,Buffer,Pixelate Visualization,EasyOCR,SIFT,Contrast Equalization,Image Threshold,Instance Segmentation Model,Polygon Visualization,Anthropic Claude,Halo Visualization,Qwen2.5-VL,Keypoint Detection Model,Florence-2 Model,Icon Visualization,Single-Label Classification Model,Image Contours,OpenAI,SAM2 Video Tracker,SAM 3,ByteTrack Tracker,VLM As Detector,Barcode Detection,Multi-Label Classification Model,Object Detection Model,LMM,Image Convert Grayscale,Reference Path Visualization,Dominant Color,Keypoint Detection Model,SIFT Comparison,Roboflow Dataset Upload,BoT-SORT Tracker,Qwen3-VL,Object Detection Model,Morphological Transformation,Crop Visualization,Blur Visualization,Qwen-VL,Mask Visualization,Stability AI Image Generation,Google Gemma API,Qwen3.5,Image Slicer,Qwen 3.5 API,Perception Encoder Embedding Model,Background Color Visualization,Anthropic Claude,Qwen 3.6 API,Color Visualization,Bounding Box Visualization,Google Gemma,Relative Static Crop,Llama 3.2 Vision,CogVLM,Instance Segmentation Model,Qwen3.5-VL,Instance Segmentation Model,Google Vision OCR,Camera Focus,Google Gemini,Llama 3.2 Vision,SAM 3,Single-Label Classification Model,SORT Tracker,SmolVLM2,Detections Stabilizer,Moondream2,Anthropic Claude,Image Slicer,OpenAI,Depth Estimation,Multi-Label Classification Model,Gaze Detection,Template Matching,Classification Label Visualization,Florence-2 Model,MoonshotAI Kimi,MoonshotAI Kimi,Dot Visualization,Keypoint Detection Model,Background Subtraction,Roboflow Dataset Upload,Stability AI Inpainting,Semantic Segmentation Model,QR Code Detection,Label Visualization,Absolute Static Crop,Google Gemini,VLM As Classifier,Halo Visualization,Email Notification,Camera Calibration,OpenAI,Clip Comparison,Pixel Color Count,LMM For Classification,Text Display,Line Counter Visualization,Circle Visualization,OCR Model,VLM As Detector,Image Stack,Morphological Transformation,Twilio SMS/MMS Notification,Mask Edge Snap,Triangle Visualization,Perspective Correction
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
Crop Visualization in version v1 has.
Bindings
-
input
image(image): The image to visualize on..copy_image(boolean): Enable this option to create a copy of the input image for visualization, preserving the original. Use this when stacking multiple visualizations..predictions(Union[keypoint_detection_prediction,rle_instance_segmentation_prediction,instance_segmentation_prediction,object_detection_prediction]): Model predictions to visualize..color_palette(string): Select a color palette for the visualised elements..palette_size(integer): Specify the number of colors in the palette. This applies when using custom or Matplotlib palettes..custom_colors(list_of_values): Define a list of custom colors for bounding boxes in HEX format..color_axis(string): Choose how bounding box colors are assigned..position(string): Anchor position for placing the scaled crop relative to the original detection's bounding box. Options include: CENTER (center of box), corners (TOP_LEFT, TOP_RIGHT, BOTTOM_LEFT, BOTTOM_RIGHT), edge midpoints (TOP_CENTER, CENTER_LEFT, CENTER_RIGHT, BOTTOM_CENTER), or CENTER_OF_MASS (center of mass of the object). The scaled crop will be positioned at this anchor point relative to the original detection location..scale_factor(float): Factor by which to scale (zoom) the cropped object region. A factor of 2.0 doubles the size of the crop, making objects twice as large. A factor of 1.0 shows the crop at original size. Higher values (e.g., 3.0, 4.0) create more zoomed-in views, useful for inspecting small or distant objects. Lower values (e.g., 1.5) provide subtle magnification..border_thickness(integer): Thickness of the border outline around the scaled crop in pixels. Higher values create thicker, more visible borders that help distinguish the scaled crop from the background..
-
output
image(image): Image in workflows.
Example JSON definition of step Crop Visualization in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/crop_visualization@v1",
"image": "$inputs.image",
"copy_image": true,
"predictions": "$steps.object_detection_model.predictions",
"color_palette": "DEFAULT",
"palette_size": 10,
"custom_colors": [
"#FF0000",
"#00FF00",
"#0000FF"
],
"color_axis": "CLASS",
"position": "CENTER",
"scale_factor": 2.0,
"border_thickness": 2
}