Crop Visualization¶
Class: CropVisualizationBlockV1
Source: inference.core.workflows.core_steps.visualizations.crop.v1.CropVisualizationBlockV1
Display scaled-up, zoomed-in views of detected objects overlaid on the original image, allowing detailed inspection of small or distant objects while maintaining context with the full scene.
How This Block Works¶
This block takes an image and detection predictions and creates scaled-up, zoomed-in crops of each detected object, then displays these enlarged crops on the original image. The block:
- Takes an image and predictions as input
- Identifies detected regions from bounding boxes or segmentation masks
- Extracts the image region for each detected object (crops the object from the original image)
- Scales up each crop by the specified scale factor (e.g., 2x makes objects twice as large)
- Applies color styling to the crop border based on the selected color palette, with colors assigned by class, index, or track ID
- Positions the scaled crop on the image at the specified anchor point relative to the original detection location using Supervision's CropAnnotator
- Draws a colored border around the scaled crop with the specified thickness
- Returns an annotated image with scaled-up object crops overlaid on the original image
The block works with both object detection predictions (using bounding boxes) and instance segmentation predictions (using masks). When masks are available, it crops the exact shape of detected objects; otherwise, it crops rectangular bounding box regions. The scale factor allows you to zoom in on objects, making small or distant objects more visible and easier to inspect. The scaled crops are positioned relative to their original detection locations, allowing you to see both the zoomed-in detail and the object's position in the full scene context.
Common Use Cases¶
- Small Object Inspection: Zoom in on small detected objects (e.g., defects, small products, distant objects) to make them more visible and easier to inspect while maintaining scene context
- Detail Visualization: Display enlarged views of detected objects for detailed analysis, quality control, or inspection workflows where fine details need to be visible
- Multi-Scale Object Display: Show both the full scene and zoomed-in object details simultaneously, useful for applications where context and detail are both important
- Quality Control and Inspection: Inspect detected defects, products, or components at higher magnification while keeping the original detection location visible for reference
- Presentation and Reporting: Create visualizations that highlight detected objects with zoomed-in views for reports, documentation, or presentations where both overview and detail are needed
- User Interface Enhancement: Provide zoomed-in object views in user interfaces, dashboards, or interactive applications where users need to see object details without losing scene context
Connecting to Other Blocks¶
The annotated image from this block can be connected to:
- Other visualization blocks (e.g., Label Visualization, Bounding Box Visualization, Polygon Visualization) to combine scaled crops with additional annotations for comprehensive visualization
- Data storage blocks (e.g., Local File Sink, CSV Formatter, Roboflow Dataset Upload) to save images with scaled crops for documentation, reporting, or analysis
- Webhook blocks to send visualized results with scaled crops to external systems, APIs, or web applications for display in dashboards or monitoring tools
- Notification blocks (e.g., Email Notification, Slack Notification) to send annotated images with scaled crops as visual evidence in alerts or reports
- Video output blocks to create annotated video streams or recordings with scaled crops for live monitoring, detailed inspection, or post-processing analysis
Type identifier¶
Use the following identifier in step "type" field: roboflow_core/crop_visualization@v1to add the block as
as step in your workflow.
Properties¶
| Name | Type | Description | Refs |
|---|---|---|---|
name |
str |
Enter a unique identifier for this step.. | ❌ |
copy_image |
bool |
Enable this option to create a copy of the input image for visualization, preserving the original. Use this when stacking multiple visualizations.. | ✅ |
color_palette |
str |
Select a color palette for the visualised elements.. | ✅ |
palette_size |
int |
Specify the number of colors in the palette. This applies when using custom or Matplotlib palettes.. | ✅ |
custom_colors |
List[str] |
Define a list of custom colors for bounding boxes in HEX format.. | ✅ |
color_axis |
str |
Choose how bounding box colors are assigned.. | ✅ |
position |
str |
Anchor position for placing the scaled crop relative to the original detection's bounding box. Options include: CENTER (center of box), corners (TOP_LEFT, TOP_RIGHT, BOTTOM_LEFT, BOTTOM_RIGHT), edge midpoints (TOP_CENTER, CENTER_LEFT, CENTER_RIGHT, BOTTOM_CENTER), or CENTER_OF_MASS (center of mass of the object). The scaled crop will be positioned at this anchor point relative to the original detection location.. | ✅ |
scale_factor |
float |
Factor by which to scale (zoom) the cropped object region. A factor of 2.0 doubles the size of the crop, making objects twice as large. A factor of 1.0 shows the crop at original size. Higher values (e.g., 3.0, 4.0) create more zoomed-in views, useful for inspecting small or distant objects. Lower values (e.g., 1.5) provide subtle magnification.. | ✅ |
border_thickness |
int |
Thickness of the border outline around the scaled crop in pixels. Higher values create thicker, more visible borders that help distinguish the scaled crop from the background.. | ✅ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to Crop Visualization in version v1.
- inputs:
Moondream2,Image Threshold,Stitch Images,Byte Tracker,Size Measurement,Keypoint Detection Model,Mask Visualization,Instance Segmentation Model,Path Deviation,Crop Visualization,QR Code Generator,Detections Stabilizer,Clip Comparison,Segment Anything 2 Model,Stability AI Image Generation,VLM As Detector,VLM As Classifier,Google Gemini,Overlap Filter,Qwen3.5-VL,Object Detection Model,Slack Notification,Dot Visualization,OpenAI,Motion Detection,Email Notification,Detections List Roll-Up,Instance Segmentation Model,Roboflow Dataset Upload,Depth Estimation,Contrast Equalization,Label Visualization,Stitch OCR Detections,Llama 3.2 Vision,Camera Focus,Polygon Zone Visualization,Detections Filter,Color Visualization,OpenAI,Dimension Collapse,Template Matching,Florence-2 Model,Model Monitoring Inference Aggregator,JSON Parser,Dynamic Crop,Background Color Visualization,Object Detection Model,SIFT Comparison,Line Counter Visualization,Clip Comparison,Image Preprocessing,PTZ Tracking (ONVIF),Blur Visualization,CSV Formatter,Triangle Visualization,Gaze Detection,OCR Model,Trace Visualization,Email Notification,Twilio SMS/MMS Notification,Byte Tracker,Image Convert Grayscale,Reference Path Visualization,YOLO-World Model,Single-Label Classification Model,LMM For Classification,Florence-2 Model,Perspective Correction,Stitch OCR Detections,OpenAI,Time in Zone,Circle Visualization,EasyOCR,Detections Consensus,Seg Preview,Multi-Label Classification Model,Detections Transformation,Stability AI Outpainting,SAM 3,Text Display,Anthropic Claude,Line Counter,Path Deviation,Relative Static Crop,OpenAI,Detections Combine,Local File Sink,Google Gemini,Image Slicer,Keypoint Detection Model,Bounding Rectangle,Distance Measurement,Ellipse Visualization,Byte Tracker,Halo Visualization,Anthropic Claude,Model Comparison Visualization,Corner Visualization,Buffer,Identify Outliers,Absolute Static Crop,Image Contours,Classification Label Visualization,Image Slicer,Camera Calibration,Detections Stitch,Camera Focus,Time in Zone,Grid Visualization,Background Subtraction,Cosine Similarity,SAM 3,Heatmap Visualization,SIFT,Identify Changes,CogVLM,Line Counter,Bounding Box Visualization,Polygon Visualization,Roboflow Dataset Upload,Pixelate Visualization,Roboflow Custom Metadata,Pixel Color Count,Image Blur,SIFT Comparison,Detections Classes Replacement,Morphological Transformation,Stability AI Inpainting,Webhook Sink,LMM,Detection Offset,Detection Event Log,Icon Visualization,VLM As Classifier,Twilio SMS Notification,Google Vision OCR,Polygon Visualization,Google Gemini,Anthropic Claude,Mask Area Measurement,Time in Zone,SAM 3,Detections Merge,Dynamic Zone,Halo Visualization,Velocity,VLM As Detector,Keypoint Visualization - outputs:
Moondream2,Image Threshold,Stitch Images,OpenAI,Byte Tracker,Multi-Label Classification Model,Keypoint Detection Model,Mask Visualization,Time in Zone,Instance Segmentation Model,Circle Visualization,EasyOCR,Seg Preview,Crop Visualization,Multi-Label Classification Model,SAM 3,Stability AI Outpainting,Text Display,Anthropic Claude,Detections Stabilizer,QR Code Detection,Relative Static Crop,Clip Comparison,OpenAI,Segment Anything 2 Model,Stability AI Image Generation,Halo Visualization,Google Gemini,Image Slicer,Keypoint Detection Model,VLM As Detector,VLM As Classifier,Google Gemini,Qwen3.5-VL,Object Detection Model,Ellipse Visualization,Dot Visualization,Halo Visualization,Anthropic Claude,Model Comparison Visualization,OpenAI,Corner Visualization,Motion Detection,Buffer,Absolute Static Crop,Image Contours,Classification Label Visualization,Instance Segmentation Model,Roboflow Dataset Upload,Depth Estimation,Dominant Color,Contrast Equalization,Image Slicer,Detections Stitch,Camera Focus,Label Visualization,Barcode Detection,Llama 3.2 Vision,Background Subtraction,Polygon Zone Visualization,Camera Focus,Color Visualization,Qwen2.5-VL,SAM 3,Heatmap Visualization,OpenAI,SIFT,CogVLM,Template Matching,Florence-2 Model,Roboflow Dataset Upload,Bounding Box Visualization,Polygon Visualization,Pixelate Visualization,Pixel Color Count,Image Blur,SIFT Comparison,Morphological Transformation,Dynamic Crop,Stability AI Inpainting,Background Color Visualization,Perception Encoder Embedding Model,LMM,Object Detection Model,Clip Comparison,Line Counter Visualization,Image Preprocessing,Icon Visualization,VLM As Classifier,Qwen3-VL,SmolVLM2,Blur Visualization,Triangle Visualization,Google Vision OCR,Polygon Visualization,Gaze Detection,OCR Model,Google Gemini,Trace Visualization,Email Notification,Anthropic Claude,Twilio SMS/MMS Notification,CLIP Embedding Model,SAM 3,Single-Label Classification Model,Image Convert Grayscale,Reference Path Visualization,YOLO-World Model,Single-Label Classification Model,LMM For Classification,Camera Calibration,Florence-2 Model,VLM As Detector,Perspective Correction,Keypoint Visualization
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
Crop Visualization in version v1 has.
Bindings
-
input
image(image): The image to visualize on..copy_image(boolean): Enable this option to create a copy of the input image for visualization, preserving the original. Use this when stacking multiple visualizations..predictions(Union[keypoint_detection_prediction,object_detection_prediction,instance_segmentation_prediction,rle_instance_segmentation_prediction]): Model predictions to visualize..color_palette(string): Select a color palette for the visualised elements..palette_size(integer): Specify the number of colors in the palette. This applies when using custom or Matplotlib palettes..custom_colors(list_of_values): Define a list of custom colors for bounding boxes in HEX format..color_axis(string): Choose how bounding box colors are assigned..position(string): Anchor position for placing the scaled crop relative to the original detection's bounding box. Options include: CENTER (center of box), corners (TOP_LEFT, TOP_RIGHT, BOTTOM_LEFT, BOTTOM_RIGHT), edge midpoints (TOP_CENTER, CENTER_LEFT, CENTER_RIGHT, BOTTOM_CENTER), or CENTER_OF_MASS (center of mass of the object). The scaled crop will be positioned at this anchor point relative to the original detection location..scale_factor(float): Factor by which to scale (zoom) the cropped object region. A factor of 2.0 doubles the size of the crop, making objects twice as large. A factor of 1.0 shows the crop at original size. Higher values (e.g., 3.0, 4.0) create more zoomed-in views, useful for inspecting small or distant objects. Lower values (e.g., 1.5) provide subtle magnification..border_thickness(integer): Thickness of the border outline around the scaled crop in pixels. Higher values create thicker, more visible borders that help distinguish the scaled crop from the background..
-
output
image(image): Image in workflows.
Example JSON definition of step Crop Visualization in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/crop_visualization@v1",
"image": "$inputs.image",
"copy_image": true,
"predictions": "$steps.object_detection_model.predictions",
"color_palette": "DEFAULT",
"palette_size": 10,
"custom_colors": [
"#FF0000",
"#00FF00",
"#0000FF"
],
"color_axis": "CLASS",
"position": "CENTER",
"scale_factor": 2.0,
"border_thickness": 2
}