Dimension Collapse¶
Class: DimensionCollapseBlockV1
Source: inference.core.workflows.core_steps.fusion.dimension_collapse.v1.DimensionCollapseBlockV1
Flatten nested batch data by reducing dimensionality from level n to level n-1, aggregating nested lists into a single flat list to enable data aggregation, batch flattening, and dimensionality reduction workflows where nested batch outputs (such as classification or OCR results from dynamically cropped images) need to be collapsed into a single-level batch for downstream processing.
How This Block Works¶
This block collapses the dimensionality of batch data by flattening nested lists one level. The block:
- Receives batch data at dimensionality level n (nested batch structure)
- Flattens the nested structure:
- Takes all elements from the nested batch structure
- Concatenates them into a single flat list
- Removes one level of nesting from the data structure
- Reduces dimensionality:
- Input data at level n (e.g., list of lists)
- Output data at level n-1 (e.g., single list)
- Maintains all data elements, just removes the nested structure
- Returns flattened output:
- Outputs a single list containing all elements from the nested input
- Elements are preserved in order (flattened sequentially)
- Output dimensionality is one level lower than input
This block is particularly useful when working with dynamically cropped images or other operations that create nested batch structures. For example, when you crop multiple objects from each image, you get a nested batch (level 2): a list where each element is itself a list of crops. Classification results for those crops also form a nested batch. The Dimension Collapse block flattens this nested structure into a single-level batch (level 1), allowing you to work with all results together.
Common Use Cases¶
- Aggregating Classification Results: Aggregate classification results from dynamically cropped images into a single list (e.g., classify crops from images then aggregate all results, collect classification results from multiple crops, flatten nested classification outputs), enabling classification aggregation workflows
- Aggregating OCR Results: Aggregate OCR results from dynamically cropped text regions into a single list (e.g., OCR crops from images then aggregate all text results, collect OCR results from multiple crops, flatten nested OCR outputs), enabling OCR aggregation workflows
- Batch Flattening: Flatten nested batch structures for downstream processing (e.g., flatten nested batches for analysis, reduce batch dimensionality for storage, collapse nested structures for filtering), enabling batch flattening workflows
- Data Aggregation: Aggregate results from nested batch operations into flat lists (e.g., aggregate results from nested operations, collect outputs from nested batches, flatten nested operation results), enabling data aggregation workflows
- Dimensionality Reduction: Reduce batch dimensionality to match requirements of downstream blocks (e.g., reduce dimensionality for blocks requiring level 1 inputs, flatten nested batches for compatibility, adjust dimensionality for workflow connections), enabling dimensionality adjustment workflows
- Result Collection: Collect and flatten results from nested processing operations (e.g., collect nested processing results, flatten operation outputs, aggregate nested operation data), enabling result collection workflows
Connecting to Other Blocks¶
This block receives nested batch data and produces flattened batch data:
- After blocks that create nested batches (crop blocks, classification on crops, OCR on crops) to flatten nested results (e.g., crop then classify then flatten, OCR crops then flatten, process nested batches then collapse), enabling nested-to-flat workflows
- Before blocks requiring single-level batches to provide flattened data (e.g., flatten before filtering, collapse before storage, aggregate before analysis), enabling flat-to-processing workflows
- Before data storage blocks to store aggregated flattened results (e.g., store aggregated classifications, save flattened OCR results, log collapsed batch data), enabling aggregation-to-storage workflows
- Before analytics blocks to analyze aggregated results (e.g., analyze aggregated classifications, perform analytics on flattened data, process collapsed batches), enabling aggregation-to-analytics workflows
- Before filtering blocks to filter flattened aggregated data (e.g., filter aggregated results, apply filters to collapsed batches, process flattened data), enabling aggregation-to-filter workflows
- In workflow outputs to provide aggregated flattened results as final output (e.g., aggregated classification outputs, flattened OCR outputs, collapsed batch outputs), enabling aggregation output workflows
Requirements¶
This block requires batch data at dimensionality level n (nested batch structure). The block automatically handles batch casting for the input parameter. The block reduces output dimensionality by 1 level (from level n to level n-1). All elements from the nested structure are preserved and flattened into a single list. The block works with any data type - it simply flattens the nested list structure without modifying individual elements. The output is a single-level batch containing all elements from the nested input, ordered sequentially as they appear in the nested structure.
Type identifier¶
Use the following identifier in step "type" field: roboflow_core/dimension_collapse@v1to add the block as
as step in your workflow.
Properties¶
| Name | Type | Description | Refs |
|---|---|---|---|
name |
str |
Enter a unique identifier for this step.. | ❌ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to Dimension Collapse in version v1.
- inputs:
Contrast Equalization,Clip Comparison,Detections Transformation,VLM as Detector,Polygon Visualization,Image Blur,SIFT Comparison,First Non Empty Or Default,Text Display,SIFT,Moondream2,Qwen3-VL,Google Vision OCR,Pixelate Visualization,Time in Zone,VLM as Classifier,Detection Offset,Detections Filter,Instance Segmentation Model,Perspective Correction,Halo Visualization,Image Threshold,Path Deviation,Keypoint Detection Model,CSV Formatter,Florence-2 Model,Detections Stabilizer,Twilio SMS Notification,Image Convert Grayscale,Perception Encoder Embedding Model,Corner Visualization,Dynamic Zone,Identify Changes,Icon Visualization,Expression,SAM 3,Qwen2.5-VL,Detections Consensus,Multi-Label Classification Model,Detections Stitch,Dynamic Crop,QR Code Detection,Continue If,Bounding Box Visualization,YOLO-World Model,Detection Event Log,Detections Classes Replacement,Blur Visualization,Camera Calibration,Line Counter,Dominant Color,Path Deviation,OpenAI,Camera Focus,CogVLM,Trace Visualization,Image Slicer,Absolute Static Crop,Dot Visualization,Label Visualization,Slack Notification,Google Gemini,Object Detection Model,LMM For Classification,Stitch OCR Detections,OpenAI,Classification Label Visualization,Stitch OCR Detections,Byte Tracker,Velocity,Twilio SMS/MMS Notification,Anthropic Claude,Clip Comparison,Gaze Detection,VLM as Detector,Webhook Sink,Llama 3.2 Vision,SIFT Comparison,Anthropic Claude,Delta Filter,Time in Zone,Local File Sink,QR Code Generator,SmolVLM2,Email Notification,CLIP Embedding Model,Roboflow Dataset Upload,Motion Detection,Model Comparison Visualization,Camera Focus,PTZ Tracking (ONVIF).md),LMM,Byte Tracker,Single-Label Classification Model,Mask Visualization,Anthropic Claude,Relative Static Crop,Cosine Similarity,SAM 3,Detections Merge,Object Detection Model,Keypoint Detection Model,Circle Visualization,Seg Preview,Property Definition,EasyOCR,Stability AI Inpainting,Multi-Label Classification Model,Reference Path Visualization,Time in Zone,Detections Combine,Crop Visualization,Ellipse Visualization,Overlap Filter,Line Counter,Image Preprocessing,Barcode Detection,Environment Secrets Store,Detections List Roll-Up,Background Subtraction,Segment Anything 2 Model,Image Contours,Image Slicer,Cache Set,Depth Estimation,Pixel Color Count,Stitch Images,VLM as Classifier,Cache Get,Model Monitoring Inference Aggregator,Instance Segmentation Model,Line Counter Visualization,Morphological Transformation,Polygon Zone Visualization,Single-Label Classification Model,Email Notification,OCR Model,Distance Measurement,Roboflow Custom Metadata,Google Gemini,Keypoint Visualization,OpenAI,Size Measurement,Color Visualization,Data Aggregator,Byte Tracker,Identify Outliers,Buffer,Florence-2 Model,Google Gemini,JSON Parser,Grid Visualization,Rate Limiter,OpenAI,Template Matching,Dimension Collapse,Bounding Rectangle,Background Color Visualization,Roboflow Dataset Upload,SAM 3,Stability AI Outpainting,Triangle Visualization,Stability AI Image Generation - outputs:
Llama 3.2 Vision,Clip Comparison,Anthropic Claude,VLM as Detector,Time in Zone,Polygon Visualization,Email Notification,Roboflow Dataset Upload,Motion Detection,SAM 3,Mask Visualization,Anthropic Claude,Object Detection Model,Keypoint Detection Model,Circle Visualization,Seg Preview,Time in Zone,VLM as Classifier,Reference Path Visualization,Time in Zone,Instance Segmentation Model,Perspective Correction,Halo Visualization,Crop Visualization,Ellipse Visualization,Path Deviation,Keypoint Detection Model,Florence-2 Model,Corner Visualization,Line Counter,Detections List Roll-Up,SAM 3,Detections Consensus,Cache Set,VLM as Classifier,Bounding Box Visualization,YOLO-World Model,Instance Segmentation Model,Line Counter Visualization,Polygon Zone Visualization,Line Counter,Email Notification,Keypoint Visualization,Path Deviation,Google Gemini,OpenAI,OpenAI,Trace Visualization,Size Measurement,Color Visualization,Dot Visualization,Label Visualization,Buffer,Florence-2 Model,Google Gemini,Google Gemini,Grid Visualization,Object Detection Model,LMM For Classification,OpenAI,Classification Label Visualization,SAM 3,Roboflow Dataset Upload,Twilio SMS/MMS Notification,Anthropic Claude,Clip Comparison,Triangle Visualization,VLM as Detector,Webhook Sink
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
Dimension Collapse in version v1 has.
Bindings
-
input
data(*): Reference to step outputs at dimensionality level n (nested batch structure) to be flattened and collapsed to level n-1. The input should be a nested batch (e.g., list of lists) where each nested level represents a batch dimension. The block flattens this structure by concatenating all nested elements into a single flat list. Common use cases: classification results from cropped images (level 2 → level 1), OCR results from cropped regions (level 2 → level 1), or any nested batch structure that needs to be flattened..
-
output
output(list_of_values): List of values of any type.
Example JSON definition of step Dimension Collapse in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/dimension_collapse@v1",
"data": "$steps.classification_step.predictions"
}