Workflow Blocks
Models
Object Detection Model
Predict the location of objects with bounding boxes.
Instance Segmentation Model
Predict the shape, size, and location of objects.
CLIP Embedding Model
Generate an embedding of an image or string.
Single-Label Classification Model
Apply a single tag to an image.
Multi-Label Classification Model
Apply multiple tags to an image.
Keypoint Detection Model
Predict skeletons on objects.
Anthropic Claude
Run Anthropic Claude model with vision capabilities.
Google Gemini
Run Google's Gemini model with vision capabilities.
OpenAI
Run OpenAI's GPT-4 with vision capabilities.
Florence-2 Model
Run Florence-2 on an image
YOLO-World Model
Run a zero-shot object detection model.
CogVLM
Run a self-hosted vision language model.
Segment Anything 2 Model
Convert bounding boxes to polygons, or run SAM2 on an entire image to generate a mask.
Clip Comparison
Compare CLIP image and text embeddings.
OCR Model
Extract text from an image using optical character recognition.
Barcode Detection
Detect and read barcodes in an image.
QR Code Detection
Detect and read QR codes in an image.
Gaze Detection
Detect faces and estimate gaze direction
Stability AI Inpainting
Use segmentation masks to inpaint objects within an image.
Google Vision OCR
Detect text in images using Google Vision API
LMM
Run a large multimodal model such as ChatGPT-4v or CogVLM.
LMM For Classification
Run a large multimodal model such as ChatGPT-4v or CogVLM for classification.
Stability AI Image Generation
generate new images from text, or create variations of existing images.
Llama 3.2 Vision
Run Llama model with Vision capabilities
Visualizations
Bounding Box Visualization
Draw a box around detected objects in an image.
Polygon Visualization
Draw a polygon around detected objects in an image.
Label Visualization
Draw labels on an image at specific coordinates based on provided detections.
Classification Label Visualization
Visualize both single-label and multi-label classification predictions with customizable display options.
Background Color Visualization
Apply a mask to cover all areas outside the detected regions in an image.
Blur Visualization
Blur detected objects in an image.
Circle Visualization
Draw a circle around detected objects in an image.
Color Visualization
Paint a solid color on detected objects in an image.
Corner Visualization
Draw the corners of detected objects in an image.
Crop Visualization
Draw scaled up crops of detections on the scene.
Ellipse Visualization
Draw ellipses that highlight detected objects in an image.
Halo Visualization
Paint a halo around detected objects in an image.
Mask Visualization
Apply a mask over detected objects in an image.
Pixelate Visualization
Pixelate detected objects in an image.
Triangle Visualization
Draw triangle markers on an image at specific coordinates based on provided detections.
Line Counter Visualization
Apply a mask over a line zone in an image.
Polygon Zone Visualization
Apply a mask over a polygon zone in an image.
Model Comparison Visualization
Visualize the difference between two models' detections.
Trace Visualization
Draw traces based on detections tracking results.
Reference Path Visualization
Draw a reference path in the image.
Keypoint Visualization
Draw keypoints on detected objects in an image.
Grid Visualization
Shows an array of images in a grid.
Logic and branching
Continue If
Conditionally stop execution of a branch.
Detections Filter
Conditionally filter out model predictions.
Rate Limiter
Limits the rate at which a branch of the Workflow will run.
Delta Filter
Allow the execution of workflow to proceed if the input value has changed.
Detections Consensus
Combine predictions from multiple detections models to make a decision about object presence.
Data storage
Roboflow Dataset Upload
Save images and predictions to your Roboflow Dataset.
Webhook Sink
Send a request to a remote API with Workflow results.
CSV Formatter
Create CSV files with specified columns.
Local File Sink
Save data to a local file.
Data Aggregator
Aggregate workflow data to produce time-based statistics.
Notifications
Transformations
Dynamic Crop
Crop an image using bounding boxes from a detection model.
Absolute Static Crop
Crop an image using fixed pixel coordinates.
Relative Static Crop
Crop an image proportional (%) to its dimensions.
Detection Offset
Apply a padding around the width and height of detections.
Detections Transformation
Apply transformations on detected bounding boxes.
Bounding Rectangle
Find the minimal bounding box surrounding the detected polygon.
Stitch Images
Stitch two images by common parts.
Classical computer vision
Image Preprocessing
Resize, flip, or rotate an image.
Template Matching
Locate instances of a given template within a specified image.
Dominant Color
Get the dominant color of an image in RGB format.
Dot Visualization
Draw dots on an image at specific coordinates based on provided detections.
Pixel Color Count
Count the number of pixels that match a specific color within a given tolerance.
SIFT Comparison
Compare SIFT descriptors from multiple images.
Image Contours
Find and count the contours on an image.
SIFT
Apply SIFT to an image.
Image Blur
Apply a blur to an image.
Image Threshold
Apply a threshold to an image.
Image Convert Grayscale
Convert an RGB image to grayscale.
Camera Focus
Calculate a score to indicate how well-focused a camera is.
Size Measurement
Measure the dimensions of objects in relation to a reference object.
Distance Measurement
Calculate the distance between two bounding boxes on a 2D plane.
Video
Byte Tracker
Track and update object positions across video frames using ByteTrack.
Time in Zone
Track object time in zone.
Line Counter
Count detections passing a line.
Path Deviation
Calculate Fréchet distance of object from the reference path.
Detections Stabilizer
Apply a smoothing algorithm to reduce noise and flickering across video frames.
Buffer
Returns an array of the last `length` values passed to it.
Identify Outliers
Identify outlier embeddings compared to prior data.
Identify Changes
Identify changes compared to prior data via embeddings.
Advanced
Property Definition
Define a variable from model predictions, such as the class names, confidences, or number of detections.
Expression
Create a specific output based on defined input variables and configured rules.
Perspective Correction
Adjust detection coordinates from a polygon-defined plane to a straight rectangular plane with specified width and height.
Stitch OCR Detections
Combines OCR detection results into a coherent text string by organizing detections spatially.
Dynamic Zone
Simplify polygons so they are geometrically convex and contain only the requested amount of vertices.
Cosine Similarity
Calculate the cosine similarity between two embeddings.
Detections Classes Replacement
Replace classes of detections with classes predicted by a chained classification model.
JSON Parser
Parse raw string into JSON.
VLM as Classifier
Parse a raw string into a classification prediction.
VLM as Detector
Parses raw string into object-detection prediction.
Dimension Collapse
Collapse dimensionality by aggregating nested data into a single list.
First Non Empty Or Default
Take the first non-empty data element or the configured default value.
Roboflow Custom Metadata
Add custom metadata to the Roboflow Model Monitoring dashboard.
Model Monitoring Inference Aggregator
Periodically report an aggregated sample of inference results to Roboflow Model Monitoring.
Image Slicer
Tile the input image into a list of smaller images to perform small object detection.
Detections Stitch
Merges detections made against multiple pieces of input image into single detection.
Cache Get
Fetches a previously stored value from a cache entry.
Cache Set
Stores a value in a cache entry for later retrieval.
Environment Secrets Store
Fetch secrets from environmental variables.