Workflow Blocks

Models

Object Detection Model
Predict the location of objects with bounding boxes.
Instance Segmentation Model
Predict the shape, size, and location of objects.
CLIP Embedding Model
Generate an embedding of an image or string.
Single-Label Classification Model
Apply a single tag to an image.
Multi-Label Classification Model
Apply multiple tags to an image.
Keypoint Detection Model
Predict skeletons on objects.
Anthropic Claude
Run Anthropic's Claude model with vision capabilities.
Google Gemini
Run Google's Gemini model with vision capabilities.
OpenAI
Run OpenAI's GPT-4 with vision capabilities.
Florence-2 Model
Run Florence-2 on an image.
YOLO-World Model
Run a zero-shot object detection model.
CogVLM
Run a self-hosted vision language model.
Segment Anything 2 Model
Convert bounding boxes to polygons, or run SAM2 on an entire image to generate a mask.
Clip Comparison
Compare CLIP image and text embeddings.
OCR Model
Extract text from an image using optical character recognition.
Barcode Detection
Detect and read barcodes in an image.
QR Code Detection
Detect and read QR codes in an image.
Gaze Detection
Detect faces and estimate gaze direction.
Stability AI Inpainting
Use segmentation masks to inpaint objects within an image.
Google Vision OCR
Detect text in images using the Google Vision API.
LMM
Run a large multimodal model such as GPT-4V or CogVLM.
LMM For Classification
Run a large multimodal model such as GPT-4V or CogVLM for classification.
Stability AI Image Generation
Generate new images from text, or create variations of existing images.
Llama 3.2 Vision
Run a Llama 3.2 model with vision capabilities.

Visualizations

Bounding Box Visualization
Draw a box around detected objects in an image.
Polygon Visualization
Draw a polygon around detected objects in an image.
Label Visualization
Draw labels on an image at specific coordinates based on provided detections.
Classification Label Visualization
Visualize both single-label and multi-label classification predictions with customizable display options.
Background Color Visualization
Apply a mask to cover all areas outside the detected regions in an image.
Blur Visualization
Blur detected objects in an image.
Circle Visualization
Draw a circle around detected objects in an image.
Color Visualization
Paint a solid color on detected objects in an image.
Corner Visualization
Draw the corners of detected objects in an image.
Crop Visualization
Draw scaled-up crops of detections on the scene.
Ellipse Visualization
Draw ellipses that highlight detected objects in an image.
Halo Visualization
Paint a halo around detected objects in an image.
Mask Visualization
Apply a mask over detected objects in an image.
Pixelate Visualization
Pixelate detected objects in an image.
Triangle Visualization
Draw triangle markers on an image at specific coordinates based on provided detections.
Line Counter Visualization
Apply a mask over a line zone in an image.
Polygon Zone Visualization
Apply a mask over a polygon zone in an image.
Model Comparison Visualization
Visualize the difference between two models' detections.
Trace Visualization
Draw traces based on detection tracking results.
Reference Path Visualization
Draw a reference path in the image.
Keypoint Visualization
Draw keypoints on detected objects in an image.
Grid Visualization
Show an array of images in a grid.

Advanced

Property Definition
Define a variable from model predictions, such as the class names, confidences, or number of detections.
Expression
Create a specific output based on defined input variables and configured rules.
Perspective Correction
Adjust detection coordinates from a polygon-defined plane to a straight rectangular plane with specified width and height.
Stitch OCR Detections
Combine OCR detection results into a coherent text string by organizing detections spatially.
Dynamic Zone
Simplify polygons so they are geometrically convex and contain only the requested number of vertices.
Cosine Similarity
Calculate the cosine similarity between two embeddings.
Detections Classes Replacement
Replace classes of detections with classes predicted by a chained classification model.
JSON Parser
Parse a raw string into JSON.
VLM as Classifier
Parse a raw string into a classification prediction.
VLM as Detector
Parse a raw string into an object detection prediction.
Dimension Collapse
Collapse dimensionality by aggregating nested data into a single list.
First Non Empty Or Default
Take the first non-empty data element or the configured default value.
Roboflow Custom Metadata
Add custom metadata to the Roboflow Model Monitoring dashboard.
Model Monitoring Inference Aggregator
Periodically report an aggregated sample of inference results to Roboflow Model Monitoring.
Image Slicer
Tile the input image into a list of smaller images to perform small object detection.
Detections Stitch
Merge detections made on multiple slices of an input image into a single set of detections.
Cache Get
Fetch a previously stored value from a cache entry.
Cache Set
Store a value in a cache entry for later retrieval.
Environment Secrets Store
Fetch secrets from environment variables.
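
As an illustration of the Cosine Similarity block listed above, the computation it describes can be sketched in plain Python. This is a minimal sketch of the standard formula (dot product divided by the product of the vector norms), not the block's actual implementation; the function name `cosine_similarity` and the choice to raise an error on zero-length vectors are assumptions made for this example.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Return the cosine of the angle between two embeddings.

    Hypothetical helper for illustration only; it is not part of any
    library API mentioned in this document.
    """
    if len(a) != len(b):
        raise ValueError("embeddings must have the same length")

    # Dot product of the two vectors.
    dot = sum(x * y for x, y in zip(a, b))

    # Euclidean (L2) norm of each vector.
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))

    # Assumption: treat a zero vector as an error, since the
    # similarity is undefined when either norm is zero.
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")

    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    # Toy 3-dimensional vectors standing in for real embeddings.
    image_embedding = [0.2, 0.7, 0.1]
    text_embedding = [0.1, 0.9, 0.0]
    print(cosine_similarity(image_embedding, text_embedding))
```

Values close to 1 indicate embeddings pointing in nearly the same direction, values near 0 indicate unrelated embeddings, and values near -1 indicate opposite directions. The same measure is commonly used to compare CLIP image and text embeddings.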