Roboflow Inference

Index

roboflow/inference

Workflow Blocks

Models

Object Detection Model

Predict the location of objects with bounding boxes.

Instance Segmentation Model

Predict the shape, size, and location of objects.

Single-Label Classification Model

Apply a single tag to an image.

Multi-Label Classification Model

Apply multiple tags to an image.

Keypoint Detection Model

Predict skeletons on objects.

Anthropic Claude

Run Anthropic Claude model with vision capabilities.

Run Google's Gemini model with vision capabilities.

Run OpenAI's GPT-4 with vision capabilities.

Depth Estimation

Run Depth Estimation on an image.

Florence-2 Model

Run Florence-2 on an image

Run Qwen2.5-VL on an image.

Run SmolVLM2 on an image.

Run Moondream2 on an image.

YOLO-World Model

Run a zero-shot object detection model.

DEPRECATED! Run a self-hosted vision language model.

Segment Anything 2 Model

Convert bounding boxes to polygons, or run SAM2 on an entire image to generate a mask.

CLIP Embedding Model

Generate an embedding of an image or string.

Perception Encoder Embedding Model

Generate an embedding of an image or string.

Clip Comparison

Compare CLIP image and text embeddings.

Extract text from an image using optical character recognition.

Barcode Detection

Detect and read barcodes in an image.

QR Code Detection

Detect and read QR codes in an image.

Detect faces and estimate gaze direction

Stability AI Inpainting

Use segmentation masks to inpaint objects within an image.

Stability AI Outpainting

Use object detection bounding box to crop the image and to outpaint within given directions.

Google Vision OCR

Detect text in images using Google Vision API

Run a large multimodal model such as ChatGPT-4v.

LMM For Classification

Run a large multimodal model such as ChatGPT-4v for classification.

Stability AI Image Generation

generate new images from text, or create variations of existing images.

Llama 3.2 Vision

Run Llama model with Vision capabilities

Visualizations

Bounding Box Visualization

Draw a box around detected objects in an image.

Dot Visualization

Draw dots on an image at specific coordinates based on provided detections.

Polygon Visualization

Draw a polygon around detected objects in an image.

Label Visualization

Draw labels on an image at specific coordinates based on provided detections.

Classification Label Visualization

Visualize both single-label and multi-label classification predictions with customizable display options.

Background Color Visualization

Apply a mask to cover all areas outside the detected regions in an image.

Blur Visualization

Blur detected objects in an image.

Circle Visualization

Draw a circle around detected objects in an image.

Color Visualization

Paint a solid color on detected objects in an image.

Corner Visualization

Draw the corners of detected objects in an image.

Crop Visualization

Draw scaled up crops of detections on the scene.

Ellipse Visualization

Draw ellipses that highlight detected objects in an image.

Halo Visualization

Paint a halo around detected objects in an image.

Mask Visualization

Apply a mask over detected objects in an image.

Pixelate Visualization

Pixelate detected objects in an image.

Triangle Visualization

Draw triangle markers on an image at specific coordinates based on provided detections.

Line Counter Visualization

Apply a mask over a line zone in an image.

Polygon Zone Visualization

Apply a mask over a polygon zone in an image.

Model Comparison Visualization

Visualize the difference between two models' detections.

Trace Visualization

Draw traces based on detections tracking results.

Reference Path Visualization

Draw a reference path in the image.

Keypoint Visualization

Draw keypoints on detected objects in an image.

Grid Visualization

Shows an array of images in a grid.

Logic and branching

Conditionally stop execution of a branch.

Detections Filter

Conditionally filter out model predictions.

Limits the rate at which a branch of the Workflow will run.

Allow the execution of workflow to proceed if the input value has changed.

Detections Consensus

Combine predictions from multiple detections models to make a decision about object presence.

Data storage

Roboflow Dataset Upload

Save images and predictions to your Roboflow Dataset.

Send a request to a remote API with Workflow results.

Create CSV files with specified columns.

Local File Sink

Save data to a local file.

Data Aggregator

Aggregate workflow data to produce time-based statistics.

Notifications

Email Notification

Send notification via e-mail.

Slack Notification

Send notification via Slack.

Twilio SMS Notification

Send notification via Twilio SMS service.

Transformations

Crop an image using bounding boxes from a detection model.

Absolute Static Crop

Crop an image using fixed pixel coordinates.

Camera Calibration

Remove camera lens distortions from an image using a calibration table.

Relative Static Crop

Crop an image proportional (%) to its dimensions.

Detection Offset

Apply a padding around the width and height of detections.

Detections Transformation

Apply transformations on detected bounding boxes.

Bounding Rectangle

Find the minimal bounding box surrounding the detected polygon.

Detections Merge

Merge multiple detections into a single bounding box.

Stitch two images by common parts.

Classical computer vision

Image Preprocessing

Resize, flip, or rotate an image.

Template Matching

Locate instances of a given template within a specified image.

Get the dominant color of an image in RGB format.

Pixel Color Count

Count the number of pixels that match a specific color within a given tolerance.

SIFT Comparison

Compare SIFT descriptors from multiple images.

Find and count the contours on an image.

Apply SIFT to an image.

Apply a blur to an image.

Image Threshold

Apply a threshold to an image.

Image Convert Grayscale

Convert an RGB image to grayscale.

Calculate a score to indicate how well-focused a camera is.

Size Measurement

Measure the dimensions of objects in relation to a reference object.

Distance Measurement

Calculate the distance between two bounding boxes on a 2D plane.

Video

Track and update object positions across video frames using ByteTrack.

Track object time in zone.

PTZ Tracking (ONVIF)

Control an ONVIF compatible PTZ camera to follow an object

Count detections passing a line.

Calculate the velocity and speed of tracked objects with smoothing and unit conversion.

Calculate Fréchet distance of object from the reference path.

Detections Stabilizer

Apply a smoothing algorithm to reduce noise and flickering across video frames.

Returns an array of the last `length` values passed to it.

Identify Outliers

Identify outlier embeddings compared to prior data.

Identify Changes

Identify changes compared to prior data via embeddings.

Advanced

Property Definition

Define a variable from model predictions, such as the class names, confidences, or number of detections.

Create a specific output based on defined input variables and configured rules.

Perspective Correction

Adjust detection coordinates from a polygon-defined plane to a straight rectangular plane with specified width and height.

Stitch OCR Detections

Combines OCR detection results into a coherent text string by organizing detections spatially.

Simplify polygons so they are geometrically convex and contain only the requested amount of vertices.

Cosine Similarity

Calculate the cosine similarity between two embeddings.

Detections Classes Replacement

Replace classes of detections with classes predicted by a chained classification model.

Parse raw string into JSON.

VLM as Classifier

Parse a raw string into a classification prediction.

VLM as Detector

Parses raw string into object-detection prediction.

Dimension Collapse

Collapse dimensionality by aggregating nested data into a single list.

First Non Empty Or Default

Take the first non-empty data element or the configured default value.

Roboflow Custom Metadata

Add custom metadata to the Roboflow Model Monitoring dashboard.

Model Monitoring Inference Aggregator

Periodically report an aggregated sample of inference results to Roboflow Model Monitoring.

Tile the input image into a list of smaller images to perform small object detection.

Detections Stitch

Merges detections made against multiple pieces of input image into single detection.

Fetches a previously stored value from a cache entry.

Stores a value in a cache entry for later retrieval.

Environment Secrets Store

Fetch secrets from environmental variables.