Skip to content

Identify Outliers

Identify outlier embeddings compared to prior data.

This block accepts an embedding and compares it to a sample of prior data. If the embedding is an outlier, the block will return a boolean flag and the percentile of the embedding.

Type identifier

Use the following identifier in step "type" field: roboflow_core/identify_outliers@v1to add the block as as step in your workflow.

Properties

Name Type Description Refs
name str Unique name of step in workflows.
threshold_percentile float The desired sensitivity. A higher value will result in more data points being classified as outliers..
warmup int The number of data points to use for the initial average calculation. No outliers are identified during this period..
window_size int The number of previous data points to consider in the sliding window algorithm..

The Refs column marks possibility to parametrise the property with dynamic values available in workflow runtime. See Bindings for more info.

Available Connections

Check what blocks you can connect to Identify Outliers in version v1.

Input and Output Bindings

The available connections depend on its binding kinds. Check what binding kinds Identify Outliers in version v1 has.

Bindings
  • input

    • embedding (embedding): Embedding of the current data..
    • threshold_percentile (float_zero_to_one): The desired sensitivity. A higher value will result in more data points being classified as outliers..
    • warmup (integer): The number of data points to use for the initial average calculation. No outliers are identified during this period..
    • window_size (integer): The number of previous data points to consider in the sliding window algorithm..
  • output

Example JSON definition of step Identify Outliers in version v1
{
    "name": "<your_step_name_here>",
    "type": "roboflow_core/identify_outliers@v1",
    "embedding": "$steps.clip.embedding",
    "threshold_percentile": "$inputs.sample_rate",
    "warmup": 100,
    "window_size": 5
}