Anthropic Claude¶
v3¶
Class: AnthropicClaudeBlockV3 (there are multiple versions of this block)
Source: inference.core.workflows.core_steps.models.foundation.anthropic_claude.v3.AnthropicClaudeBlockV3
Warning: This block has multiple versions. Please refer to the specific version for details. You can learn more about how versions work here: Versioning
Ask a question to Anthropic Claude model with vision capabilities.
You can specify arbitrary text prompts or predefined ones, the block supports the following types of prompt:
-
Open Prompt (
unconstrained) - Use any prompt to generate a raw response -
Text Recognition (OCR) (
ocr) - Model recognizes text in the image -
Visual Question Answering (
visual-question-answering) - Model answers the question you submit in the prompt -
Captioning (short) (
caption) - Model provides a short description of the image -
Captioning (
detailed-caption) - Model provides a long description of the image -
Single-Label Classification (
classification) - Model classifies the image content as one of the provided classes -
Multi-Label Classification (
multi-label-classification) - Model classifies the image content as one or more of the provided classes -
Unprompted Object Detection (
object-detection) - Model detects and returns the bounding boxes for prominent objects in the image -
Structured Output Generation (
structured-answering) - Model returns a JSON response with the specified fields
API Key Options¶
This block supports two API key modes:
- Roboflow Managed API Key (Default) - Use
rf_key:accountto proxy requests through Roboflow's API: - Simplified setup - no Anthropic API key required
- Secure - your workflow API key is used for authentication
-
Usage-based billing - charged per token based on the model used
-
Custom Anthropic API Key - Provide your own Anthropic API key:
- Full control over API usage
- You pay Anthropic directly
Type identifier¶
Use the following identifier in step "type" field: roboflow_core/anthropic_claude@v3to add the block as
as step in your workflow.
Properties¶
| Name | Type | Description | Refs |
|---|---|---|---|
name |
str |
Enter a unique identifier for this step.. | ❌ |
task_type |
str |
Task type to be performed by model. Value determines required parameters and output response.. | ❌ |
prompt |
str |
Text prompt to the Claude model. | ✅ |
output_structure |
Dict[str, str] |
Dictionary with structure of expected JSON response. | ❌ |
classes |
List[str] |
List of classes to be used. | ✅ |
api_key |
str |
Your Anthropic API key or 'rf_key:account' to use Roboflow's managed API key. | ✅ |
model_version |
str |
Model to be used. | ✅ |
extended_thinking |
bool |
Enable extended thinking for deeper reasoning on complex tasks. Note: temperature cannot be used when extended thinking is enabled.. | ❌ |
thinking_budget_tokens |
int |
Maximum number of tokens for internal thinking when extended thinking is enabled. Higher values allow deeper reasoning but increase latency and cost. Must be less than max_tokens. Minimum: 1024.. | ❌ |
max_tokens |
int |
Maximum number of tokens the model can generate in its response.. | ❌ |
temperature |
float |
Temperature to sample from the model - value in range 0.0-1.0, the higher - the more random / "creative" the generations are. Cannot be used when extended_thinking is enabled.. | ✅ |
max_image_size |
int |
Maximum size of the image - if input has larger side, it will be downscaled, keeping aspect ratio. | ✅ |
max_concurrent_requests |
int |
Number of concurrent requests that can be executed by block when batch of input images provided. If not given - block defaults to value configured globally in Workflows Execution Engine. Please restrict if you hit Anthropic API limits.. | ❌ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow runtime. See Bindings for more info.
Runtime compatibility¶
-
requires_internet— air-gapped / offline deployments - This block depends on a service that is not reachable from fully offline / air-gapped deployments.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to Anthropic Claude in version v3.
- inputs:
Roboflow Asset Library Attributes,MoonshotAI Kimi,Image Blur,Reference Path Visualization,SIFT Comparison,Event Writer,Slack Notification,Halo Visualization,VLM As Classifier,Image Stack,Google Gemma,Qwen 3.6 API,Clip Comparison,Dot Visualization,Label Visualization,Background Color Visualization,Llama 3.2 Vision,Email Notification,Pixelate Visualization,OpenAI-Compatible LLM,Google Gemini,Anthropic Claude,OpenAI,Trace Visualization,Llama 3.2 Vision,Detection Event Log,Clip Comparison,Camera Focus,OpenAI,GLM-OCR,PLC ModbusTCP,Buffer,MQTT Writer,SIFT Comparison,CSV Formatter,Webhook Sink,Image Contours,Local File Sink,Motion Detection,Google Gemini,MoonshotAI Kimi,Polygon Visualization,Dimension Collapse,SIFT,Classification Label Visualization,Multi-Label Classification Model,Keypoint Detection Model,Keypoint Visualization,Template Matching,Icon Visualization,Dynamic Crop,Stability AI Inpainting,Bounding Box Visualization,Polygon Zone Visualization,Stability AI Outpainting,Crop Visualization,Image Convert Grayscale,Mask Visualization,Halo Visualization,Distance Measurement,PLC EthernetIP,Text Display,Morphological Transformation,Anthropic Claude,Roboflow Dataset Upload,Object Detection Model,Ellipse Visualization,Size Measurement,Circle Visualization,Twilio SMS Notification,Email Notification,S3 Sink,Camera Focus,Identify Changes,Image Slicer,LMM For Classification,OCR Model,Heatmap Visualization,OpenAI,Google Gemma API,Stitch Images,Morphological Transformation,EasyOCR,Current Time,Blur Visualization,Stitch OCR Detections,Detections List Roll-Up,Florence-2 Model,Google Gemini,Corner Visualization,OpenRouter,Pixel Color Count,Model Comparison Visualization,Model Monitoring Inference Aggregator,Google Vision OCR,Image Threshold,LMM,Single-Label Classification Model,Polygon Visualization,Stability AI Image Generation,Line Counter Visualization,Line Counter,CogVLM,Relative Static Crop,Qwen3.5-VL,Grid Visualization,Image Preprocessing,Stitch OCR Detections,Gaze Detection,Anthropic Claude,OPC UA Writer Sink,Color Visualization,Dynamic Zone,Triangle Visualization,QR Code Generator,Contrast Enhancement,Roboflow Dataset Upload,Absolute Static Crop,Qwen 3.5 API,Background Subtraction,OpenAI,Image Slicer,Qwen-VL,Florence-2 Model,Perspective Correction,Twilio SMS/MMS Notification,Roboflow Vision Events,Microsoft SQL Server Sink,Cosine Similarity,Instance Segmentation Model,Depth Estimation,Roboflow Custom Metadata,Contrast Equalization,Camera Calibration,VLM As Detector,Line Counter - outputs:
Cache Set,MoonshotAI Kimi,Roboflow Asset Library Attributes,Path Deviation,Image Blur,Keypoint Detection Model,Reference Path Visualization,PTZ Tracking (ONVIF),Event Writer,Slack Notification,Halo Visualization,CLIP Embedding Model,VLM As Classifier,Google Gemma,Qwen 3.6 API,Clip Comparison,Object Detection Model,Dot Visualization,Label Visualization,Background Color Visualization,Llama 3.2 Vision,Email Notification,OpenAI-Compatible LLM,Google Gemini,JSON Parser,Anthropic Claude,Cache Get,OpenAI,Trace Visualization,Llama 3.2 Vision,OpenAI,Clip Comparison,GLM-OCR,Buffer,MQTT Writer,Webhook Sink,SIFT Comparison,Local File Sink,Motion Detection,Google Gemini,MoonshotAI Kimi,Polygon Visualization,Classification Label Visualization,Instance Segmentation Model,Keypoint Detection Model,Keypoint Visualization,Instance Segmentation Model,Icon Visualization,Seg Preview,Dynamic Crop,Stability AI Inpainting,Bounding Box Visualization,Polygon Zone Visualization,Stability AI Outpainting,Multi-Label Classification Model,Crop Visualization,Mask Visualization,Halo Visualization,Detections Stitch,Distance Measurement,PLC EthernetIP,Anthropic Claude,Morphological Transformation,Text Display,VLM As Classifier,Roboflow Dataset Upload,VLM As Detector,Detections Consensus,Object Detection Model,Ellipse Visualization,Keypoint Detection Model,SAM3 Video Tracker,Time in Zone,SAM 3,Size Measurement,Circle Visualization,Semantic Segmentation Model,Path Deviation,Twilio SMS Notification,Email Notification,S3 Sink,SAM 3,LMM For Classification,Heatmap Visualization,Google Gemma API,OpenAI,Time in Zone,Morphological Transformation,Single-Label Classification Model,YOLO-World Model,Current Time,Stitch OCR Detections,Moondream2,Detections List Roll-Up,Florence-2 Model,Google Gemini,Corner Visualization,OpenRouter,Pixel Color Count,Model Comparison Visualization,SAM 3,Model Monitoring Inference Aggregator,Google Vision OCR,Image Threshold,Instance Segmentation Model,LMM,Polygon Visualization,Segment Anything 2 Model,Time in Zone,Stability AI Image Generation,Line Counter Visualization,Line Counter,CogVLM,Qwen3.5-VL,Grid Visualization,Image Preprocessing,Stitch OCR Detections,Anthropic Claude,OPC UA Writer Sink,Color Visualization,Triangle Visualization,QR Code Generator,Qwen 3.5 API,Roboflow Dataset Upload,OpenAI,Qwen-VL,Florence-2 Model,Perspective Correction,Roboflow Vision Events,Microsoft SQL Server Sink,Twilio SMS/MMS Notification,Perception Encoder Embedding Model,Instance Segmentation Model,Depth Estimation,Roboflow Custom Metadata,Contrast Equalization,Detections Classes Replacement,VLM As Detector,Line Counter,Object Detection Model
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
Anthropic Claude in version v3 has.
Bindings
-
input
images(image): The image to infer on..prompt(string): Text prompt to the Claude model.classes(list_of_values): List of classes to be used.api_key(Union[secret,string,ROBOFLOW_MANAGED_KEY]): Your Anthropic API key or 'rf_key:account' to use Roboflow's managed API key.model_version(string): Model to be used.temperature(float): Temperature to sample from the model - value in range 0.0-1.0, the higher - the more random / "creative" the generations are. Cannot be used when extended_thinking is enabled..max_image_size(integer): Maximum size of the image - if input has larger side, it will be downscaled, keeping aspect ratio.
-
output
output(Union[string,language_model_output]): String value ifstringor LLM / VLM output iflanguage_model_output.classes(list_of_values): List of values of any type.
Example JSON definition of step Anthropic Claude in version v3
{
"name": "<your_step_name_here>",
"type": "roboflow_core/anthropic_claude@v3",
"images": "$inputs.image",
"task_type": "<block_does_not_provide_example>",
"prompt": "my prompt",
"output_structure": {
"my_key": "description"
},
"classes": [
"class-a",
"class-b"
],
"api_key": "rf_key:account",
"model_version": "claude-sonnet-4-5",
"extended_thinking": "<block_does_not_provide_example>",
"thinking_budget_tokens": "<block_does_not_provide_example>",
"max_tokens": "<block_does_not_provide_example>",
"temperature": "<block_does_not_provide_example>",
"max_image_size": "<block_does_not_provide_example>",
"max_concurrent_requests": "<block_does_not_provide_example>"
}
v2¶
Class: AnthropicClaudeBlockV2 (there are multiple versions of this block)
Source: inference.core.workflows.core_steps.models.foundation.anthropic_claude.v2.AnthropicClaudeBlockV2
Warning: This block has multiple versions. Please refer to the specific version for details. You can learn more about how versions work here: Versioning
Ask a question to Anthropic Claude model with vision capabilities.
You can specify arbitrary text prompts or predefined ones, the block supports the following types of prompt:
-
Open Prompt (
unconstrained) - Use any prompt to generate a raw response -
Text Recognition (OCR) (
ocr) - Model recognizes text in the image -
Visual Question Answering (
visual-question-answering) - Model answers the question you submit in the prompt -
Captioning (short) (
caption) - Model provides a short description of the image -
Captioning (
detailed-caption) - Model provides a long description of the image -
Single-Label Classification (
classification) - Model classifies the image content as one of the provided classes -
Multi-Label Classification (
multi-label-classification) - Model classifies the image content as one or more of the provided classes -
Unprompted Object Detection (
object-detection) - Model detects and returns the bounding boxes for prominent objects in the image -
Structured Output Generation (
structured-answering) - Model returns a JSON response with the specified fields
You need to provide your Anthropic API key to use the Claude model.
Type identifier¶
Use the following identifier in step "type" field: roboflow_core/anthropic_claude@v2to add the block as
as step in your workflow.
Properties¶
| Name | Type | Description | Refs |
|---|---|---|---|
name |
str |
Enter a unique identifier for this step.. | ❌ |
task_type |
str |
Task type to be performed by model. Value determines required parameters and output response.. | ❌ |
prompt |
str |
Text prompt to the Claude model. | ✅ |
output_structure |
Dict[str, str] |
Dictionary with structure of expected JSON response. | ❌ |
classes |
List[str] |
List of classes to be used. | ✅ |
api_key |
str |
Your Anthropic API key. | ✅ |
model_version |
str |
Model to be used. | ✅ |
extended_thinking |
bool |
Enable extended thinking for deeper reasoning on complex tasks. Note: temperature cannot be used when extended thinking is enabled.. | ❌ |
thinking_budget_tokens |
int |
Maximum number of tokens for internal thinking when extended thinking is enabled. Higher values allow deeper reasoning but increase latency and cost. Must be less than max_tokens. Minimum: 1024.. | ❌ |
max_tokens |
int |
Maximum number of tokens the model can generate in its response.. | ❌ |
temperature |
float |
Temperature to sample from the model - value in range 0.0-1.0, the higher - the more random / "creative" the generations are. Cannot be used when extended_thinking is enabled.. | ✅ |
max_image_size |
int |
Maximum size of the image - if input has larger side, it will be downscaled, keeping aspect ratio. | ✅ |
max_concurrent_requests |
int |
Number of concurrent requests that can be executed by block when batch of input images provided. If not given - block defaults to value configured globally in Workflows Execution Engine. Please restrict if you hit Anthropic API limits.. | ❌ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow runtime. See Bindings for more info.
Runtime compatibility¶
-
requires_internet— air-gapped / offline deployments - This block depends on a service that is not reachable from fully offline / air-gapped deployments.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to Anthropic Claude in version v2.
- inputs:
Roboflow Asset Library Attributes,MoonshotAI Kimi,Image Blur,Reference Path Visualization,SIFT Comparison,Event Writer,Slack Notification,Halo Visualization,VLM As Classifier,Image Stack,Google Gemma,Qwen 3.6 API,Clip Comparison,Dot Visualization,Label Visualization,Background Color Visualization,Llama 3.2 Vision,Email Notification,Pixelate Visualization,OpenAI-Compatible LLM,Google Gemini,Anthropic Claude,OpenAI,Trace Visualization,Llama 3.2 Vision,Detection Event Log,Clip Comparison,Camera Focus,OpenAI,GLM-OCR,PLC ModbusTCP,Buffer,MQTT Writer,SIFT Comparison,CSV Formatter,Webhook Sink,Image Contours,Local File Sink,Motion Detection,Google Gemini,MoonshotAI Kimi,Polygon Visualization,Dimension Collapse,SIFT,Classification Label Visualization,Multi-Label Classification Model,Keypoint Detection Model,Keypoint Visualization,Template Matching,Icon Visualization,Dynamic Crop,Stability AI Inpainting,Bounding Box Visualization,Polygon Zone Visualization,Stability AI Outpainting,Crop Visualization,Image Convert Grayscale,Mask Visualization,Halo Visualization,Distance Measurement,PLC EthernetIP,Text Display,Morphological Transformation,Anthropic Claude,Roboflow Dataset Upload,Object Detection Model,Ellipse Visualization,Size Measurement,Circle Visualization,Twilio SMS Notification,Email Notification,S3 Sink,Camera Focus,Identify Changes,Image Slicer,LMM For Classification,OCR Model,Heatmap Visualization,OpenAI,Google Gemma API,Stitch Images,Morphological Transformation,EasyOCR,Current Time,Blur Visualization,Stitch OCR Detections,Detections List Roll-Up,Florence-2 Model,Google Gemini,Corner Visualization,OpenRouter,Pixel Color Count,Model Comparison Visualization,Model Monitoring Inference Aggregator,Google Vision OCR,Image Threshold,LMM,Single-Label Classification Model,Polygon Visualization,Stability AI Image Generation,Line Counter Visualization,Line Counter,CogVLM,Relative Static Crop,Qwen3.5-VL,Grid Visualization,Image Preprocessing,Stitch OCR Detections,Gaze Detection,Anthropic Claude,OPC UA Writer Sink,Color Visualization,Dynamic Zone,Triangle Visualization,QR Code Generator,Contrast Enhancement,Roboflow Dataset Upload,Absolute Static Crop,Qwen 3.5 API,Background Subtraction,OpenAI,Image Slicer,Qwen-VL,Florence-2 Model,Perspective Correction,Twilio SMS/MMS Notification,Roboflow Vision Events,Microsoft SQL Server Sink,Cosine Similarity,Instance Segmentation Model,Depth Estimation,Roboflow Custom Metadata,Contrast Equalization,Camera Calibration,VLM As Detector,Line Counter - outputs:
Cache Set,MoonshotAI Kimi,Roboflow Asset Library Attributes,Path Deviation,Image Blur,Keypoint Detection Model,Reference Path Visualization,PTZ Tracking (ONVIF),Event Writer,Slack Notification,Halo Visualization,CLIP Embedding Model,VLM As Classifier,Google Gemma,Qwen 3.6 API,Clip Comparison,Object Detection Model,Dot Visualization,Label Visualization,Background Color Visualization,Llama 3.2 Vision,Email Notification,OpenAI-Compatible LLM,Google Gemini,JSON Parser,Anthropic Claude,Cache Get,OpenAI,Trace Visualization,Llama 3.2 Vision,OpenAI,Clip Comparison,GLM-OCR,Buffer,MQTT Writer,Webhook Sink,SIFT Comparison,Local File Sink,Motion Detection,Google Gemini,MoonshotAI Kimi,Polygon Visualization,Classification Label Visualization,Instance Segmentation Model,Keypoint Detection Model,Keypoint Visualization,Instance Segmentation Model,Icon Visualization,Seg Preview,Dynamic Crop,Stability AI Inpainting,Bounding Box Visualization,Polygon Zone Visualization,Stability AI Outpainting,Multi-Label Classification Model,Crop Visualization,Mask Visualization,Halo Visualization,Detections Stitch,Distance Measurement,PLC EthernetIP,Anthropic Claude,Morphological Transformation,Text Display,VLM As Classifier,Roboflow Dataset Upload,VLM As Detector,Detections Consensus,Object Detection Model,Ellipse Visualization,Keypoint Detection Model,SAM3 Video Tracker,Time in Zone,SAM 3,Size Measurement,Circle Visualization,Semantic Segmentation Model,Path Deviation,Twilio SMS Notification,Email Notification,S3 Sink,SAM 3,LMM For Classification,Heatmap Visualization,Google Gemma API,OpenAI,Time in Zone,Morphological Transformation,Single-Label Classification Model,YOLO-World Model,Current Time,Stitch OCR Detections,Moondream2,Detections List Roll-Up,Florence-2 Model,Google Gemini,Corner Visualization,OpenRouter,Pixel Color Count,Model Comparison Visualization,SAM 3,Model Monitoring Inference Aggregator,Google Vision OCR,Image Threshold,Instance Segmentation Model,LMM,Polygon Visualization,Segment Anything 2 Model,Time in Zone,Stability AI Image Generation,Line Counter Visualization,Line Counter,CogVLM,Qwen3.5-VL,Grid Visualization,Image Preprocessing,Stitch OCR Detections,Anthropic Claude,OPC UA Writer Sink,Color Visualization,Triangle Visualization,QR Code Generator,Qwen 3.5 API,Roboflow Dataset Upload,OpenAI,Qwen-VL,Florence-2 Model,Perspective Correction,Roboflow Vision Events,Microsoft SQL Server Sink,Twilio SMS/MMS Notification,Perception Encoder Embedding Model,Instance Segmentation Model,Depth Estimation,Roboflow Custom Metadata,Contrast Equalization,Detections Classes Replacement,VLM As Detector,Line Counter,Object Detection Model
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
Anthropic Claude in version v2 has.
Bindings
-
input
images(image): The image to infer on..prompt(string): Text prompt to the Claude model.classes(list_of_values): List of classes to be used.api_key(Union[secret,string]): Your Anthropic API key.model_version(string): Model to be used.temperature(float): Temperature to sample from the model - value in range 0.0-1.0, the higher - the more random / "creative" the generations are. Cannot be used when extended_thinking is enabled..max_image_size(integer): Maximum size of the image - if input has larger side, it will be downscaled, keeping aspect ratio.
-
output
output(Union[string,language_model_output]): String value ifstringor LLM / VLM output iflanguage_model_output.classes(list_of_values): List of values of any type.
Example JSON definition of step Anthropic Claude in version v2
{
"name": "<your_step_name_here>",
"type": "roboflow_core/anthropic_claude@v2",
"images": "$inputs.image",
"task_type": "<block_does_not_provide_example>",
"prompt": "my prompt",
"output_structure": {
"my_key": "description"
},
"classes": [
"class-a",
"class-b"
],
"api_key": "xxx-xxx",
"model_version": "claude-sonnet-4-5",
"extended_thinking": "<block_does_not_provide_example>",
"thinking_budget_tokens": "<block_does_not_provide_example>",
"max_tokens": "<block_does_not_provide_example>",
"temperature": "<block_does_not_provide_example>",
"max_image_size": "<block_does_not_provide_example>",
"max_concurrent_requests": "<block_does_not_provide_example>"
}
v1¶
Class: AnthropicClaudeBlockV1 (there are multiple versions of this block)
Source: inference.core.workflows.core_steps.models.foundation.anthropic_claude.v1.AnthropicClaudeBlockV1
Warning: This block has multiple versions. Please refer to the specific version for details. You can learn more about how versions work here: Versioning
Ask a question to Anthropic Claude model with vision capabilities.
You can specify arbitrary text prompts or predefined ones, the block supports the following types of prompt:
-
Open Prompt (
unconstrained) - Use any prompt to generate a raw response -
Text Recognition (OCR) (
ocr) - Model recognizes text in the image -
Visual Question Answering (
visual-question-answering) - Model answers the question you submit in the prompt -
Captioning (short) (
caption) - Model provides a short description of the image -
Captioning (
detailed-caption) - Model provides a long description of the image -
Single-Label Classification (
classification) - Model classifies the image content as one of the provided classes -
Multi-Label Classification (
multi-label-classification) - Model classifies the image content as one or more of the provided classes -
Unprompted Object Detection (
object-detection) - Model detects and returns the bounding boxes for prominent objects in the image -
Structured Output Generation (
structured-answering) - Model returns a JSON response with the specified fields
You need to provide your Anthropic API key to use the Claude model.
Type identifier¶
Use the following identifier in step "type" field: roboflow_core/anthropic_claude@v1to add the block as
as step in your workflow.
Properties¶
| Name | Type | Description | Refs |
|---|---|---|---|
name |
str |
Enter a unique identifier for this step.. | ❌ |
task_type |
str |
Task type to be performed by model. Value determines required parameters and output response.. | ❌ |
prompt |
str |
Text prompt to the Claude model. | ✅ |
output_structure |
Dict[str, str] |
Dictionary with structure of expected JSON response. | ❌ |
classes |
List[str] |
List of classes to be used. | ✅ |
api_key |
str |
Your Anthropic API key. | ✅ |
model_version |
str |
Model to be used. | ✅ |
max_tokens |
int |
Maximum number of tokens the model can generate in it's response.. | ❌ |
temperature |
float |
Temperature to sample from the model - value in range 0.0-2.0, the higher - the more random / "creative" the generations are.. | ✅ |
max_image_size |
int |
Maximum size of the image - if input has larger side, it will be downscaled, keeping aspect ratio. | ✅ |
max_concurrent_requests |
int |
Number of concurrent requests that can be executed by block when batch of input images provided. If not given - block defaults to value configured globally in Workflows Execution Engine. Please restrict if you hit Anthropic API limits.. | ❌ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow runtime. See Bindings for more info.
Runtime compatibility¶
-
requires_internet— air-gapped / offline deployments - This block depends on a service that is not reachable from fully offline / air-gapped deployments.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to Anthropic Claude in version v1.
- inputs:
Roboflow Asset Library Attributes,MoonshotAI Kimi,Image Blur,Reference Path Visualization,SIFT Comparison,Event Writer,Slack Notification,Halo Visualization,VLM As Classifier,Image Stack,Google Gemma,Qwen 3.6 API,Clip Comparison,Dot Visualization,Label Visualization,Background Color Visualization,Llama 3.2 Vision,Email Notification,Pixelate Visualization,OpenAI-Compatible LLM,Google Gemini,Anthropic Claude,OpenAI,Trace Visualization,Llama 3.2 Vision,Detection Event Log,Clip Comparison,Camera Focus,OpenAI,GLM-OCR,PLC ModbusTCP,Buffer,MQTT Writer,SIFT Comparison,CSV Formatter,Webhook Sink,Image Contours,Local File Sink,Motion Detection,Google Gemini,MoonshotAI Kimi,Polygon Visualization,Dimension Collapse,SIFT,Classification Label Visualization,Multi-Label Classification Model,Keypoint Detection Model,Keypoint Visualization,Template Matching,Icon Visualization,Dynamic Crop,Stability AI Inpainting,Bounding Box Visualization,Polygon Zone Visualization,Stability AI Outpainting,Crop Visualization,Image Convert Grayscale,Mask Visualization,Halo Visualization,Distance Measurement,PLC EthernetIP,Text Display,Morphological Transformation,Anthropic Claude,Roboflow Dataset Upload,Object Detection Model,Ellipse Visualization,Size Measurement,Circle Visualization,Twilio SMS Notification,Email Notification,S3 Sink,Camera Focus,Identify Changes,Image Slicer,LMM For Classification,OCR Model,Heatmap Visualization,OpenAI,Google Gemma API,Stitch Images,Morphological Transformation,EasyOCR,Current Time,Blur Visualization,Stitch OCR Detections,Detections List Roll-Up,Florence-2 Model,Google Gemini,Corner Visualization,OpenRouter,Pixel Color Count,Model Comparison Visualization,Model Monitoring Inference Aggregator,Google Vision OCR,Image Threshold,LMM,Single-Label Classification Model,Polygon Visualization,Stability AI Image Generation,Line Counter Visualization,Line Counter,CogVLM,Relative Static Crop,Qwen3.5-VL,Grid Visualization,Image Preprocessing,Stitch OCR Detections,Gaze Detection,Anthropic Claude,OPC UA Writer Sink,Color Visualization,Dynamic Zone,Triangle Visualization,QR Code Generator,Contrast Enhancement,Roboflow Dataset Upload,Absolute Static Crop,Qwen 3.5 API,Background Subtraction,OpenAI,Image Slicer,Qwen-VL,Florence-2 Model,Perspective Correction,Twilio SMS/MMS Notification,Roboflow Vision Events,Microsoft SQL Server Sink,Cosine Similarity,Instance Segmentation Model,Depth Estimation,Roboflow Custom Metadata,Contrast Equalization,Camera Calibration,VLM As Detector,Line Counter - outputs:
Cache Set,MoonshotAI Kimi,Roboflow Asset Library Attributes,Path Deviation,Image Blur,Keypoint Detection Model,Reference Path Visualization,PTZ Tracking (ONVIF),Event Writer,Slack Notification,Halo Visualization,CLIP Embedding Model,VLM As Classifier,Google Gemma,Qwen 3.6 API,Clip Comparison,Object Detection Model,Dot Visualization,Label Visualization,Background Color Visualization,Llama 3.2 Vision,Email Notification,OpenAI-Compatible LLM,Google Gemini,JSON Parser,Anthropic Claude,Cache Get,OpenAI,Trace Visualization,Llama 3.2 Vision,OpenAI,Clip Comparison,GLM-OCR,Buffer,MQTT Writer,Webhook Sink,SIFT Comparison,Local File Sink,Motion Detection,Google Gemini,MoonshotAI Kimi,Polygon Visualization,Classification Label Visualization,Instance Segmentation Model,Keypoint Detection Model,Keypoint Visualization,Instance Segmentation Model,Icon Visualization,Seg Preview,Dynamic Crop,Stability AI Inpainting,Bounding Box Visualization,Polygon Zone Visualization,Stability AI Outpainting,Multi-Label Classification Model,Crop Visualization,Mask Visualization,Halo Visualization,Detections Stitch,Distance Measurement,PLC EthernetIP,Anthropic Claude,Morphological Transformation,Text Display,VLM As Classifier,Roboflow Dataset Upload,VLM As Detector,Detections Consensus,Object Detection Model,Ellipse Visualization,Keypoint Detection Model,SAM3 Video Tracker,Time in Zone,SAM 3,Size Measurement,Circle Visualization,Semantic Segmentation Model,Path Deviation,Twilio SMS Notification,Email Notification,S3 Sink,SAM 3,LMM For Classification,Heatmap Visualization,Google Gemma API,OpenAI,Time in Zone,Morphological Transformation,Single-Label Classification Model,YOLO-World Model,Current Time,Stitch OCR Detections,Moondream2,Detections List Roll-Up,Florence-2 Model,Google Gemini,Corner Visualization,OpenRouter,Pixel Color Count,Model Comparison Visualization,SAM 3,Model Monitoring Inference Aggregator,Google Vision OCR,Image Threshold,Instance Segmentation Model,LMM,Polygon Visualization,Segment Anything 2 Model,Time in Zone,Stability AI Image Generation,Line Counter Visualization,Line Counter,CogVLM,Qwen3.5-VL,Grid Visualization,Image Preprocessing,Stitch OCR Detections,Anthropic Claude,OPC UA Writer Sink,Color Visualization,Triangle Visualization,QR Code Generator,Qwen 3.5 API,Roboflow Dataset Upload,OpenAI,Qwen-VL,Florence-2 Model,Perspective Correction,Roboflow Vision Events,Microsoft SQL Server Sink,Twilio SMS/MMS Notification,Perception Encoder Embedding Model,Instance Segmentation Model,Depth Estimation,Roboflow Custom Metadata,Contrast Equalization,Detections Classes Replacement,VLM As Detector,Line Counter,Object Detection Model
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
Anthropic Claude in version v1 has.
Bindings
-
input
images(image): The image to infer on..prompt(string): Text prompt to the Claude model.classes(list_of_values): List of classes to be used.api_key(Union[secret,string]): Your Anthropic API key.model_version(string): Model to be used.temperature(float): Temperature to sample from the model - value in range 0.0-2.0, the higher - the more random / "creative" the generations are..max_image_size(integer): Maximum size of the image - if input has larger side, it will be downscaled, keeping aspect ratio.
-
output
output(Union[string,language_model_output]): String value ifstringor LLM / VLM output iflanguage_model_output.classes(list_of_values): List of values of any type.
Example JSON definition of step Anthropic Claude in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/anthropic_claude@v1",
"images": "$inputs.image",
"task_type": "<block_does_not_provide_example>",
"prompt": "my prompt",
"output_structure": {
"my_key": "description"
},
"classes": [
"class-a",
"class-b"
],
"api_key": "xxx-xxx",
"model_version": "claude-sonnet-4",
"max_tokens": "<block_does_not_provide_example>",
"temperature": "<block_does_not_provide_example>",
"max_image_size": "<block_does_not_provide_example>",
"max_concurrent_requests": "<block_does_not_provide_example>"
}