Stability AI Image Generation
Class: StabilityAIImageGenBlockV1
This block wraps the Stability AI image generation API and lets users generate new images from text or create variations of existing images.
Type identifier
Use the identifier `roboflow_core/stability_ai_image_gen@v1` in the step "type" field to add the block as a step in your workflow.
Properties
| Name | Type | Description | Refs |
|---|---|---|---|
| `name` | `str` | Enter a unique identifier for this step. | ❌ |
| `strength` | `float` | Controls how much influence the image parameter has on the generated image. A value of 0 would yield an image that is identical to the input. A value of 1 would be as if you passed in no image at all. | ✅ |
| `prompt` | `str` | Prompt to generate new images from text (what you wish to see). | ✅ |
| `negative_prompt` | `str` | Negative prompt for the image generation model (what you do not wish to see). | ✅ |
| `model` | `str` | Choose one of {'core', 'ultra', 'sd3'}. Default 'core'. | ✅ |
| `api_key` | `str` | Your Stability AI API key. | ✅ |
The Refs column indicates whether a property can be parametrised with dynamic values available at workflow runtime. See Bindings for more info.
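As a rough illustration of the Refs column, the sketch below builds a step definition in Python in which several ✅ properties are bound to runtime values through selectors, while `name` stays a literal. The `$inputs.<name>` selector form is taken from the example step definition for this block. The input names (`prompt`, `stability_api_key`), the step name, and the helpers `is_selector` and `misplaced_selectors` are hypothetical and exist only for this illustration.

```python
# Properties marked with a check mark in the Refs column of the table above.
REF_ENABLED = {"strength", "prompt", "negative_prompt", "model", "api_key"}
# `image` is not in the properties table but is listed among the input bindings.
SELECTOR_ALLOWED = REF_ENABLED | {"image"}


def is_selector(value: object) -> bool:
    """Hypothetical helper: treat strings starting with '$' as selectors."""
    return isinstance(value, str) and value.startswith("$")


def misplaced_selectors(step: dict) -> list:
    """Return the names of properties that hold a selector but may not."""
    return sorted(
        key
        for key, value in step.items()
        if is_selector(value) and key not in SELECTOR_ALLOWED
    )


step = {
    "name": "image_gen",  # static: the Refs column shows a cross for `name`
    "type": "roboflow_core/stability_ai_image_gen@v1",
    "image": "$inputs.image",
    "strength": 0.3,  # ref-enabled properties may still be literals
    "prompt": "$inputs.prompt",
    "negative_prompt": "blurry, low quality",
    "model": "core",
    "api_key": "$inputs.stability_api_key",
}

print(misplaced_selectors(step))  # → []
```

Binding `api_key` to a runtime input rather than a literal keeps the secret out of the stored definition.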
Runtime compatibility
- requires_internet (air-gapped / offline deployments): This block depends on a service that is not reachable from fully offline / air-gapped deployments.
Available Connections
Compatible Blocks
Check what blocks you can connect to Stability AI Image Generation in version v1.
- inputs: VLM As Classifier, MoonshotAI Kimi, Stability AI Image Generation, Trace Visualization, Anthropic Claude, Icon Visualization, SIFT Comparison, Morphological Transformation, Color Visualization, LMM For Classification, Perspective Correction, Corner Visualization, Roboflow Custom Metadata, Halo Visualization, Qwen-VL, Email Notification, Halo Visualization, Google Gemma, Background Color Visualization, Ellipse Visualization, Email Notification, Twilio SMS/MMS Notification, Text Display, Polygon Visualization, Crop Visualization, Absolute Static Crop, Image Preprocessing, Model Monitoring Inference Aggregator, Relative Static Crop, OpenRouter, OpenAI, Florence-2 Model, OpenAI, Heatmap Visualization, OCR Model, Blur Visualization, Depth Estimation, Instance Segmentation Model, Stability AI Outpainting, Anthropic Claude, Google Gemini, Clip Comparison, Google Gemini, Background Subtraction, Keypoint Visualization, CSV Formatter, Webhook Sink, Stitch Images, Florence-2 Model, Current Time, Contrast Equalization, OpenAI, VLM As Detector, Google Gemini, Triangle Visualization, Slack Notification, SIFT, Local File Sink, Image Contours, Keypoint Detection Model, GLM-OCR, Roboflow Asset Library Attributes, Image Slicer, Polygon Zone Visualization, Contrast Enhancement, Google Gemma API, Stitch OCR Detections, Image Threshold, Line Counter Visualization, Camera Calibration, QR Code Generator, S3 Sink, Microsoft SQL Server Sink, Google Vision OCR, Twilio SMS Notification, Image Blur, Morphological Transformation, Camera Focus, Roboflow Vision Events, Stability AI Inpainting, Classification Label Visualization, Stitch OCR Detections, Event Writer, Grid Visualization, Qwen3.5-VL, Mask Visualization, Llama 3.2 Vision, Reference Path Visualization, Image Slicer, Label Visualization, Identify Outliers, OPC UA Writer Sink, Dot Visualization, Identify Changes, Dynamic Crop, Circle Visualization, Llama 3.2 Vision, Camera Focus, OpenAI-Compatible LLM, MoonshotAI Kimi, Single-Label Classification Model, CogVLM, Qwen 3.6 API, Detections Consensus, Bounding Box Visualization, Multi-Label Classification Model, LMM, OpenAI, Image Convert Grayscale, Roboflow Visual Search, EasyOCR, Roboflow Dataset Upload, Pixelate Visualization, Roboflow Dataset Upload, PLC Writer, Qwen 3.5 API, Anthropic Claude, Object Detection Model, MQTT Writer, Polygon Visualization, Model Comparison Visualization
- outputs: VLM As Classifier, MoonshotAI Kimi, Stability AI Image Generation, Trace Visualization, Qwen2.5-VL, Image Stack, Anthropic Claude, Icon Visualization, SIFT Comparison, Morphological Transformation, Color Visualization, SmolVLM2, LMM For Classification, Single-Label Classification Model, Perspective Correction, Corner Visualization, Clip Comparison, Halo Visualization, Qwen-VL, Keypoint Detection Model, Halo Visualization, Object Detection Model, Google Gemma, Background Color Visualization, Ellipse Visualization, Email Notification, Twilio SMS/MMS Notification, Text Display, Polygon Visualization, Crop Visualization, Absolute Static Crop, Image Preprocessing, Template Matching, Relative Static Crop, OpenRouter, OpenAI, Florence-2 Model, VLM As Detector, OpenAI, Motion Detection, Heatmap Visualization, OCR Model, Perception Encoder Embedding Model, Blur Visualization, Barcode Detection, Depth Estimation, Instance Segmentation Model, Stability AI Outpainting, Anthropic Claude, YOLO-World Model, Google Gemini, Clip Comparison, Google Gemini, Background Subtraction, Keypoint Visualization, Buffer, Stitch Images, Florence-2 Model, Contrast Equalization, Mask Edge Snap, OpenAI, Qwen3-VL, Moondream2, VLM As Detector, Google Gemini, Triangle Visualization, CLIP Embedding Model, Detections Stabilizer, SIFT, Multi-Label Classification Model, Image Contours, Keypoint Detection Model, VLM As Classifier, Pixel Color Count, GLM-OCR, Image Slicer, Polygon Zone Visualization, Contrast Enhancement, Google Gemma API, Time in Zone, Semantic Segmentation Model, Image Threshold, Line Counter Visualization, Semantic Segmentation Model, Multi-Label Classification Model, Camera Calibration, ByteTrack Tracker, Google Vision OCR, Image Blur, Morphological Transformation, Camera Focus, Roboflow Vision Events, Stability AI Inpainting, Classification Label Visualization, SAM2 Video Tracker, Event Writer, Qwen3.5-VL, Mask Visualization, Llama 3.2 Vision, Dominant Color, Reference Path Visualization, Image Slicer, Label Visualization, Byte Tracker, Dot Visualization, Dynamic Crop, Detections Stitch, Circle Visualization, Llama 3.2 Vision, BoT-SORT Tracker, SAM3 Video Tracker, Camera Focus, Gaze Detection, Segment Anything 2 Model, MoonshotAI Kimi, Single-Label Classification Model, QR Code Detection, Qwen3.5, CogVLM, Object Detection Model, SAM 3 Interactive, Qwen 3.6 API, Bounding Box Visualization, Multi-Label Classification Model, LMM, OpenAI, SAM 3, Image Convert Grayscale, Instance Segmentation Model, EasyOCR, Roboflow Visual Search, Roboflow Dataset Upload, SAM 3, Instance Segmentation Model, Keypoint Detection Model, Pixelate Visualization, Roboflow Dataset Upload, SORT Tracker, Instance Segmentation Model, Track Class Lock, Qwen 3.5 API, Object Detection Model, Anthropic Claude, Polygon Visualization, OC-SORT Tracker, SAM 3, Model Comparison Visualization, Single-Label Classification Model, Seg Preview
Input and Output Bindings
The available connections depend on the block's binding kinds. Check which binding kinds Stability AI Image Generation in version v1 has.
Bindings
- input
    - `image` (`image`): The image to use as the starting point for the generation.
    - `strength` (`float_zero_to_one`): Controls how much influence the image parameter has on the generated image. A value of 0 would yield an image that is identical to the input. A value of 1 would be as if you passed in no image at all.
    - `prompt` (`string`): Prompt to generate new images from text (what you wish to see).
    - `negative_prompt` (`string`): Negative prompt for the image generation model (what you do not wish to see).
    - `model` (`string`): Choose one of {'core', 'ultra', 'sd3'}. Default 'core'.
    - `api_key` (`Union[secret, string]`): Your Stability AI API key.
- output
    - `image` (`image`): Image in workflows.
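The binding kinds above imply a few constraints that can be checked before a step is submitted. The following is a minimal sketch and not part of the block itself: `validate_step_params` is a hypothetical helper that checks a literal `strength` against the `float_zero_to_one` kind and a literal `model` against the documented set {'core', 'ultra', 'sd3'}. Values given as selectors (strings starting with `$`) are skipped, on the assumption that they are only resolved at runtime.

```python
ALLOWED_MODELS = {"core", "ultra", "sd3"}  # from the `model` description above


def validate_step_params(step: dict) -> list:
    """Hypothetical pre-flight check; returns a list of readable problems."""
    problems = []

    strength = step.get("strength")
    # Only literal numbers are checked; selectors are strings and fall through.
    if isinstance(strength, (int, float)) and not isinstance(strength, bool):
        if not 0.0 <= strength <= 1.0:
            problems.append(f"strength must be within [0, 1], got {strength}")

    model = step.get("model", "core")  # 'core' is the documented default
    if isinstance(model, str) and not model.startswith("$"):
        if model not in ALLOWED_MODELS:
            problems.append(
                f"model must be one of {sorted(ALLOWED_MODELS)}, got {model!r}"
            )

    return problems


print(validate_step_params({"strength": 0.3, "model": "core"}))  # → []
print(validate_step_params({"strength": 1.5, "model": "my prompt"}))
```

The second call reports two problems, one for each out-of-range value.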
Example JSON definition of step Stability AI Image Generation in version v1
{
    "name": "<your_step_name_here>",
    "type": "roboflow_core/stability_ai_image_gen@v1",
    "image": "$inputs.image",
    "strength": 0.3,
    "prompt": "my prompt",
    "negative_prompt": "my prompt",
    "model": "core",
    "api_key": "xxx-xxx"
}
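To show where such a step sits in a complete workflow, here is a minimal sketch that assembles a definition in Python and serialises it to JSON. Only the step itself follows the example above. The surrounding layout (`version`, `inputs`, `outputs`, the type names `WorkflowImage`, `WorkflowParameter`, and `JsonField`, and the `$steps.<step_name>.<output>` selector form) is an assumption about the Workflows definition format and should be checked against the Workflows documentation. The step name, input names, and output name are hypothetical.

```python
import json

STEP_NAME = "image_gen"  # hypothetical step name

workflow_definition = {
    "version": "1.0",  # assumed definition version
    "inputs": [
        # Assumed input declarations; the names are illustrative only.
        {"type": "WorkflowImage", "name": "image"},
        {"type": "WorkflowParameter", "name": "prompt"},
        {"type": "WorkflowParameter", "name": "stability_api_key"},
    ],
    "steps": [
        {
            "name": STEP_NAME,
            "type": "roboflow_core/stability_ai_image_gen@v1",
            "image": "$inputs.image",
            "strength": 0.3,
            "prompt": "$inputs.prompt",
            "negative_prompt": "blurry, low quality",
            "model": "core",
            "api_key": "$inputs.stability_api_key",
        }
    ],
    "outputs": [
        {
            # The block exposes a single `image` output (see Bindings).
            "type": "JsonField",
            "name": "generated_image",
            "selector": f"$steps.{STEP_NAME}.image",
        }
    ],
}

print(json.dumps(workflow_definition, indent=2))
```

Since `api_key` accepts the `secret` kind, supplying it as a runtime input avoids storing the key in the definition.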