Stability AI Image Generation¶
Class: StabilityAIImageGenBlockV1
The block wraps Stability AI image generation API and let users generate new images from text, or create variations of existing images.
Type identifier¶
Use the following identifier in step "type" field: roboflow_core/stability_ai_image_gen@v1to add the block as
as step in your workflow.
Properties¶
| Name | Type | Description | Refs |
|---|---|---|---|
name |
str |
Enter a unique identifier for this step.. | ❌ |
strength |
float |
controls how much influence the image parameter has on the generated image. A value of 0 would yield an image that is identical to the input. A value of 1 would be as if you passed in no image at all.. | ✅ |
prompt |
str |
Prompt to generate new images from text (what you wish to see). | ✅ |
negative_prompt |
str |
Negative prompt to image generation model (what you do not wish to see). | ✅ |
model |
str |
choose one of {'core', 'ultra', 'sd3'}. Default 'core' . | ✅ |
api_key |
str |
Your Stability AI API key. | ✅ |
The Refs column marks possibility to parametrise the property with dynamic values available
in workflow runtime. See Bindings for more info.
Available Connections¶
Compatible Blocks
Check what blocks you can connect to Stability AI Image Generation in version v1.
- inputs:
Stability AI Outpainting,Contrast Enhancement,Camera Focus,Image Preprocessing,Corner Visualization,Ellipse Visualization,Object Detection Model,Roboflow Vision Events,Heatmap Visualization,Trace Visualization,OpenAI,Email Notification,Keypoint Visualization,Detections Consensus,Model Comparison Visualization,Polygon Zone Visualization,Dynamic Crop,Polygon Visualization,QR Code Generator,GLM-OCR,Stitch Images,OpenRouter,Clip Comparison,Image Blur,Model Monitoring Inference Aggregator,Google Gemini,Pixelate Visualization,EasyOCR,SIFT,Contrast Equalization,Image Threshold,Instance Segmentation Model,Polygon Visualization,Anthropic Claude,Halo Visualization,Roboflow Custom Metadata,Florence-2 Model,Local File Sink,Icon Visualization,Single-Label Classification Model,Image Contours,OpenAI,Grid Visualization,LMM,Image Convert Grayscale,Reference Path Visualization,Stitch OCR Detections,Keypoint Detection Model,SIFT Comparison,Identify Changes,Roboflow Dataset Upload,CSV Formatter,S3 Sink,OpenAI-Compatible LLM,Morphological Transformation,Identify Outliers,Crop Visualization,Blur Visualization,Mask Visualization,Stability AI Image Generation,Qwen-VL,Stitch OCR Detections,Google Gemma API,Image Slicer,Qwen 3.5 API,Background Color Visualization,Slack Notification,Anthropic Claude,Qwen 3.6 API,Webhook Sink,Color Visualization,Bounding Box Visualization,Google Gemma,Relative Static Crop,CogVLM,Llama 3.2 Vision,Qwen3.5-VL,Camera Focus,Google Vision OCR,Google Gemini,Llama 3.2 Vision,Twilio SMS Notification,Anthropic Claude,Image Slicer,Depth Estimation,OpenAI,Multi-Label Classification Model,Classification Label Visualization,Florence-2 Model,MoonshotAI Kimi,MoonshotAI Kimi,Dot Visualization,Background Subtraction,Roboflow Dataset Upload,Stability AI Inpainting,Label Visualization,Absolute Static Crop,Google Gemini,VLM As Classifier,Camera Calibration,Halo Visualization,Email Notification,OpenAI,LMM For Classification,Text Display,Circle Visualization,Line Counter Visualization,OCR Model,VLM As Detector,Morphological Transformation,Twilio SMS/MMS Notification,Triangle Visualization,Perspective Correction - outputs:
Stability AI Outpainting,Multi-Label Classification Model,CLIP Embedding Model,SAM 3,Motion Detection,Contrast Enhancement,Camera Focus,Image Preprocessing,Seg Preview,Ellipse Visualization,Corner Visualization,Roboflow Vision Events,Object Detection Model,Heatmap Visualization,Trace Visualization,OC-SORT Tracker,VLM As Classifier,Time in Zone,OpenAI,Byte Tracker,Keypoint Visualization,Model Comparison Visualization,YOLO-World Model,Polygon Zone Visualization,Single-Label Classification Model,Dynamic Crop,Polygon Visualization,GLM-OCR,Stitch Images,OpenRouter,Semantic Segmentation Model,Image Blur,Clip Comparison,Detections Stitch,Segment Anything 2 Model,Instance Segmentation Model,Google Gemini,Buffer,Pixelate Visualization,EasyOCR,SIFT,Contrast Equalization,Image Threshold,Instance Segmentation Model,Polygon Visualization,Anthropic Claude,Halo Visualization,Qwen2.5-VL,Keypoint Detection Model,Florence-2 Model,Icon Visualization,Single-Label Classification Model,Image Contours,OpenAI,SAM2 Video Tracker,SAM 3,ByteTrack Tracker,VLM As Detector,Barcode Detection,Multi-Label Classification Model,Object Detection Model,LMM,Image Convert Grayscale,Reference Path Visualization,Dominant Color,Keypoint Detection Model,SIFT Comparison,Roboflow Dataset Upload,BoT-SORT Tracker,Qwen3-VL,Object Detection Model,Morphological Transformation,Crop Visualization,Blur Visualization,Qwen-VL,Mask Visualization,Stability AI Image Generation,Google Gemma API,Qwen3.5,Image Slicer,Qwen 3.5 API,Perception Encoder Embedding Model,Background Color Visualization,Anthropic Claude,Qwen 3.6 API,Color Visualization,Bounding Box Visualization,Google Gemma,Relative Static Crop,Llama 3.2 Vision,CogVLM,Instance Segmentation Model,Qwen3.5-VL,Instance Segmentation Model,Google Vision OCR,Camera Focus,Google Gemini,Llama 3.2 Vision,SAM 3,Single-Label Classification Model,SORT Tracker,SmolVLM2,Detections Stabilizer,Moondream2,Anthropic Claude,Image Slicer,OpenAI,Depth Estimation,Multi-Label Classification Model,Gaze Detection,Template Matching,Classification Label Visualization,Florence-2 Model,MoonshotAI Kimi,MoonshotAI Kimi,Dot Visualization,Keypoint Detection Model,Background Subtraction,Roboflow Dataset Upload,Stability AI Inpainting,Semantic Segmentation Model,QR Code Detection,Label Visualization,Absolute Static Crop,Google Gemini,VLM As Classifier,Halo Visualization,Email Notification,Camera Calibration,OpenAI,Clip Comparison,Pixel Color Count,LMM For Classification,Text Display,Line Counter Visualization,Circle Visualization,OCR Model,VLM As Detector,Image Stack,Morphological Transformation,Twilio SMS/MMS Notification,Mask Edge Snap,Triangle Visualization,Perspective Correction
Input and Output Bindings¶
The available connections depend on its binding kinds. Check what binding kinds
Stability AI Image Generation in version v1 has.
Bindings
-
input
image(image): The image to use as the starting point for the generation..strength(float_zero_to_one): controls how much influence the image parameter has on the generated image. A value of 0 would yield an image that is identical to the input. A value of 1 would be as if you passed in no image at all..prompt(string): Prompt to generate new images from text (what you wish to see).negative_prompt(string): Negative prompt to image generation model (what you do not wish to see).model(string): choose one of {'core', 'ultra', 'sd3'}. Default 'core' .api_key(Union[secret,string]): Your Stability AI API key.
-
output
image(image): Image in workflows.
Example JSON definition of step Stability AI Image Generation in version v1
{
"name": "<your_step_name_here>",
"type": "roboflow_core/stability_ai_image_gen@v1",
"image": "$inputs.image",
"strength": 0.3,
"prompt": "my prompt",
"negative_prompt": "my prompt",
"model": "my prompt",
"api_key": "xxx-xxx"
}