Stability AI Image Generation
Class: StabilityAIImageGenBlockV1
The block wraps the Stability AI image generation API and lets users generate new images from text or create variations of existing images.
Type identifier
Use the following identifier in the step "type" field: roboflow_core/stability_ai_image_gen@v1 to add the block as
a step in your workflow.
Properties
| Name | Type | Description | Refs |
|---|---|---|---|
| name | str | Enter a unique identifier for this step. | ❌ |
| strength | float | Controls how much influence the image parameter has on the generated image. A value of 0 yields an image identical to the input; a value of 1 is as if no image had been passed at all. | ✅ |
| prompt | str | Prompt to generate new images from text (what you wish to see). | ✅ |
| negative_prompt | str | Negative prompt for the image generation model (what you do not wish to see). | ✅ |
| model | str | Choose one of {'core', 'ultra', 'sd3'}. Default: 'core'. | ✅ |
| api_key | str | Your Stability AI API key. | ✅ |
The Refs column indicates whether a property can be parametrised with dynamic values available
at workflow runtime. See Bindings for more info.
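As a minimal sketch of what the Refs column means in practice, the Python snippet below builds a step definition in which several Refs-enabled properties are bound to runtime inputs instead of being hard-coded. The `$inputs.<name>` selector form follows the example definition on this page; the step name, the input names `prompt`, `strength`, and `stability_api_key`, and the `is_selector` helper are hypothetical and chosen only for illustration.

```python
# Hypothetical step definition. Properties marked with a check in the Refs
# column may hold either a literal value or a "$..." selector; "name" may not.
step = {
    "name": "image_gen",  # Refs: no, so this must be a literal string
    "type": "roboflow_core/stability_ai_image_gen@v1",
    "image": "$inputs.image",
    "strength": "$inputs.strength",          # assumed input name
    "prompt": "$inputs.prompt",              # assumed input name
    "negative_prompt": "blurry, low quality",  # literal value
    "model": "core",                         # literal value
    "api_key": "$inputs.stability_api_key",  # assumed input name
}


def is_selector(value):
    """Return True if the value looks like a dynamic reference (starts with '$')."""
    return isinstance(value, str) and value.startswith("$")


# List the properties that will be resolved at workflow runtime.
dynamic = sorted(key for key, value in step.items() if is_selector(value))
print(dynamic)
```

Binding the API key to an input rather than embedding it in the definition keeps the secret out of the stored workflow, which is one common reason to prefer a selector over a literal.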
Available Connections
Compatible Blocks
Check which blocks you can connect to Stability AI Image Generation in version v1.
- inputs:
  Clip Comparison, Florence-2 Model, Morphological Transformation, Google Gemini, LMM, Instance Segmentation Model, Polygon Zone Visualization, Email Notification, Keypoint Visualization, Roboflow Custom Metadata, Camera Focus, Anthropic Claude, Multi-Label Classification Model, Image Threshold, LMM For Classification, Keypoint Detection Model, Anthropic Claude, Email Notification, Reference Path Visualization, Stitch OCR Detections, Camera Focus, Image Slicer, Stability AI Image Generation, Stability AI Outpainting, Stitch Images, Blur Visualization, OpenAI, Roboflow Dataset Upload, Depth Estimation, Google Gemini, CogVLM, Image Preprocessing, Identify Outliers, Local File Sink, Florence-2 Model, Image Convert Grayscale, Dynamic Crop, Dot Visualization, Triangle Visualization, OCR Model, Crop Visualization, Twilio SMS Notification, Perspective Correction, Twilio SMS/MMS Notification, EasyOCR, Grid Visualization, Google Gemini, Trace Visualization, QR Code Generator, Pixelate Visualization, Detections Consensus, OpenAI, Camera Calibration, Roboflow Dataset Upload, Webhook Sink, Single-Label Classification Model, Object Detection Model, VLM as Detector, Background Subtraction, Bounding Box Visualization, Contrast Equalization, Halo Visualization, Model Comparison Visualization, Label Visualization, Slack Notification, OpenAI, Circle Visualization, Image Contours, Background Color Visualization, Image Blur, Mask Visualization, VLM as Classifier, Google Vision OCR, Llama 3.2 Vision, Color Visualization, Corner Visualization, Classification Label Visualization, OpenAI, Line Counter Visualization, Ellipse Visualization, Icon Visualization, Model Monitoring Inference Aggregator, Image Slicer, Absolute Static Crop, Polygon Visualization, SIFT Comparison, Stability AI Inpainting, Identify Changes, Relative Static Crop, SIFT, CSV Formatter, Text Display
- outputs:
  Instance Segmentation Model, Clip Comparison, Florence-2 Model, Morphological Transformation, Google Gemini, LMM, Instance Segmentation Model, Motion Detection, Email Notification, Detections Stitch, Polygon Zone Visualization, Keypoint Visualization, Camera Focus, Anthropic Claude, Multi-Label Classification Model, Pixel Color Count, Image Threshold, LMM For Classification, Keypoint Detection Model, Anthropic Claude, Gaze Detection, Reference Path Visualization, Camera Focus, Stability AI Image Generation, Stitch Images, Stability AI Outpainting, Image Slicer, SmolVLM2, OpenAI, Roboflow Dataset Upload, Depth Estimation, YOLO-World Model, Google Gemini, CogVLM, Image Preprocessing, VLM as Detector, Florence-2 Model, Image Convert Grayscale, SAM 3, Byte Tracker, Dynamic Crop, Time in Zone, Perception Encoder Embedding Model, Moondream2, Triangle Visualization, Dot Visualization, OCR Model, Seg Preview, Crop Visualization, Twilio SMS/MMS Notification, Perspective Correction, EasyOCR, SAM 3, Google Gemini, Object Detection Model, Text Display, Trace Visualization, Pixelate Visualization, OpenAI, CLIP Embedding Model, Camera Calibration, Roboflow Dataset Upload, Buffer, Barcode Detection, Object Detection Model, Single-Label Classification Model, QR Code Detection, VLM as Detector, Background Subtraction, Bounding Box Visualization, Contrast Equalization, Model Comparison Visualization, Halo Visualization, Label Visualization, OpenAI, Circle Visualization, Qwen2.5-VL, Image Contours, Image Blur, Background Color Visualization, Mask Visualization, Dominant Color, VLM as Classifier, Google Vision OCR, Llama 3.2 Vision, Color Visualization, Corner Visualization, Classification Label Visualization, Single-Label Classification Model, OpenAI, Segment Anything 2 Model, Clip Comparison, Template Matching, Line Counter Visualization, Icon Visualization, Ellipse Visualization, Image Slicer, Detections Stabilizer, Absolute Static Crop, VLM as Classifier, Polygon Visualization, SIFT Comparison, Stability AI Inpainting, Qwen3-VL, SAM 3, Keypoint Detection Model, Relative Static Crop, SIFT, Blur Visualization, Multi-Label Classification Model
Input and Output Bindings
The available connections depend on the block's binding kinds. Check which binding kinds
Stability AI Image Generation in version v1 has.
Bindings
- input
  - image (image): The image to use as the starting point for the generation.
  - strength (float_zero_to_one): Controls how much influence the image parameter has on the generated image. A value of 0 yields an image identical to the input; a value of 1 is as if no image had been passed at all.
  - prompt (string): Prompt to generate new images from text (what you wish to see).
  - negative_prompt (string): Negative prompt for the image generation model (what you do not wish to see).
  - model (string): Choose one of {'core', 'ultra', 'sd3'}. Default: 'core'.
  - api_key (Union[string, secret]): Your Stability AI API key.
- output
  - image (image): Image in workflows.
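To show where these bindings sit in a complete workflow, here is a hedged Python sketch that wraps the step in a workflow specification and routes its `image` output to a workflow output. The top-level layout (`version`, `inputs`, `steps`, `outputs`) and the `WorkflowImage`, `WorkflowParameter`, and `JsonField` type names are assumptions drawn from the general Workflows definition format, not from this page, so verify them against the Workflows documentation. The step, input, and output names are hypothetical.

```python
import json

# Assumed workflow layout; only the step's "type" and property names
# come from this page.
workflow = {
    "version": "1.0",
    "inputs": [
        {"type": "WorkflowImage", "name": "image"},
        {"type": "WorkflowParameter", "name": "prompt"},
        {"type": "WorkflowParameter", "name": "stability_api_key"},
    ],
    "steps": [
        {
            "name": "image_gen",  # hypothetical step name
            "type": "roboflow_core/stability_ai_image_gen@v1",
            "image": "$inputs.image",
            "strength": 0.3,
            "prompt": "$inputs.prompt",
            "negative_prompt": "blurry, low quality",
            "model": "core",
            "api_key": "$inputs.stability_api_key",
        }
    ],
    "outputs": [
        {
            "type": "JsonField",
            "name": "generated_image",  # hypothetical output name
            # Refers to the block's single output binding, image (image).
            "selector": "$steps.image_gen.image",
        }
    ],
}

# Serialise to JSON, the form in which workflow definitions are exchanged.
print(json.dumps(workflow, indent=2))
```

Because the output kind is `image`, the same `$steps.image_gen.image` selector could instead feed the `image` property of any block listed under outputs above.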
Example JSON definition of step Stability AI Image Generation in version v1:
{
  "name": "<your_step_name_here>",
  "type": "roboflow_core/stability_ai_image_gen@v1",
  "image": "$inputs.image",
  "strength": 0.3,
  "prompt": "my prompt",
  "negative_prompt": "my negative prompt",
  "model": "core",
  "api_key": "xxx-xxx"
}
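Before submitting a definition like this, it can help to check the two constrained properties locally. The sketch below is a hypothetical helper, not part of any Roboflow or Stability AI library: `validate_step` and `ALLOWED_MODELS` are names invented here, and the checks simply encode the constraints documented above (`model` in {'core', 'ultra', 'sd3'}, `strength` between 0 and 1).

```python
ALLOWED_MODELS = {"core", "ultra", "sd3"}  # values listed in the Properties table


def validate_step(step):
    """Raise ValueError if a literal 'model' or 'strength' is out of range.

    Values starting with '$' are dynamic references resolved at workflow
    runtime, so they cannot be checked here and are skipped.
    """
    model = step.get("model")
    if isinstance(model, str) and not model.startswith("$"):
        if model not in ALLOWED_MODELS:
            raise ValueError(f"model must be one of {sorted(ALLOWED_MODELS)}, got {model!r}")

    strength = step.get("strength")
    if isinstance(strength, (int, float)) and not 0.0 <= strength <= 1.0:
        raise ValueError(f"strength must be between 0 and 1, got {strength}")

    return step


step = {
    "name": "image_gen",  # hypothetical step name
    "type": "roboflow_core/stability_ai_image_gen@v1",
    "image": "$inputs.image",
    "strength": 0.3,
    "prompt": "a lighthouse at dusk",
    "negative_prompt": "blurry, low quality",
    "model": "core",
    "api_key": "xxx-xxx",
}
validate_step(step)  # passes silently

try:
    validate_step({**step, "model": "turbo"})
except ValueError as error:
    print(error)
```

The helper only catches mistakes in literal values; anything bound through a selector is still validated by the workflow engine at runtime.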