Stability AI Image Generation
Class: StabilityAIImageGenBlockV1
The block wraps the Stability AI image generation API and lets users generate new images from text or create variations of existing images.
Type identifier
Use the following identifier in the step "type" field to add the block as a step in your workflow: roboflow_core/stability_ai_image_gen@v1
Properties
Name | Type | Description | Refs |
---|---|---|---|
name | str | Enter a unique identifier for this step. | ❌ |
strength | float | Controls how much influence the image parameter has on the generated image. A value of 0 would yield an image that is identical to the input. A value of 1 would be as if you passed in no image at all. | ✅ |
prompt | str | Prompt to generate new images from text (what you wish to see). | ✅ |
negative_prompt | str | Negative prompt to image generation model (what you do not wish to see). | ✅ |
model | str | Choose one of {'core', 'ultra', 'sd3'}. Default 'core'. | ✅ |
api_key | str | Your Stability AI API key. | ✅ |
The Refs column indicates whether the property can be parametrised with dynamic values available at workflow runtime. See Bindings for more info.
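As a rough illustration of what the Refs column means, the sketch below builds a step definition in Python where some properties marked ✅ are bound to runtime inputs through selectors, while `name` (marked ❌) stays a literal. The helper `build_step` and the input names `prompt` and `api_key` are hypothetical and exist only for this example; the `type` identifier, the property names, and the `$inputs.image` selector form are taken from this page.

```python
import json

# Properties that the Properties table marks with ✅ in the Refs column.
REF_CAPABLE = {"strength", "prompt", "negative_prompt", "model", "api_key"}


def build_step(name: str, **properties) -> dict:
    """Hypothetical helper: assemble a step dict for this block.

    Values starting with "$" are treated as selectors (dynamic references);
    anything else is passed through as a literal value.
    """
    if name.startswith("$"):
        # `name` is marked ❌, so it has to be a literal string.
        raise ValueError("'name' cannot be a dynamic reference")
    unknown = set(properties) - REF_CAPABLE - {"image"}
    if unknown:
        raise ValueError(f"unknown properties: {sorted(unknown)}")
    return {
        "name": name,
        "type": "roboflow_core/stability_ai_image_gen@v1",
        **properties,
    }


step = build_step(
    "generate",
    image="$inputs.image",
    prompt="$inputs.prompt",    # resolved at runtime (assumed input name)
    strength=0.3,               # literal value fixed in the definition
    model="core",
    api_key="$inputs.api_key",  # assumed input name
)
print(json.dumps(step, indent=2))
```

Mixing literals and selectors this way keeps stable settings in the definition and leaves per-request values, such as the prompt, to be supplied when the workflow runs.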
Available Connections
Compatible Blocks
Check what blocks you can connect to Stability AI Image Generation in version v1.
- inputs: Crop Visualization, Keypoint Detection Model, Ellipse Visualization, VLM as Classifier, Stability AI Inpainting, Stability AI Image Generation, Blur Visualization, Circle Visualization, Pixelate Visualization, Model Comparison Visualization, Webhook Sink, Stability AI Outpainting, Detections Consensus, Bounding Box Visualization, Grid Visualization, Identify Outliers, Background Color Visualization, Llama 3.2 Vision, Image Contours, LMM, Object Detection Model, Twilio SMS Notification, Label Visualization, Triangle Visualization, Dynamic Crop, Image Preprocessing, Florence-2 Model, Google Vision OCR, Florence-2 Model, Keypoint Visualization, OCR Model, Halo Visualization, Corner Visualization, Line Counter Visualization, LMM For Classification, Dot Visualization, Anthropic Claude, Slack Notification, Roboflow Dataset Upload, Icon Visualization, Roboflow Custom Metadata, Depth Estimation, CogVLM, Polygon Zone Visualization, Stitch Images, Image Slicer, OpenAI, Email Notification, Relative Static Crop, Google Gemini, SIFT Comparison, Image Slicer, Image Threshold, Perspective Correction, Camera Focus, Camera Calibration, Classification Label Visualization, Reference Path Visualization, Color Visualization, Model Monitoring Inference Aggregator, Instance Segmentation Model, Local File Sink, OpenAI, Clip Comparison, Roboflow Dataset Upload, Mask Visualization, QR Code Generator, OpenAI, CSV Formatter, Stitch OCR Detections, SIFT, Polygon Visualization, Image Convert Grayscale, VLM as Detector, Image Blur, Multi-Label Classification Model, Trace Visualization, Absolute Static Crop, Identify Changes, Single-Label Classification Model
- outputs: Crop Visualization, Keypoint Detection Model, Ellipse Visualization, VLM as Classifier, Stability AI Inpainting, Stability AI Image Generation, Blur Visualization, Keypoint Detection Model, Pixelate Visualization, Circle Visualization, YOLO-World Model, Model Comparison Visualization, Stability AI Outpainting, Single-Label Classification Model, Bounding Box Visualization, VLM as Classifier, Background Color Visualization, Dominant Color, Llama 3.2 Vision, Image Contours, LMM, Object Detection Model, Label Visualization, Florence-2 Model, Triangle Visualization, Dynamic Crop, VLM as Detector, Detections Stitch, Google Vision OCR, Florence-2 Model, Instance Segmentation Model, Keypoint Visualization, QR Code Detection, OCR Model, Halo Visualization, Multi-Label Classification Model, Corner Visualization, Line Counter Visualization, SmolVLM2, LMM For Classification, CLIP Embedding Model, Time in Zone, Perception Encoder Embedding Model, Template Matching, Dot Visualization, Clip Comparison, Anthropic Claude, Roboflow Dataset Upload, Icon Visualization, Depth Estimation, CogVLM, Polygon Zone Visualization, Stitch Images, Image Slicer, OpenAI, Relative Static Crop, Segment Anything 2 Model, Google Gemini, SIFT Comparison, Buffer, Pixel Color Count, Image Slicer, Gaze Detection, Image Threshold, Perspective Correction, Byte Tracker, Camera Focus, Qwen2.5-VL, Classification Label Visualization, Instance Segmentation Model, Reference Path Visualization, Color Visualization, Barcode Detection, OpenAI, Camera Calibration, Clip Comparison, Roboflow Dataset Upload, Mask Visualization, OpenAI, SIFT, Polygon Visualization, VLM as Detector, Detections Stabilizer, Image Convert Grayscale, Moondream2, Multi-Label Classification Model, Image Blur, Trace Visualization, Absolute Static Crop, Image Preprocessing, Single-Label Classification Model, Object Detection Model
Input and Output Bindings
The connections available to a block depend on its binding kinds. Check what binding kinds Stability AI Image Generation in version v1 has.
Bindings
- input
    - image (image): The image to use as the starting point for the generation.
    - strength (float_zero_to_one): Controls how much influence the image parameter has on the generated image. A value of 0 would yield an image that is identical to the input. A value of 1 would be as if you passed in no image at all.
    - prompt (string): Prompt to generate new images from text (what you wish to see).
    - negative_prompt (string): Negative prompt to image generation model (what you do not wish to see).
    - model (string): Choose one of {'core', 'ultra', 'sd3'}. Default 'core'.
    - api_key (Union[string, secret]): Your Stability AI API key.
- output
    - image (image): Image in workflows.
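The binding kinds above imply some simple constraints on literal values: `strength` should fall between 0 and 1, and `model` should be one of the three listed names. The following is a minimal sketch of a pre-flight check under those assumptions. The function `check_step` is hypothetical and is not part of any library; values that look like selectors (strings starting with `$`) are skipped because they are only known at runtime.

```python
ALLOWED_MODELS = {"core", "ultra", "sd3"}


def is_selector(value) -> bool:
    """Treat strings starting with '$' as runtime references."""
    return isinstance(value, str) and value.startswith("$")


def check_step(step: dict) -> list:
    """Hypothetical check: return a list of problems found in literal values."""
    errors = []

    strength = step.get("strength")
    if strength is not None and not is_selector(strength):
        is_number = isinstance(strength, (int, float)) and not isinstance(strength, bool)
        if not is_number or not 0.0 <= strength <= 1.0:
            errors.append(f"strength must be a number in [0, 1], got {strength!r}")

    model = step.get("model")
    if model is not None and not is_selector(model) and model not in ALLOWED_MODELS:
        errors.append(f"model must be one of {sorted(ALLOWED_MODELS)}, got {model!r}")

    return errors


good = {"strength": 0.3, "model": "core"}
bad = {"strength": 1.5, "model": "my prompt"}
dynamic = {"strength": "$inputs.strength", "model": "$inputs.model"}

print(check_step(good))     # no problems expected
print(check_step(bad))      # two problems expected
print(check_step(dynamic))  # selectors are not checked here
```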
Example JSON definition of step Stability AI Image Generation in version v1:
{
    "name": "<your_step_name_here>",
    "type": "roboflow_core/stability_ai_image_gen@v1",
    "image": "$inputs.image",
    "strength": 0.3,
    "prompt": "my prompt",
    "negative_prompt": "my prompt",
    "model": "core",
    "api_key": "xxx-xxx"
}
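To show where a step like this might sit, here is a hedged sketch of a complete workflow definition assembled as a Python dictionary and serialised to JSON. The surrounding structure (`version`, `inputs`, `outputs`, the `WorkflowImage`, `WorkflowParameter` and `JsonField` type names, and the `$steps.<step_name>.image` selector) is an assumption about the Workflows definition format and is not stated on this page; the step fields themselves follow the properties documented above. Check the Workflows documentation before relying on the outer structure.

```python
import json

STEP_NAME = "generate"  # hypothetical step name

definition = {
    "version": "1.0",  # assumed definition version field
    "inputs": [
        # Assumed input declarations; the names are illustrative.
        {"type": "WorkflowImage", "name": "image"},
        {"type": "WorkflowParameter", "name": "prompt"},
        {"type": "WorkflowParameter", "name": "api_key"},
    ],
    "steps": [
        {
            "name": STEP_NAME,
            "type": "roboflow_core/stability_ai_image_gen@v1",
            "image": "$inputs.image",
            "strength": 0.3,
            "prompt": "$inputs.prompt",
            "negative_prompt": "blurry, low quality",
            "model": "core",
            "api_key": "$inputs.api_key",
        }
    ],
    "outputs": [
        # Assumed output declaration exposing the block's `image` output.
        {
            "type": "JsonField",
            "name": "generated_image",
            "selector": f"$steps.{STEP_NAME}.image",
        }
    ],
}

print(json.dumps(definition, indent=2))
```

Passing the API key as a runtime input rather than a literal keeps the secret out of the stored definition, which fits the `secret` kind listed for `api_key` in the bindings.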