common.title

Docs
Quantum Circuit
TYTAN CLOUD

QUANTUM GAMING


autoQAOA
Desktop RAG

Overview
Terms of service

Privacy policy

Contact
Research

Sign in
Sign up
common.title

Stable Diffusion speed comparison between Forge and Automatic1111, using T4 and H100 GPUs

Yuichiro Minato

2024/03/18 00:08

Hello, I wanted to try using Stable Diffusion, so I ran some benchmarks.

I used a random cityscape as the prompt and tried out various models.

It feels like there are slight differences in the fine details depending on the machine and tool used, but overall, they seemed quite similar.

The combinations of machines I used involved GPUs: T4 and H100. I only used Automatic1111 with the H100.

Automatic1111

https://github.com/AUTOMATIC1111/stable-diffusion-webui

Forge

https://github.com/lllyasviel/stable-diffusion-webui-forge

T4 + Automatic1111 / 640*360

T4 + Forge / 640*360

H100 + Automatic1111 / 640*360

The result for 640x360 pixels are

And I tried larger images.

T4 + Automatic1111 / 1920*1080

T4 + Forge / 1920*1080

H100 + Automatic1111

The benchmark is,

I feel like the images are quite different as well, but I did make sure to use a fixed seed for the runs.

The H100 is overwhelmingly faster, as expected. I feel like using Forge might make it even faster.

I wonder about the cost of providing such services for the companies involved...

I plan to continue benchmarking little by little in the future.

© 2025, blueqat Inc. All rights reserved