Train
Managed LoRA / fine-tune services on top of trainable models, plus self-host on rented H100s. Pick by provider, model, or cost.
est. wall-clock 0.28h
provider
managed training services · 13
cheapest first · click row → service| model | vendor | provider | type | $/1k steps | est. total (1.0k) | notes |
|---|---|---|---|---|---|---|
| wan-2.7-image-pro | Alibaba | self-host-vast | LoRA | $0.50 | $0.50 | |
| wan-2.7-image | Alibaba | self-host-vast | LoRA | $0.50 | $0.50 | |
| qwen-image-2.0 | Alibaba | self-host-vast | LoRA | $0.50 | $0.50 | |
| flux-2-klein-4b | BFL | self-host-vast | LoRA | $0.50 | $0.50 | Apache 2.0 — fully commercial |
| qwen-image-2.0-pro | Alibaba | fal | LoRA | $0.95 | $0.95 | |
| qwen-image-2.0 | Alibaba | fal | LoRA | $0.95 | $0.95 | |
| flux-2-klein-9b | BFL | self-host-vast | LoRA | $1.00 | $1.00 | |
| bagel | ByteDance | self-host-vast | LoRA + FT | $2.00 | $2.00 | |
| step1x-edit | StepFun | self-host-vast | LoRA | $2.00 | $2.00 | |
| flux-1-kontext-dev | BFL | fal | LoRA | $2.50 | $2.50 | |
| qwen-image-edit | Alibaba | fal | LoRA | $4.00 | $4.00 | |
| flux-2-dev | BFL | fal | LoRA | $8.00 | $8.00 | |
| hunyuan-image-3.0-instruct | Tencent | self-host | LoRA | $75.00 | $75.00 | 80B MoE — needs multi-H100 |
self-host on H100 · estimated cost for 8B × 1,000 steps
spot pricing| provider | gpu | kind | $/hr | total (0.28h) | region | notes |
|---|---|---|---|---|---|---|
| Vast.ai | H100 80GB | marketplace | $0.34 | $0.09 | global | Market floor. Reliability varies by host. |
| Spheron / GMI | H100 80GB | marketplace | $0.47 | $0.13 | global | Decentralized GPU marketplace. |
| RunPod | H100 SXM 80GB | cloud | $1.19 | $0.33 | global | |
| SF Compute | H100 80GB | cluster | $1.96 | $0.54 | us-sfo | Vetted clusters, sell-back unused. B300s announced for fall 2026. |
| RunPod | H100 PCIe 80GB | cloud | $1.99 | $0.55 | global | |
| AWS | H100 80GB (p5) | cloud | $2.50 | $0.69 | global | Was $6.88; repriced mid-2025. $1.90/hr with 3-yr SP. |
| CoreWeave | H100 80GB (8x HGX) | cloud | $2.83 | $0.79 | us | Per-GPU price in 8x node; $49.24/hr for full node |
| Lambda | H100 SXM 80GB | cloud | $2.99 | $0.83 | us+eu | Reserved 1-yr: $1.89/hr |
| Modal | H100 80GB | serverless | $3.95 | $1.10 | global | Preemptible base; non-preempt ~3.75×. Includes Modal Sandbox runtime. |
| Replicate (raw GPU) | H100 80GB (8x) | serverless | $5.49 | $1.53 | us | Per-GPU in 8x cluster; $43.92/hr full node. |
how this works
Managed — provider runs the GPU + trainer code; you upload a dataset + config, get back LoRA weights. Best for low-volume, no-ops use.
Self-host — you rent the GPU and run ai-toolkit / kohya / diffusion-pipe. Best at high volume — Vast.ai spot H100 ≈ $0.34/hr beats fal's $4/1k Qwen-Edit trainer at > 10 LoRAs/month if you've automated the loop.