stable-diffusion.cpp

Author	SHA1	Message	Date
leejet	2e79a82f85	refactor: reorganize code and use c api (#133 )	2024-01-01 16:22:18 +08:00
Steward Garcia	004dfbef27	feat: implement ESRGAN upscaler + Metal Backend (#104 ) * add esrgan upscaler * add sd_tiling * support metal backend * add clip_skip --------- Co-authored-by: leejet <leejet714@gmail.com>	2023-12-28 23:46:48 +08:00
旺旺碎冰冰	0e64238e4c	feat: implement the complete bpe function (#119 ) * implement the complete bpe function --------- Co-authored-by: leejet <leejet714@gmail.com>	2023-12-23 12:11:07 +08:00
Steward Garcia	134883aec4	feat: add TAESD implementation - faster autoencoder (#88 ) * add taesd implementation * taesd gpu offloading * show seed when generating image with -s -1 * less restrictive with larger images * cuda: im2col speedup x2 * cuda: group norm speedup x90 * quantized models now works in cuda :) * fix cal mem size --------- Co-authored-by: leejet <leejet714@gmail.com>	2023-12-05 22:40:03 +08:00
leejet	8a87b273ad	fix: allow model and vae using different format	2023-12-03 17:12:04 +08:00
leejet	d7af2c2ba9	feat: load weights from safetensors and ckpt (#101 )	2023-12-03 15:47:20 +08:00
旺旺碎冰冰	47dd704198	fix: avoid build fail on msvc (#93 )	2023-11-28 20:49:11 +08:00
Steward Garcia	8124588cf1	feat: ggml-alloc integration and gpu acceleration (#75 ) * set ggml url to FSSRepo/ggml * ggml-alloc integration * offload all functions to gpu * gguf format + native converter * merge custom vae to a model * full offload to gpu * improve pretty progress --------- Co-authored-by: leejet <leejet714@gmail.com>	2023-11-26 19:02:36 +08:00
leejet	176a00b606	chore: add .clang-format	2023-11-19 19:35:33 +08:00
leejet	9a9f3daf8e	feat: add LoRA support	2023-11-19 17:43:49 +08:00
leejet	536f3af672	feat: add lcm sampler support This referenced an issue discussion of the stable-diffusion-webui at https://github.com/AUTOMATIC1111/stable-diffusion-webui/issues/13952, which may not be too perfect.	2023-11-17 22:53:46 +08:00
Urs Ganse	3a25179d52	feat: add DPM2 and DPM++(2s) a samplers (#56 ) * Add DPM2 sampler. * Add DPM++ (2s) a sampler. * Update README.md with added samplers --------- Co-authored-by: leejet <leejet714@gmail.com>	2023-09-12 23:02:09 +08:00
Urs Ganse	968fbf02aa	feat: add option to switch the sigma schedule (#51 ) Concretely, this allows switching to the "Karras" schedule from the Karras et al 2022 paper, equivalent to the samplers marked as "Karras" in the AUTOMATIC1111 WebUI. This choice is in principle orthogonal to the sampler choice and can be given independently.	2023-09-09 00:02:07 +08:00
Urs Ganse	b6899e8fc2	feat: add Euler, Heun and DPM++ (2M) samplers (#50 ) * Add Euler sampler * Add Heun sampler * Add DPM++ (2M) sampler * Add modified DPM++ (2M) "v2" sampler. This was proposed in a issue discussion of the stable diffusion webui, at https://github.com/AUTOMATIC1111/stable-diffusion-webui/discussions/8457 and apparently works around overstepping of the DPM++ (2M) method with small step counts. The parameter is called dpmpp2mv2 here. * match code style --------- Co-authored-by: Urs Ganse <urs@nerd2nerd.org> Co-authored-by: leejet <leejet714@gmail.com>	2023-09-08 23:47:28 +08:00
leejet	45842865ff	fix: seed should be 64 bit	2023-09-03 20:08:22 +08:00
leejet	e5a7aec252	feat: add CUDA RNG	2023-09-03 19:24:07 +08:00
leejet	8f34dd7cc7	perf: free unused params immediately to reduce memory usage	2023-08-17 00:55:36 +08:00
leejet	58735a2813	feat: add img2img mode (#5 )	2023-08-16 01:48:07 +08:00
leejet	3aca342e60	Initial commit	2023-08-13 16:00:22 +08:00

19 Commits