stable-diffusion.cpp

Author	SHA1	Message	Date
leejet	4a8190405a	fix: fix the issue with dynamic linking	2024-02-25 21:39:01 +08:00
leejet	730585d515	sync: update ggml (#180 )	2024-02-25 21:11:01 +08:00
Sean Bailey	193fb620b1	feat: add capability to repeatedly run the upscaler in a row (#174 ) * Add in upscale repeater logic --------- Co-authored-by: leejet <leejet714@gmail.com>	2024-02-24 21:31:01 +08:00
leejet	b6368868d9	feat: introduce GGMLBlock and implement SVD(Broken) (#159 ) * introduce GGMLBlock and implement SVD(Broken) * add sdxl vae warning	2024-02-24 20:06:39 +08:00
leejet	349439f239	style: format code	2024-01-29 23:05:18 +08:00
Steward Garcia	36ec16ac99	feat: Control Net support + Textual Inversion (embeddings) (#131 ) * add controlnet to pipeline * add cli params * control strength cli param * cli param keep controlnet in cpu * add Textual Inversion * add canny preprocessor * refactor: change ggml_type_sizef to ggml_row_size * process hint once time * ignore the embedding name case --------- Co-authored-by: leejet <leejet714@gmail.com>	2024-01-29 22:38:51 +08:00
旺旺碎冰冰	c6071fa82f	feat: add hipBlas support (#94 )	2024-01-14 11:53:42 +08:00
leejet	5c614e4bc2	feat: add convert api (#142 )	2024-01-14 11:43:24 +08:00
leejet	2b6ec97fe2	sync: update ggml (#134 )	2024-01-05 23:18:41 +08:00
leejet	db382348cc	fix: change GGML_MAX_NAME to 128	2024-01-03 22:42:42 +08:00
leejet	7cb41b190f	fix: avoid encountering 'std::set undefined' in some environments	2024-01-02 22:37:01 +08:00
leejet	7fb8a51318	chore: make SD_BUILD_DLL visible only to SD_LIB	2024-01-02 22:31:40 +08:00
leejet	2c5f3fc53a	chore: add support for building shared library	2024-01-02 21:05:44 +08:00
Erik Scholz	f2e4d9793b	fix: avoid some memory leaks (#136 ) --------- Co-authored-by: leejet <leejet714@gmail.com>	2024-01-01 23:27:29 +08:00
Erik Scholz	4a5e7b58e2	fix: never use a log message as a format string (#135 )	2024-01-01 20:43:47 +08:00
leejet	2e79a82f85	refactor: reorganize code and use c api (#133 )	2024-01-01 16:22:18 +08:00
leejet	b139434b57	docs: update README.md	2023-12-31 11:48:41 +08:00
leejet	14da17a923	fix: initialize some pointers to NULL	2023-12-30 14:24:45 +08:00
leejet	78ad76f3f4	feat: add SDXL support (#117 ) * add SDXL support * fix the issue with generating large images	2023-12-29 00:16:10 +08:00
Steward Garcia	004dfbef27	feat: implement ESRGAN upscaler + Metal Backend (#104 ) * add esrgan upscaler * add sd_tiling * support metal backend * add clip_skip --------- Co-authored-by: leejet <leejet714@gmail.com>	2023-12-28 23:46:48 +08:00
旺旺碎冰冰	0e64238e4c	feat: implement the complete bpe function (#119 ) * implement the complete bpe function --------- Co-authored-by: leejet <leejet714@gmail.com>	2023-12-23 12:11:07 +08:00
leejet	8f6b4a39d6	fix: enhance the tokenizer's handing of Unicode (#120 )	2023-12-21 00:22:03 +08:00
Kreijstal	9842a3f819	fix: add support for int32_t on other compilers (#114 )	2023-12-11 23:32:39 +08:00
leejet	ac8f5a044c	feat: add SD-Turbo support	2023-12-10 13:15:09 +08:00
Sam Jones	ca33304318	fix: remove dangling pointer to work_output in CLIPTextModel (#111 )	2023-12-10 10:05:02 +08:00
leejet	69efe3ce2b	chore: make code cleaner	2023-12-09 17:35:10 +08:00
leejet	2eac844bbd	fix: generate image correctly in img2img mode	2023-12-09 14:39:43 +08:00
leejet	968226abb2	docs: update v2-1_768-nonema-pruned.safetensors url	2023-12-05 22:52:19 +08:00
Steward Garcia	134883aec4	feat: add TAESD implementation - faster autoencoder (#88 ) * add taesd implementation * taesd gpu offloading * show seed when generating image with -s -1 * less restrictive with larger images * cuda: im2col speedup x2 * cuda: group norm speedup x90 * quantized models now works in cuda :) * fix cal mem size --------- Co-authored-by: leejet <leejet714@gmail.com>	2023-12-05 22:40:03 +08:00
leejet	f99bcd1f76	fix: detect model format base on file content	2023-12-03 20:30:31 +08:00
leejet	8a87b273ad	fix: allow model and vae using different format	2023-12-03 17:12:04 +08:00
leejet	d7af2c2ba9	feat: load weights from safetensors and ckpt (#101 )	2023-12-03 15:47:20 +08:00
旺旺碎冰冰	47dd704198	fix: avoid build fail on msvc (#93 )	2023-11-28 20:49:11 +08:00
Erik Scholz	f469b835a3	fix: reading memory of stack allocated object past its scope (#91 )	2023-11-27 21:37:12 +08:00
Steward Garcia	8124588cf1	feat: ggml-alloc integration and gpu acceleration (#75 ) * set ggml url to FSSRepo/ggml * ggml-alloc integration * offload all functions to gpu * gguf format + native converter * merge custom vae to a model * full offload to gpu * improve pretty progress --------- Co-authored-by: leejet <leejet714@gmail.com>	2023-11-26 19:02:36 +08:00
Erik Scholz	c874063408	fix: support bf16 lora weights (#82 )	2023-11-20 22:34:17 +08:00
Urs Ganse	ae1d5dcebb	feat: allow LoRAs with negative multiplier (#83 ) * Allow Loras with negative weight, too. There are a couple of loras, which serve to adjust certain concepts in both positive and negative directions (like exposure, detail level etc). The current code rejects them if loaded with a negative weight, but I suggest that this check can simply be dropped. * ignore lora in the case of multiplier == 0.f --------- Co-authored-by: Urs Ganse <urs@nerd2nerd.org> Co-authored-by: leejet <leejet714@gmail.com>	2023-11-20 22:23:52 +08:00
leejet	51b53d4cb1	chore: typo remote => remove	2023-11-19 23:21:49 +08:00
leejet	0d9b801aaa	fix: fix multi loras prompt parse	2023-11-19 23:19:37 +08:00
leejet	176a00b606	chore: add .clang-format	2023-11-19 19:35:33 +08:00
leejet	64f6002457	docs: add contributors info to README.md	2023-11-19 18:35:19 +08:00
leejet	9a9f3daf8e	feat: add LoRA support	2023-11-19 17:43:49 +08:00
leejet	536f3af672	feat: add lcm sampler support This referenced an issue discussion of the stable-diffusion-webui at https://github.com/AUTOMATIC1111/stable-diffusion-webui/issues/13952, which may not be too perfect.	2023-11-17 22:53:46 +08:00
leejet	3bf1665885	chore: clear the msvc compilation warning	2023-10-28 20:55:24 +08:00
leejet	3001c23f7d	perf: change ggml graph eval order to RIGHT_TO_LEFT to optimize memory usage	2023-10-28 20:19:15 +08:00
leejet	ed374983f3	fix: set eps of ggml_norm(LayerNorm) to 1e-5	2023-10-27 00:50:23 +08:00
leejet	4c96185fcc	fix: update ggml to avoid insufficient memory error on macOS	2023-10-24 22:04:00 +08:00
leejet	fbd18e1059	fix: avoid stack overflow on MSVC	2023-10-23 21:10:46 +08:00
leejet	09cab2a2ae	chore: set default BUILD_SHARED_LIBS to OFF	2023-10-22 14:59:03 +08:00
leejet	69e54ace14	sync: update ggml	2023-10-22 14:11:06 +08:00

1 2

96 Commits