leejet
4a8190405a
fix: fix the issue with dynamic linking
2024-02-25 21:39:01 +08:00
leejet
730585d515
sync: update ggml ( #180 )
2024-02-25 21:11:01 +08:00
Sean Bailey
193fb620b1
feat: add capability to repeatedly run the upscaler in a row ( #174 )
...
* Add in upscale repeater logic
---------
Co-authored-by: leejet <leejet714@gmail.com>
2024-02-24 21:31:01 +08:00
leejet
b6368868d9
feat: introduce GGMLBlock and implement SVD(Broken) ( #159 )
...
* introduce GGMLBlock and implement SVD(Broken)
* add sdxl vae warning
2024-02-24 20:06:39 +08:00
leejet
349439f239
style: format code
2024-01-29 23:05:18 +08:00
Steward Garcia
36ec16ac99
feat: Control Net support + Textual Inversion (embeddings) ( #131 )
...
* add controlnet to pipeline
* add cli params
* control strength cli param
* cli param keep controlnet in cpu
* add Textual Inversion
* add canny preprocessor
* refactor: change ggml_type_sizef to ggml_row_size
* process hint once time
* ignore the embedding name case
---------
Co-authored-by: leejet <leejet714@gmail.com>
2024-01-29 22:38:51 +08:00
旺旺碎冰冰
c6071fa82f
feat: add hipBlas support ( #94 )
2024-01-14 11:53:42 +08:00
leejet
5c614e4bc2
feat: add convert api ( #142 )
2024-01-14 11:43:24 +08:00
leejet
2b6ec97fe2
sync: update ggml ( #134 )
2024-01-05 23:18:41 +08:00
leejet
db382348cc
fix: change GGML_MAX_NAME to 128
2024-01-03 22:42:42 +08:00
leejet
7cb41b190f
fix: avoid encountering 'std::set undefined' in some environments
2024-01-02 22:37:01 +08:00
leejet
7fb8a51318
chore: make SD_BUILD_DLL visible only to SD_LIB
2024-01-02 22:31:40 +08:00
leejet
2c5f3fc53a
chore: add support for building shared library
2024-01-02 21:05:44 +08:00
Erik Scholz
f2e4d9793b
fix: avoid some memory leaks ( #136 )
...
---------
Co-authored-by: leejet <leejet714@gmail.com>
2024-01-01 23:27:29 +08:00
Erik Scholz
4a5e7b58e2
fix: never use a log message as a format string ( #135 )
2024-01-01 20:43:47 +08:00
leejet
2e79a82f85
refactor: reorganize code and use c api ( #133 )
2024-01-01 16:22:18 +08:00
leejet
b139434b57
docs: update README.md
2023-12-31 11:48:41 +08:00
leejet
14da17a923
fix: initialize some pointers to NULL
2023-12-30 14:24:45 +08:00
leejet
78ad76f3f4
feat: add SDXL support ( #117 )
...
* add SDXL support
* fix the issue with generating large images
2023-12-29 00:16:10 +08:00
Steward Garcia
004dfbef27
feat: implement ESRGAN upscaler + Metal Backend ( #104 )
...
* add esrgan upscaler
* add sd_tiling
* support metal backend
* add clip_skip
---------
Co-authored-by: leejet <leejet714@gmail.com>
2023-12-28 23:46:48 +08:00
旺旺碎冰冰
0e64238e4c
feat: implement the complete bpe function ( #119 )
...
* implement the complete bpe function
---------
Co-authored-by: leejet <leejet714@gmail.com>
2023-12-23 12:11:07 +08:00
leejet
8f6b4a39d6
fix: enhance the tokenizer's handing of Unicode ( #120 )
2023-12-21 00:22:03 +08:00
Kreijstal
9842a3f819
fix: add support for int32_t on other compilers ( #114 )
2023-12-11 23:32:39 +08:00
leejet
ac8f5a044c
feat: add SD-Turbo support
2023-12-10 13:15:09 +08:00
Sam Jones
ca33304318
fix: remove dangling pointer to work_output in CLIPTextModel ( #111 )
2023-12-10 10:05:02 +08:00
leejet
69efe3ce2b
chore: make code cleaner
2023-12-09 17:35:10 +08:00
leejet
2eac844bbd
fix: generate image correctly in img2img mode
2023-12-09 14:39:43 +08:00
leejet
968226abb2
docs: update v2-1_768-nonema-pruned.safetensors url
2023-12-05 22:52:19 +08:00
Steward Garcia
134883aec4
feat: add TAESD implementation - faster autoencoder ( #88 )
...
* add taesd implementation
* taesd gpu offloading
* show seed when generating image with -s -1
* less restrictive with larger images
* cuda: im2col speedup x2
* cuda: group norm speedup x90
* quantized models now works in cuda :)
* fix cal mem size
---------
Co-authored-by: leejet <leejet714@gmail.com>
2023-12-05 22:40:03 +08:00
leejet
f99bcd1f76
fix: detect model format base on file content
2023-12-03 20:30:31 +08:00
leejet
8a87b273ad
fix: allow model and vae using different format
2023-12-03 17:12:04 +08:00
leejet
d7af2c2ba9
feat: load weights from safetensors and ckpt ( #101 )
2023-12-03 15:47:20 +08:00
旺旺碎冰冰
47dd704198
fix: avoid build fail on msvc ( #93 )
2023-11-28 20:49:11 +08:00
Erik Scholz
f469b835a3
fix: reading memory of stack allocated object past its scope ( #91 )
2023-11-27 21:37:12 +08:00
Steward Garcia
8124588cf1
feat: ggml-alloc integration and gpu acceleration ( #75 )
...
* set ggml url to FSSRepo/ggml
* ggml-alloc integration
* offload all functions to gpu
* gguf format + native converter
* merge custom vae to a model
* full offload to gpu
* improve pretty progress
---------
Co-authored-by: leejet <leejet714@gmail.com>
2023-11-26 19:02:36 +08:00
Erik Scholz
c874063408
fix: support bf16 lora weights ( #82 )
2023-11-20 22:34:17 +08:00
Urs Ganse
ae1d5dcebb
feat: allow LoRAs with negative multiplier ( #83 )
...
* Allow Loras with negative weight, too.
There are a couple of loras, which serve to adjust certain concepts in
both positive and negative directions (like exposure, detail level etc).
The current code rejects them if loaded with a negative weight, but I
suggest that this check can simply be dropped.
* ignore lora in the case of multiplier == 0.f
---------
Co-authored-by: Urs Ganse <urs@nerd2nerd.org>
Co-authored-by: leejet <leejet714@gmail.com>
2023-11-20 22:23:52 +08:00
leejet
51b53d4cb1
chore: typo remote => remove
2023-11-19 23:21:49 +08:00
leejet
0d9b801aaa
fix: fix multi loras prompt parse
2023-11-19 23:19:37 +08:00
leejet
176a00b606
chore: add .clang-format
2023-11-19 19:35:33 +08:00
leejet
64f6002457
docs: add contributors info to README.md
2023-11-19 18:35:19 +08:00
leejet
9a9f3daf8e
feat: add LoRA support
2023-11-19 17:43:49 +08:00
leejet
536f3af672
feat: add lcm sampler support
...
This referenced an issue discussion of the stable-diffusion-webui at
https://github.com/AUTOMATIC1111/stable-diffusion-webui/issues/13952 , which
may not be too perfect.
2023-11-17 22:53:46 +08:00
leejet
3bf1665885
chore: clear the msvc compilation warning
2023-10-28 20:55:24 +08:00
leejet
3001c23f7d
perf: change ggml graph eval order to RIGHT_TO_LEFT to optimize memory usage
2023-10-28 20:19:15 +08:00
leejet
ed374983f3
fix: set eps of ggml_norm(LayerNorm) to 1e-5
2023-10-27 00:50:23 +08:00
leejet
4c96185fcc
fix: update ggml to avoid insufficient memory error on macOS
2023-10-24 22:04:00 +08:00
leejet
fbd18e1059
fix: avoid stack overflow on MSVC
2023-10-23 21:10:46 +08:00
leejet
09cab2a2ae
chore: set default BUILD_SHARED_LIBS to OFF
2023-10-22 14:59:03 +08:00
leejet
69e54ace14
sync: update ggml
2023-10-22 14:11:06 +08:00