feat: ggml-alloc integration and gpu acceleration (#75)

* set ggml url to FSSRepo/ggml

* ggml-alloc integration

* offload all functions to gpu

* gguf format + native converter

* merge custom vae to a model

* full offload to gpu

* improve pretty progress

---------

Co-authored-by: leejet <leejet714@gmail.com>
This commit is contained in:
Steward Garcia
2023-11-26 06:02:36 -05:00
committed by GitHub
parent c874063408
commit 8124588cf1
29 changed files with 120774 additions and 2754 deletions

8
.gitignore vendored
View File

@@ -1,6 +1,12 @@
build*/
test/
.vscode/
.cache/
*.swp
.vscode/
*.bat
*.bin
*.exe
*.gguf
output.png
models/*