feat: ggml-alloc integration and gpu acceleration (#75)

* set ggml url to FSSRepo/ggml

* ggml-alloc integration

* offload all functions to gpu

* gguf format + native converter

* merge custom vae to a model

* full offload to gpu

* improve pretty progress

---------

Co-authored-by: leejet <leejet714@gmail.com>
Steward Garcia
2023-11-26 06:02:36 -05:00
committed by GitHub
parent c874063408
commit 8124588cf1
29 changed files with 120774 additions and 2754 deletions

.gitmodules (4 lines changed)

@@ -1,3 +1,3 @@
 [submodule "ggml"]
-path = ggml
-url = https://github.com/leejet/ggml.git
+path = ggml
+url = https://github.com/FSSRepo/ggml.git
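
The hunk above repoints the `ggml` submodule at a different remote. As a minimal sketch of how to confirm which URL a `.gitmodules` file declares, the snippet below recreates the post-commit file in a throwaway directory and reads the value back with `git config`. The temporary directory, the tab indentation, and the variable names `workdir` and `submodule_url` are assumptions for illustration, not part of the commit.

```shell
# Work in a throwaway directory so no real checkout is touched (illustrative only).
workdir="$(mktemp -d)"
cd "$workdir"

# Recreate the post-commit .gitmodules shown in the diff (tab indentation assumed).
printf '[submodule "ggml"]\n\tpath = ggml\n\turl = https://github.com/FSSRepo/ggml.git\n' > .gitmodules

# Read the declared submodule URL back using git's own config parser.
submodule_url="$(git config -f .gitmodules --get submodule.ggml.url)"
echo "$submodule_url"
```

In an existing clone, `git submodule sync` followed by `git submodule update --init --recursive` is the usual way to pick up a changed submodule URL. The commit does not say whether this repository needs any further steps.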