Spaces:
Running
Running
Update README.md (#1)
Browse files- Update README.md (7f71e78466dcda9d3b15d51f25f4b25a015f36fd)
Co-authored-by: Vaibhav Srivastav <reach-vb@users.noreply.huggingface.co>
README.md
CHANGED
|
@@ -7,4 +7,18 @@ sdk: static
|
|
| 7 |
pinned: false
|
| 8 |
---
|
| 9 |
|
| 10 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 7 |
pinned: false
|
| 8 |
---
|
| 9 |
|
| 10 |
+
# GGML - AI at the edge
|
| 11 |
+
|
| 12 |
+
ggml is a tensor library for machine learning to enable large models and high performance on commodity hardware. It is used by [llama.cpp](https://github.com/ggerganov/llama.cpp) and [whisper.cpp](https://github.com/ggerganov/whisper.cpp)
|
| 13 |
+
|
| 14 |
+
Written in C
|
| 15 |
+
- 16-bit float support
|
| 16 |
+
- Integer quantization support (e.g. 4-bit, 5-bit, 8-bit)
|
| 17 |
+
- Automatic differentiation
|
| 18 |
+
- Built-in optimization algorithms (e.g. ADAM, L-BFGS)
|
| 19 |
+
- Optimized for Apple Silicon
|
| 20 |
+
- On x86 architectures utilizes AVX / AVX2 intrinsics
|
| 21 |
+
- Web support via WebAssembly and WASM SIMD
|
| 22 |
+
- No third-party dependencies
|
| 23 |
+
- Zero memory allocations during runtime
|
| 24 |
+
- Guided language output support
|