Eva-4B-GGUF / README.md
FutureMa's picture
Create README.md
76ea049 verified
|
raw
history blame contribute delete
1.82 kB
---
language:
- en
license: apache-2.0
base_model: FutureMa/Eva-4B
tags:
- finance
- earnings-calls
- financial-nlp
- text-classification
- qwen3
- llama-cpp
- gguf
- gguf-my-repo
pipeline_tag: text-generation
library_name: transformers
---
# FutureMa/Eva-4B-GGUF
This repository hosts **GGUF** files for [`FutureMa/Eva-4B`](https://huggingface.co/FutureMa/Eva-4B), intended for use with [`llama.cpp`](https://github.com/ggerganov/llama.cpp).
- **Base model:** `FutureMa/Eva-4B`
- **Format:** GGUF (for llama.cpp)
- **License:** Apache-2.0
Refer to the [original model card](https://huggingface.co/FutureMa/Eva-4B) for model details, intended use, limitations, and evaluation information.
## Files
- `Eva-4B-F16.gguf` (FP16 / F16)
## Use with llama.cpp
### Option A: Install via Homebrew (macOS/Linux)
```bash
brew install llama.cpp
```
#### CLI
```bash
llama-cli --hf-repo FutureMa/Eva-4B-GGUF --hf-file Eva-4B-F16.gguf -p "The meaning of life and the universe is"
```
#### Server
```bash
llama-server --hf-repo FutureMa/Eva-4B-GGUF --hf-file Eva-4B-F16.gguf -c 2048
```
### Option B: Build llama.cpp from source
Step 1: Clone llama.cpp:
```bash
git clone https://github.com/ggerganov/llama.cpp
```
Step 2: Build (enable Hugging Face download support):
```bash
cd llama.cpp && LLAMA_CURL=1 make
```
Step 3: Run:
```bash
./llama-cli --hf-repo FutureMa/Eva-4B-GGUF --hf-file Eva-4B-F16.gguf -p "The meaning of life and the universe is"
```
or
```bash
./llama-server --hf-repo FutureMa/Eva-4B-GGUF --hf-file Eva-4B-F16.gguf -c 2048
```
## Notes
- The `-c 2048` value is an example context size; adjust based on your needs and available memory.
- If you publish additional quantizations (e.g. `Q4_K_M`, `Q5_K_M`), add them to the **Files** section above and reference them in the example commands.