---
language:
- en
license: apache-2.0
base_model: FutureMa/Eva-4B
tags:
- finance
- earnings-calls
- financial-nlp
- text-classification
- qwen3
- llama-cpp
- gguf
- gguf-my-repo
pipeline_tag: text-generation
library_name: transformers
---

# FutureMa/Eva-4B-GGUF

This repository hosts **GGUF** files for [`FutureMa/Eva-4B`](https://huggingface.co/FutureMa/Eva-4B), intended for use with [`llama.cpp`](https://github.com/ggerganov/llama.cpp).

- **Base model:** `FutureMa/Eva-4B`
- **Format:** GGUF (for llama.cpp)
- **License:** Apache-2.0

Refer to the [original model card](https://huggingface.co/FutureMa/Eva-4B) for model details, intended use, limitations, and evaluation information.

## Files

- `Eva-4B-F16.gguf` (FP16 / F16)

## Use with llama.cpp

### Option A: Install via Homebrew (macOS/Linux)

```bash
brew install llama.cpp
```

#### CLI

```bash
llama-cli --hf-repo FutureMa/Eva-4B-GGUF --hf-file Eva-4B-F16.gguf -p "The meaning of life and the universe is"
```

#### Server

```bash
llama-server --hf-repo FutureMa/Eva-4B-GGUF --hf-file Eva-4B-F16.gguf -c 2048
```

### Option B: Build llama.cpp from source

Step 1: Clone llama.cpp:

```bash
git clone https://github.com/ggerganov/llama.cpp
```

Step 2: Build (enable Hugging Face download support):

```bash
cd llama.cpp && LLAMA_CURL=1 make
```

Step 3: Run:

```bash
./llama-cli --hf-repo FutureMa/Eva-4B-GGUF --hf-file Eva-4B-F16.gguf -p "The meaning of life and the universe is"
```

or

```bash
./llama-server --hf-repo FutureMa/Eva-4B-GGUF --hf-file Eva-4B-F16.gguf -c 2048
```

## Notes

- The `-c 2048` value is an example context size; adjust based on your needs and available memory.
- If you publish additional quantizations (e.g. `Q4_K_M`, `Q5_K_M`), add them to the **Files** section above and reference them in the example commands.