Gemma3-4B_WEASEL

This model is a WEASEL fine-tuned variant of google/gemma-3-4b-it for web-agent style reasoning and action generation.

Model description

  • Base model: google/gemma-3-4b-it
  • Fine-tuning objective: web task solving with reasoning/action style supervision
  • Format: Hugging Face Transformers + Safetensors

Intended use

This model is intended for research on web-agent behavior, instruction following, and reasoning traces.

Training data

Fine-tuned on an AgentTrek-style web-agent dataset. If released separately, link it here: yeonjooooni/agenttrek-dataset

Limitations

  • May generate incorrect or unsafe browser actions
  • Reasoning traces do not guarantee factual correctness
  • Not evaluated for safety-critical use

Loading

from transformers import AutoProcessor, AutoModelForImageTextToText

model_id = "yeonjooooni/Gemma3-4B_WEASEL"
processor = AutoProcessor.from_pretrained(model_id)
model = AutoModelForImageTextToText.from_pretrained(model_id)
Downloads last month
52
Safetensors
Model size
4B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for yeonjooooni/Gemma3-4B_WEASEL

Finetuned
(708)
this model

Dataset used to train yeonjooooni/Gemma3-4B_WEASEL

Collection including yeonjooooni/Gemma3-4B_WEASEL