Video2LoRA: Parametric Video Internalization for Vision-Language Models
Paper • 2606.04351 • Published • 4
Official implementation of Frames2LoRA
Manan Suri · Sarvesh Baskar · Dinesh Manocha
University of Maryland, College Park
This repository contains two Frames2LoRA Stage 1 checkpoint files:
frames2lora-smolvlm2-500m-best-ce.pt for HuggingFaceTB/SmolVLM2-500M-Video-Instructframes2lora-smolvlm2-2.2b-best-ce.pt for HuggingFaceTB/SmolVLM2-2.2B-Instruct@misc{suri2026frames2loraparametricvideointernalization,
title={Frames2LoRA: Parametric Video Internalization for Vision-Language Models},
author={Manan Suri and Sarvesh Baskar and Dinesh Manocha},
year={2026},
eprint={2606.04351},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2606.04351},
}