lmms-lab/LLaVA-Video-178K
How to use tsinghua-ee/video-SALMONN-2 with Transformers:
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("tsinghua-ee/video-SALMONN-2")
model = AutoModelForCausalLM.from_pretrained("tsinghua-ee/video-SALMONN-2")

Official model release of video-SALMONN 2: Captioning-Enhanced Audio-Visual Large Language Models
Base model: Qwen/Qwen2-7B