Collections
Discover the best community collections!
Collections including paper arxiv:2404.07972
-
An Interactive Agent Foundation Model
Paper • 2402.05929 • Published • 29 -
From LLM to Conversational Agent: A Memory Enhanced Architecture with Fine-Tuning of Large Language Models
Paper • 2401.02777 • Published • 1 -
AgentScope: A Flexible yet Robust Multi-Agent Platform
Paper • 2402.14034 • Published • 13 -
LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error
Paper • 2403.04746 • Published • 24
-
Table-GPT: Table-tuned GPT for Diverse Table Tasks
Paper • 2310.09263 • Published • 40 -
A Zero-Shot Language Agent for Computer Control with Structured Reflection
Paper • 2310.08740 • Published • 15 -
The Consensus Game: Language Model Generation via Equilibrium Search
Paper • 2310.09139 • Published • 14 -
PaLI-3 Vision Language Models: Smaller, Faster, Stronger
Paper • 2310.09199 • Published • 29
-
How Far Are We from Intelligent Visual Deductive Reasoning?
Paper • 2403.04732 • Published • 21 -
MoAI: Mixture of All Intelligence for Large Language and Vision Models
Paper • 2403.07508 • Published • 78 -
DragAnything: Motion Control for Anything using Entity Representation
Paper • 2403.07420 • Published • 14 -
Learning and Leveraging World Models in Visual Representation Learning
Paper • 2403.00504 • Published • 33
-
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones
Paper • 2312.16862 • Published • 32 -
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action
Paper • 2312.17172 • Published • 31 -
Towards Truly Zero-shot Compositional Visual Reasoning with LLMs as Programmers
Paper • 2401.01974 • Published • 7 -
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
Paper • 2401.01885 • Published • 28
-
PDFTriage: Question Answering over Long, Structured Documents
Paper • 2309.08872 • Published • 55 -
TheBloke/dolphin-2.5-mixtral-8x7b-GPTQ
Text Generation • 47B • Updated • 38 • 113 -
GAIA: a benchmark for General AI Assistants
Paper • 2311.12983 • Published • 249 -
WebArena: A Realistic Web Environment for Building Autonomous Agents
Paper • 2307.13854 • Published • 27
-
How Far Are We from Intelligent Visual Deductive Reasoning?
Paper • 2403.04732 • Published • 21 -
MoAI: Mixture of All Intelligence for Large Language and Vision Models
Paper • 2403.07508 • Published • 78 -
DragAnything: Motion Control for Anything using Entity Representation
Paper • 2403.07420 • Published • 14 -
Learning and Leveraging World Models in Visual Representation Learning
Paper • 2403.00504 • Published • 33
-
An Interactive Agent Foundation Model
Paper • 2402.05929 • Published • 29 -
From LLM to Conversational Agent: A Memory Enhanced Architecture with Fine-Tuning of Large Language Models
Paper • 2401.02777 • Published • 1 -
AgentScope: A Flexible yet Robust Multi-Agent Platform
Paper • 2402.14034 • Published • 13 -
LLMs in the Imaginarium: Tool Learning through Simulated Trial and Error
Paper • 2403.04746 • Published • 24
-
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones
Paper • 2312.16862 • Published • 32 -
Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action
Paper • 2312.17172 • Published • 31 -
Towards Truly Zero-shot Compositional Visual Reasoning with LLMs as Programmers
Paper • 2401.01974 • Published • 7 -
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
Paper • 2401.01885 • Published • 28
-
Table-GPT: Table-tuned GPT for Diverse Table Tasks
Paper • 2310.09263 • Published • 40 -
A Zero-Shot Language Agent for Computer Control with Structured Reflection
Paper • 2310.08740 • Published • 15 -
The Consensus Game: Language Model Generation via Equilibrium Search
Paper • 2310.09139 • Published • 14 -
PaLI-3 Vision Language Models: Smaller, Faster, Stronger
Paper • 2310.09199 • Published • 29
-
PDFTriage: Question Answering over Long, Structured Documents
Paper • 2309.08872 • Published • 55 -
TheBloke/dolphin-2.5-mixtral-8x7b-GPTQ
Text Generation • 47B • Updated • 38 • 113 -
GAIA: a benchmark for General AI Assistants
Paper • 2311.12983 • Published • 249 -
WebArena: A Realistic Web Environment for Building Autonomous Agents
Paper • 2307.13854 • Published • 27