Bridging the Data Provenance Gap Across Text, Speech and Video Paper • 2412.17847 • Published about 1 month ago • 8
TACO Models Collection This collection contains the best-performing TACO models based on LLaMA-3/Qwen2 and SigLIP/CLIP. • 3 items • Updated 29 days ago • 8
CoTA Datasets Collection This collection contains all versions of the CoTA (Chain-of-Thought-and-Action) datasets. • 5 items • Updated 29 days ago • 6
Idefics2 🐶 Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated May 6, 2024 • 91
APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets Paper • 2406.18518 • Published Jun 26, 2024 • 24
view article Article How to generate text: using different decoding methods for language generation with Transformers Mar 1, 2020 • 140
xLAM models Collection xLAM: A Family of Large Action Models to Empower AI Agent Systems: https://github.com/SalesforceAIResearch/xLAM • 11 items • Updated 30 days ago • 45
AgentOhana: Design Unified Data and Training Pipeline for Effective Agent Learning Paper • 2402.15506 • Published Feb 23, 2024 • 14
AgentLite: A Lightweight Library for Building and Advancing Task-Oriented LLM Agent System Paper • 2402.15538 • Published Feb 23, 2024 • 6
Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit Paper • 2401.00288 • Published Dec 30, 2023 • 1
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization Paper • 2308.02151 • Published Aug 4, 2023 • 18
DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AI Paper • 2307.10172 • Published Jul 19, 2023 • 12