Datasets Datasets for training and evaluation teknium/OpenHermes-2.5 Viewer • Updated Apr 15, 2024 • 1M • 3.22k • 737 liuhaotian/LLaVA-Instruct-150K Preview • Updated Jan 3, 2024 • 3.07k • 523 euclaise/reddit-instruct-curated Viewer • Updated Feb 1, 2024 • 11k • 129 • 20 xingyaoww/code-act Viewer • Updated Feb 5, 2024 • 78.4k • 259 • 67
RAG RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval Paper • 2401.18059 • Published Jan 31, 2024 • 46
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval Paper • 2401.18059 • Published Jan 31, 2024 • 46
Agent A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts Paper • 2402.09727 • Published Feb 15, 2024 • 39 OS-Copilot: Towards Generalist Computer Agents with Self-Improvement Paper • 2402.07456 • Published Feb 12, 2024 • 46 Executable Code Actions Elicit Better LLM Agents Paper • 2402.01030 • Published Feb 1, 2024 • 154
A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts Paper • 2402.09727 • Published Feb 15, 2024 • 39
OS-Copilot: Towards Generalist Computer Agents with Self-Improvement Paper • 2402.07456 • Published Feb 12, 2024 • 46
Unlearning Rethinking Machine Unlearning for Large Language Models Paper • 2402.08787 • Published Feb 13, 2024 • 3
Rethinking Machine Unlearning for Large Language Models Paper • 2402.08787 • Published Feb 13, 2024 • 3
Uncertainity Experts Don't Cheat: Learning What You Don't Know By Predicting Pairs Paper • 2402.08733 • Published Feb 13, 2024
Experts Don't Cheat: Learning What You Don't Know By Predicting Pairs Paper • 2402.08733 • Published Feb 13, 2024
KV SubGen: Token Generation in Sublinear Time and Memory Paper • 2402.06082 • Published Feb 8, 2024 • 12
SubGen: Token Generation in Sublinear Time and Memory Paper • 2402.06082 • Published Feb 8, 2024 • 12
RL Offline Actor-Critic Reinforcement Learning Scales to Large Models Paper • 2402.05546 • Published Feb 8, 2024 • 5
Offline Actor-Critic Reinforcement Learning Scales to Large Models Paper • 2402.05546 • Published Feb 8, 2024 • 5
Reasoning Chain-of-Thought Reasoning Without Prompting Paper • 2402.10200 • Published Feb 15, 2024 • 110 SocraSynth: Multi-LLM Reasoning with Conditional Statistics Paper • 2402.06634 • Published Jan 19, 2024
SocraSynth: Multi-LLM Reasoning with Conditional Statistics Paper • 2402.06634 • Published Jan 19, 2024
FT/IT DoRA: Weight-Decomposed Low-Rank Adaptation Paper • 2402.09353 • Published Feb 14, 2024 • 27 LESS: Selecting Influential Data for Targeted Instruction Tuning Paper • 2402.04333 • Published Feb 6, 2024 • 3
LESS: Selecting Influential Data for Targeted Instruction Tuning Paper • 2402.04333 • Published Feb 6, 2024 • 3
Drug Design A Survey of Generative AI for De Novo Drug Design: New Frontiers in Molecule and Protein Generation Paper • 2402.08703 • Published Feb 13, 2024 • 1
A Survey of Generative AI for De Novo Drug Design: New Frontiers in Molecule and Protein Generation Paper • 2402.08703 • Published Feb 13, 2024 • 1
Learning Methods Learning How To Ask: Cycle-Consistency Refines Prompts in Multimodal Foundation Models Paper • 2402.08756 • Published Feb 13, 2024 Predictive representations: building blocks of intelligence Paper • 2402.06590 • Published Feb 9, 2024
Learning How To Ask: Cycle-Consistency Refines Prompts in Multimodal Foundation Models Paper • 2402.08756 • Published Feb 13, 2024
Predictive representations: building blocks of intelligence Paper • 2402.06590 • Published Feb 9, 2024
Large Context World Model on Million-Length Video And Language With RingAttention Paper • 2402.08268 • Published Feb 13, 2024 • 40
World Model on Million-Length Video And Language With RingAttention Paper • 2402.08268 • Published Feb 13, 2024 • 40
ANNs Approximate Nearest Neighbor Search with Window Filters Paper • 2402.00943 • Published Feb 1, 2024
Models Demonthos/dolphin-2_6-phi-2-candle Text Generation • 3B • Updated Feb 27, 2024 • 61 • 4 m-a-p/OpenCodeInterpreter-DS-6.7B Text Generation • 7B • Updated Mar 3, 2024 • 2.15k • 136
Datasets Datasets for training and evaluation teknium/OpenHermes-2.5 Viewer • Updated Apr 15, 2024 • 1M • 3.22k • 737 liuhaotian/LLaVA-Instruct-150K Preview • Updated Jan 3, 2024 • 3.07k • 523 euclaise/reddit-instruct-curated Viewer • Updated Feb 1, 2024 • 11k • 129 • 20 xingyaoww/code-act Viewer • Updated Feb 5, 2024 • 78.4k • 259 • 67
Reasoning Chain-of-Thought Reasoning Without Prompting Paper • 2402.10200 • Published Feb 15, 2024 • 110 SocraSynth: Multi-LLM Reasoning with Conditional Statistics Paper • 2402.06634 • Published Jan 19, 2024
SocraSynth: Multi-LLM Reasoning with Conditional Statistics Paper • 2402.06634 • Published Jan 19, 2024
RAG RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval Paper • 2401.18059 • Published Jan 31, 2024 • 46
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval Paper • 2401.18059 • Published Jan 31, 2024 • 46
FT/IT DoRA: Weight-Decomposed Low-Rank Adaptation Paper • 2402.09353 • Published Feb 14, 2024 • 27 LESS: Selecting Influential Data for Targeted Instruction Tuning Paper • 2402.04333 • Published Feb 6, 2024 • 3
LESS: Selecting Influential Data for Targeted Instruction Tuning Paper • 2402.04333 • Published Feb 6, 2024 • 3
Agent A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts Paper • 2402.09727 • Published Feb 15, 2024 • 39 OS-Copilot: Towards Generalist Computer Agents with Self-Improvement Paper • 2402.07456 • Published Feb 12, 2024 • 46 Executable Code Actions Elicit Better LLM Agents Paper • 2402.01030 • Published Feb 1, 2024 • 154
A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts Paper • 2402.09727 • Published Feb 15, 2024 • 39
OS-Copilot: Towards Generalist Computer Agents with Self-Improvement Paper • 2402.07456 • Published Feb 12, 2024 • 46
Drug Design A Survey of Generative AI for De Novo Drug Design: New Frontiers in Molecule and Protein Generation Paper • 2402.08703 • Published Feb 13, 2024 • 1
A Survey of Generative AI for De Novo Drug Design: New Frontiers in Molecule and Protein Generation Paper • 2402.08703 • Published Feb 13, 2024 • 1
Unlearning Rethinking Machine Unlearning for Large Language Models Paper • 2402.08787 • Published Feb 13, 2024 • 3
Rethinking Machine Unlearning for Large Language Models Paper • 2402.08787 • Published Feb 13, 2024 • 3
Learning Methods Learning How To Ask: Cycle-Consistency Refines Prompts in Multimodal Foundation Models Paper • 2402.08756 • Published Feb 13, 2024 Predictive representations: building blocks of intelligence Paper • 2402.06590 • Published Feb 9, 2024
Learning How To Ask: Cycle-Consistency Refines Prompts in Multimodal Foundation Models Paper • 2402.08756 • Published Feb 13, 2024
Predictive representations: building blocks of intelligence Paper • 2402.06590 • Published Feb 9, 2024
Uncertainity Experts Don't Cheat: Learning What You Don't Know By Predicting Pairs Paper • 2402.08733 • Published Feb 13, 2024
Experts Don't Cheat: Learning What You Don't Know By Predicting Pairs Paper • 2402.08733 • Published Feb 13, 2024
Large Context World Model on Million-Length Video And Language With RingAttention Paper • 2402.08268 • Published Feb 13, 2024 • 40
World Model on Million-Length Video And Language With RingAttention Paper • 2402.08268 • Published Feb 13, 2024 • 40
KV SubGen: Token Generation in Sublinear Time and Memory Paper • 2402.06082 • Published Feb 8, 2024 • 12
SubGen: Token Generation in Sublinear Time and Memory Paper • 2402.06082 • Published Feb 8, 2024 • 12
ANNs Approximate Nearest Neighbor Search with Window Filters Paper • 2402.00943 • Published Feb 1, 2024
RL Offline Actor-Critic Reinforcement Learning Scales to Large Models Paper • 2402.05546 • Published Feb 8, 2024 • 5
Offline Actor-Critic Reinforcement Learning Scales to Large Models Paper • 2402.05546 • Published Feb 8, 2024 • 5
Models Demonthos/dolphin-2_6-phi-2-candle Text Generation • 3B • Updated Feb 27, 2024 • 61 • 4 m-a-p/OpenCodeInterpreter-DS-6.7B Text Generation • 7B • Updated Mar 3, 2024 • 2.15k • 136