Boosting Open-Domain Continual Learning via Leveraging Intra-domain Category-aware Prototype Paper • 2408.09984 • Published Aug 19 • 1
Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models Paper • 2312.06685 • Published Dec 9, 2023 • 1
Unleashing the Potentials of Likelihood Composition for Multi-modal Language Models Paper • 2410.00363 • Published Oct 1 • 1
From screenshots to HTML Collection WebSight is a dataset of 823,000 HTML/CSS codes representing synthetically generated English websites, each accompanied by a corresponding screenshot. • 4 items • Updated Apr 15 • 18
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions Paper • 2409.15278 • Published Sep 23 • 22
To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning Paper • 2409.12183 • Published Sep 18 • 36
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining Paper • 2408.02657 • Published Aug 5 • 32
Synthetic Data Generation Collection A curated list of papers focusing on synthetic data generation • 9 items • Updated Mar 11 • 3
Model Merging Collection Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12 • 217
OLMo Suite Collection Artifacts for the first set of OLMo models. • 18 items • Updated 3 days ago • 65
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models Paper • 2402.05935 • Published Feb 8 • 15
OBELICS 📚🔍 Collection Collection gathering artifacts related to OBELICS • 4 items • Updated Apr 15 • 5