Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers Paper β’ 2505.21497 β’ Published 30 days ago β’ 100
view article Article How to Build an MCP Server with Gradio By abidlabs and 1 other β’ Apr 30 β’ 174
view article Article Distilling from Dialogues: Finding Meaning in LLM Interactions By chansung β’ Feb 25 β’ 4
view article Article Open-R1: a fully open reproduction of DeepSeek-R1 By eliebak and 2 others β’ Jan 28 β’ 868
view article Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference By mfuntowicz and 1 other β’ Jan 16 β’ 75
KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models Paper β’ 2412.06071 β’ Published Dec 8, 2024 β’ 9
view article Article dstack to manage clusters of on-prem servers for AI workloads with ease By chansung β’ Oct 10, 2024 β’ 7
DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. β’ 2 items β’ Updated 27 days ago β’ 86
view article Article Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques π π By Isayoften β’ Aug 26, 2024 β’ 67
LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs Paper β’ 2408.13467 β’ Published Aug 24, 2024 β’ 26
view article Article dstack: Your LLM Launchpad - From Fine-Tuning to Serving, Simplified By chansung β’ Aug 22, 2024 β’ 13
view article Article Expanding Model Context and Creating Chat Models with a Single Click By maywell β’ Apr 28, 2024 β’ 38
view article Article CodeGemma - an official Google release for code LLMs By pcuenq and 5 others β’ Apr 9, 2024 β’ 101
DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing Paper β’ 2306.14435 β’ Published Jun 26, 2023 β’ 20