Jovillios (Jules Decaestecker)

upvoted a paper 7 months ago

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Paper • 2406.11931 • Published Jun 17, 2024 • 62

upvoted 8 papers 8 months ago

Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models

Paper • 2405.20541 • Published May 30, 2024 • 22

Transformers are SSMs: Generalized Models and Efficient Algorithms Through Structured State Space Duality

Paper • 2405.21060 • Published May 31, 2024 • 64

LLMs achieve adult human performance on higher-order theory of mind tasks

Paper • 2405.18870 • Published May 29, 2024 • 17

upvoted a paper 9 months ago

RLHF Workflow: From Reward Modeling to Online RLHF

Paper • 2405.07863 • Published May 13, 2024 • 67

upvoted an article 9 months ago

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

Article • Jun 4, 2024 • 75

upvoted a paper 9 months ago

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29, 2024 • 120

upvoted an article 9 months ago

Fine-tune Llama 3 with ORPO

Article • Apr 22, 2024 • 233

upvoted 3 papers 9 months ago

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12, 2024 • 64

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2, 2024 • 121

Make Your LLM Fully Utilize the Context

Paper • 2404.16811 • Published Apr 25, 2024 • 53

upvoted an article 9 months ago

Welcome Llama 3 - Meta's new open LLM

Article • Apr 18, 2024 • 282


Jovillios's activity

mDPO: Conditional Preference Optimization for Multimodal Large Language Models

CRAG -- Comprehensive RAG Benchmark

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

BitsFusion: 1.99 bits Weight Quantization of Diffusion Model

RAFT: Adapting Language Model to Domain Specific RAG
