Costa Pissaris's picture

24 28

Costa Pissaris

somtimz

·

AI & ML interests

None yet

Recent Activity

upvoted a collection about 1 month ago

upvoted a collection 2 months ago

Self-improving LLMs

upvoted a paper 2 months ago

Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification

View all activity

Organizations

upvoted a collection about 1 month ago

Gemma 3n

4 items • Updated 23 days ago • 201

upvoted a collection 2 months ago

Self-improving LLMs

17 items • Updated Mar 27 • 2

upvoted a paper 2 months ago

Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification

Paper • 2502.01839 • Published Feb 3 • 11

upvoted a paper 3 months ago

Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge

Paper • 2407.19594 • Published Jul 28, 2024 • 21

upvoted 2 articles 4 months ago

Article

Evalverse: Revolutionizing Large Language Model Evaluation with a Unified, User-Friendly Framework

By

•

May 7, 2024

• 3

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

By

•

Jul 29, 2024

• 352

upvoted a collection 5 months ago

Gemma 3 Release

24 items • Updated 23 days ago • 425

upvoted an article 10 months ago

Article

Let's talk about LLM evaluation

By

•

May 23, 2024

• 181

upvoted 2 papers about 1 year ago

RAG Does Not Work for Enterprises

Paper • 2406.04369 • Published May 31, 2024 • 1

Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models

Paper • 2406.04271 • Published Jun 6, 2024 • 31

upvoted a collection over 1 year ago

Preference Datasets for DPO

This collection contains a list of curated preference datasets for DPO fine-tuning for intent alignment of LLMs • 7 items • Updated Dec 11, 2024 • 43

upvoted 5 papers over 1 year ago

Mixtral of Experts

Paper • 2401.04088 • Published Jan 8, 2024 • 159

WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation

Paper • 2312.14187 • Published Dec 20, 2023 • 51

Gemini: A Family of Highly Capable Multimodal Models

Paper • 2312.11805 • Published Dec 19, 2023 • 46

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 226

Orca 2: Teaching Small Language Models How to Reason

Paper • 2311.11045 • Published Nov 18, 2023 • 76

upvoted a collection over 1 year ago

Nemotron 3 8B

The Nemotron 3 8B Family of models is optimized for building production-ready generative AI applications for the enterprise. • 5 items • Updated 12 days ago • 51

upvoted 3 papers almost 2 years ago

Eureka: Human-Level Reward Design via Coding Large Language Models

Paper • 2310.12931 • Published Oct 19, 2023 • 26

In-Context Pretraining: Language Modeling Beyond Document Boundaries

Paper • 2310.10638 • Published Oct 16, 2023 • 30

Table-GPT: Table-tuned GPT for Diverse Table Tasks

Paper • 2310.09263 • Published Oct 13, 2023 • 41