Yedidia AGNIMO's picture

24 1

Yedidia AGNIMO

Yedson54

·

AI & ML interests

Reinforcement Learning, Federated Learning

Organizations

Yedson54's activity

upvoted 3 papers 4 months ago

Law of the Weakest Link: Cross Capabilities of Large Language Models

Paper • 2409.19951 • Published Sep 30, 2024 • 54

YesBut: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models

Paper • 2409.13592 • Published Sep 20, 2024 • 49

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 136

upvoted 6 papers 5 months ago

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval

Paper • 2409.10516 • Published Sep 16, 2024 • 41

On the Diagram of Thought

Paper • 2409.10038 • Published Sep 16, 2024 • 13

Towards Building the Federated GPT: Federated Instruction Tuning

Paper • 2305.05644 • Published May 9, 2023 • 5

A Web-Based Solution for Federated Learning with LLM-Based Automation

Paper • 2408.13010 • Published Aug 23, 2024 • 10

Attention Heads of Large Language Models: A Survey

Paper • 2409.03752 • Published Sep 5, 2024 • 89

Authorship Attribution in the Era of LLMs: Problems, Methodologies, and Challenges

Paper • 2408.08946 • Published Aug 16, 2024 • 12

upvoted 11 papers 7 months ago

Patch-Level Training for Large Language Models

Paper • 2407.12665 • Published Jul 17, 2024 • 17

Foundational Autoraters: Taming Large Language Models for Better Automatic Evaluation

Paper • 2407.10817 • Published Jul 15, 2024 • 14

The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism

Paper • 2407.10457 • Published Jul 15, 2024 • 23

SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers

Paper • 2407.09413 • Published Jul 12, 2024 • 10

Human-like Episodic Memory for Infinite Context LLMs

Paper • 2407.09450 • Published Jul 12, 2024 • 60

MUSCLE: A Model Update Strategy for Compatible LLM Evolution

Paper • 2407.09435 • Published Jul 12, 2024 • 22

H2O-Danube3 Technical Report

Paper • 2407.09276 • Published Jul 12, 2024 • 19

Associative Recurrent Memory Transformer

Paper • 2407.04841 • Published Jul 5, 2024 • 32

Training Task Experts through Retrieval Based Distillation

Paper • 2407.05463 • Published Jul 7, 2024 • 8

Learn Your Reference Model for Real Good Alignment

Paper • 2404.09656 • Published Apr 15, 2024 • 83

CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues

Paper • 2404.03820 • Published Apr 4, 2024 • 25