Dev Khant's picture

25 3

Dev Khant

DK46

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Qwen2.5-Omni Technical Report

upvoted a paper 3 months ago

Great Models Think Alike and this Undermines AI Oversight

upvoted an article 3 months ago

Open-source DeepResearch – Freeing our search agents

View all activity

Organizations

None yet

DK46's activity

upvoted a paper about 1 month ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 143

upvoted a paper 3 months ago

Great Models Think Alike and this Undermines AI Oversight

Paper • 2502.04313 • Published Feb 6 • 34

upvoted an article 3 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.23k

upvoted 2 papers 3 months ago

GuardReasoner: Towards Reasoning-based LLM Safeguards

Paper • 2501.18492 • Published Jan 30 • 87

Humanity's Last Exam

Paper • 2501.14249 • Published Jan 24 • 73

upvoted a paper 9 months ago

Human-like Episodic Memory for Infinite Context LLMs

Paper • 2407.09450 • Published Jul 12, 2024 • 63

upvoted 13 papers about 1 year ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 259

TransformerFAM: Feedback attention is working memory

Paper • 2404.09173 • Published Apr 14, 2024 • 44

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Paper • 2404.08801 • Published Apr 12, 2024 • 68

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Paper • 2404.07143 • Published Apr 10, 2024 • 110

Octopus v2: On-device language model for super agent

Paper • 2404.01744 • Published Apr 2, 2024 • 59

Can large language models explore in-context?

Paper • 2403.15371 • Published Mar 22, 2024 • 34

PERL: Parameter Efficient Reinforcement Learning from Human Feedback

Paper • 2403.10704 • Published Mar 15, 2024 • 60

MoAI: Mixture of All Intelligence for Large Language and Vision Models

Paper • 2403.07508 • Published Mar 12, 2024 • 77

Stealing Part of a Production Language Model

Paper • 2403.06634 • Published Mar 11, 2024 • 92

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 615

Chain-of-Thought Reasoning Without Prompting

Paper • 2402.10200 • Published Feb 15, 2024 • 109

ChemLLM: A Chemical Large Language Model

Paper • 2402.06852 • Published Feb 10, 2024 • 31

More Agents Is All You Need

Paper • 2402.05120 • Published Feb 3, 2024 • 54

upvoted a paper over 1 year ago

MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation

Paper • 2401.04468 • Published Jan 9, 2024 • 50