Ajith V Prabhakar

ajithprabhakar

https://www.ajithp.com

AI & ML interests

NLP, Responsible AI, Generative AI

Recent Activity

commented on a paper about 1 month ago

Byte Latent Transformer: Patches Scale Better Than Tokens

commented on a paper 4 months ago

Training Language Models to Self-Correct via Reinforcement Learning

liked a model 4 months ago

mattshumer/Reflection-Llama-3.1-70B

Posts 2

Post

Hi All,
In my latest blog post, I created a comprehensive guide on LLM Benchmarking.
➟ 20+ key benchmarks, from MMLU to TruthfulQA
➟ How each benchmark assesses different LLM capabilities
➟ Why benchmarking matters for real-world AI applications
➟ Future trends in AI evaluation
Read the blog here: https://wp.me/p7Qix-wO

Please let me know your thoughts, suggestions, and comments.

Post

Can AI cheat or lie?

In this blog, we will explore the research conducted by experts from MIT, Australian Catholic University, and the Center for AI Safety to better understand the nature of AI deception, its various forms, and the potential risks it poses. We will examine real-world examples and the underlying mechanisms that enable AI systems to deceive.

Learn more at: https://ajithp.com/2024/05/12/ai-deception-risks-real-world-examples-and-proactive-solutions/

Collections 1

Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision

OneLLM: One Framework to Align All Modalities with Language

Generative Multimodal Models are In-Context Learners

The LLM Surgeon
