arxiv:2310.16944
Nathan Habib PRO
AI & ML interests
Evals
Recent Activity
upvoted an article about 10 hours ago
Is it agentic enough? Benchmarking open models on your own tooling published an article 1 day ago
Is it agentic enough? Benchmarking open models on your own tooling liked a model 4 days ago
nex-agi/Nex-N2-Pro