Sailor2

community

AI & ML interests

Open language models for South-East Asia

Recent Activity

huybery

authored a paper 11 days ago

SWE-Universe: Scale Real-World Verifiable Environments to Millions

Paper • 2602.02361 • Published 13 days ago • 60

afaji

authored a paper 19 days ago

PingPong: A Natural Benchmark for Multi-Turn Code-Switching Dialogues

Paper • 2601.17277 • Published 23 days ago • 6

gentaiscool

authored a paper 19 days ago

PingPong: A Natural Benchmark for Multi-Turn Code-Switching Dialogues

Paper • 2601.17277 • Published 23 days ago • 6

dreamerdeo

authored 2 papers 3 months ago

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5, 2025 • 129

Training Optimal Large Diffusion Language Models

Paper • 2510.03280 • Published Sep 28, 2025

SivilTaram

authored a paper 3 months ago

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5, 2025 • 129

huybery

authored a paper 4 months ago

VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos

Paper • 2510.19488 • Published Oct 22, 2025 • 20

pitikorn32

authored a paper 4 months ago

Deflanderization for Game Dialogue: Balancing Character Authenticity with Task Execution in LLM-based NPCs

Paper • 2510.13586 • Published Oct 15, 2025 • 1

taicheng

authored 3 papers 4 months ago

SceMQA: A Scientific College Entrance Level Multimodal Question Answering Benchmark

Paper • 2402.05138 • Published Feb 6, 2024 • 2

Data Interpreter: An LLM Agent For Data Science

Paper • 2402.18679 • Published Feb 28, 2024 • 1

MTSQL-R1: Towards Long-Horizon Multi-Turn Text-to-SQL via Agentic Training

Paper • 2510.12831 • Published Oct 12, 2025 • 5

saksornr

authored 2 papers 4 months ago

Talk Less, Call Right: Enhancing Role-Play LLM Agents with Automatic Prompt Optimization and Role Prompting

Paper • 2509.00482 • Published Aug 30, 2025

Thai Semantic End-of-Turn Detection for Real-Time Voice Agents

Paper • 2510.04016 • Published Oct 5, 2025 • 4

Cameron-Chen

authored a paper 5 months ago

GEM: A Gym for Agentic LLMs

Paper • 2510.01051 • Published Oct 1, 2025 • 90

Abhaykoul

posted an update 5 months ago

Post

3258

🚀 Ever dreamed of training your own Large Language Model from scratch? What if I told you it doesn't require a supercomputer or a PhD in ML? 🤯

Introducing LLM Trainer - the educational framework that makes LLM training accessible to EVERYONE! Whether you're on a CPU-only laptop or scaling to distributed GPUs, we've got you covered. 💻➡️🖥️

Why LLM Trainer? Because existing tools are either too simplistic (hiding the magic) or too complex (requiring expert knowledge). We bridge the gap with:

🎓 Educational transparency - every component built from scratch with clear code
💻 CPU-first approach - start training immediately, no GPU needed
🔧 Full customization - modify anything you want
📈 Seamless scaling - from laptop to cluster without code changes
🤝 HuggingFace integration - works with existing models & tokenizers

Key highlights:
✅ Built-in tokenizers (BPE, WordPiece, HF wrappers)
✅ Complete Transformer implementation from scratch
✅ Optimized for CPU training
✅ Advanced features: mixed precision, gradient checkpointing, multiple generation strategies
✅ Comprehensive monitoring & metrics

Perfect for:
- Students learning transformers
- Researchers prototyping new ideas
- Developers building domain-specific models

Ready to train your first LLM? It's easier than you think!

🔗 Check it out: https://github.com/HelpingAI/llm-trainer
📚 Docs: Getting Started Guide
💬 Join the community: GitHub Discussions

#AI #MachineLearning #LLM #DeepLearning #OpenSource #Python #HuggingFace #NLP

Special thanks to HuggingFace and PyTorch teams for the amazing ecosystem! 🙏

1 reply

SivilTaram

authored a paper 6 months ago

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published Sep 2, 2025 • 84

afaji

authored a paper 6 months ago

Predicting the Order of Upcoming Tokens Improves Language Modeling

Paper • 2508.19228 • Published Aug 26, 2025 • 23

wannaphong

authored a paper 6 months ago

Mangosteen: An Open Thai Corpus for Language Model Pretraining

Paper • 2507.14664 • Published Jul 19, 2025 • 7

Abhaykoul

posted an update 7 months ago

Post

4169

🚀 Dhanishtha-2.0-preview-0825 Is Here

The Intermediate Thinking Model just leveled up again.

With sharper reasoning, better tool use, and expanded capabilities, Dhanishtha-2.0-preview-0825 is now live and ready to impress.

🧠 What Makes Dhanishtha Special?
Unlike typical CoT models that think only once, Dhanishtha thinks iteratively:

> Think → Answer → Rethink → Improve → Rethink again if needed.

🔗 Try it now: HelpingAI/Dhanishtha-2.0-preview-0825

🔞 Dhanishtha NSFW Preview

For those exploring more expressive and immersive roleplay scenarios, we’re also releasing:

HelpingAI/Dhanishtha-nsfw
A specialized version tuned for adult-themed interactions and character-driven roleplay.

🔗 Explore it here: HelpingAI/Dhanishtha-nsfw

💬 You can also try all of these live at chat.helpingai.co

4 replies

gabrielchua

authored a paper 7 months ago

Running in CIRCLE? A Simple Benchmark for LLM Code Interpreter Security

Paper • 2507.19399 • Published Jul 25, 2025 • 2

Team members 114
