14 9 93

tokenbender

TokenBender

https://tokenbender.com

AI & ML interests

Fine-tune small useful models, build datasets and anything related to local LLM hosting and serving.

Organizations

upvoted a collection 8 months ago

Llama Nemotron

Collection

Open, Production-ready Enterprise Models • 12 items • Updated 10 days ago • 75

upvoted a paper 8 months ago

Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning

Paper • 2505.01441 • Published Apr 28, 2025 • 39

upvoted an article about 1 year ago

Article

Releasing the largest multilingual open pretraining dataset

Nov 13, 2024

•

104

upvoted an article over 1 year ago

Article

Introduction to ggml

Aug 13, 2024

•

256

upvoted a collection over 1 year ago

Gemma 2 2B Release

Collection

The 2.6B parameter version of Gemma 2. • 6 items • Updated Jul 10, 2025 • 82

upvoted a paper over 1 year ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20, 2024 • 95

upvoted an article over 1 year ago

Article

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation

Apr 29, 2024

•

upvoted 2 papers over 2 years ago

LongNet: Scaling Transformers to 1,000,000,000 Tokens

Paper • 2307.02486 • Published Jul 5, 2023 • 81

MEGABYTE: Predicting Million-byte Sequences with Multiscale Transformers

Paper • 2305.07185 • Published May 12, 2023 • 9

tokenbender

AI & ML interests

Organizations

TokenBender's activity

Releasing the largest multilingual open pretraining dataset

Introduction to ggml

StarCoder2-Instruct: Fully Transparent and Permissive Self-Alignment for Code Generation