Travis King

travisking

AI & ML interests

have you heard of generative AI?

Recent Activity

liked a model 3 days ago

templates/model-card-example

liked a model 3 days ago

mrfakename/mistral-small-3.1-24b-instruct-2503-hf

Organizations

None yet

travisking's activity

upvoted a paper 3 days ago

All is Not Lost: LLM Recovery without Checkpoints

Paper • 2506.15461 • Published 5 days ago • 31

upvoted a collection 3 days ago

MultiFinBen

Collection

4 items • Updated May 16 • 3

upvoted a paper 12 days ago

Geopolitical biases in LLMs: what are the "good" and the "bad" countries according to contemporary language models

Paper • 2506.06751 • Published 16 days ago • 71

upvoted 3 papers 13 days ago

upvoted a collection 22 days ago

Falcon-H1

Collection

Falcon-H1 Family of Hybrid-Head Language Models (Transformer-SSM), including 0.5B, 1.5B, 1.5B-Deep, 3B, 7B, and 34B (pretrained & instruction-tuned). • 37 items • Updated 10 days ago • 42

upvoted an article 22 days ago

Article

Building an Open Ecosystem for Time Series Forecasting: Introducing TimesFM in Hugging Face

and 1 other • May 19 • 16

upvoted an article about 1 month ago

Article

The N Implementation Details of RLHF with PPO

and 2 others • Oct 24, 2023 • 59

upvoted a collection about 1 month ago

Falcon Edge series

Collection

A series of powerful, universal and fine-tunable small Language Models • 7 items • Updated May 21 • 22

upvoted 2 papers about 1 month ago

Learning Dynamics in Continual Pre-Training for Large Language Models

Paper • 2505.07796 • Published May 12 • 19

Generating Physically Stable and Buildable LEGO Designs from Text

Paper • 2505.05469 • Published May 8 • 27

upvoted 8 papers about 2 months ago

LLMs for Engineering: Teaching Models to Design High Powered Rockets

Paper • 2504.19394 • Published Apr 27 • 13

Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks

Paper • 2505.00234 • Published May 1 • 26

A Survey on Inference Engines for Large Language Models: Perspectives on Optimization and Efficiency

Paper • 2505.01658 • Published May 3 • 36

Practical Efficiency of Muon for Pretraining

Paper • 2505.02222 • Published May 4 • 37

An Empirical Study of Qwen3 Quantization

Paper • 2505.02214 • Published May 4 • 24

DeepCritic: Deliberate Critique with Large Language Models

Paper • 2505.00662 • Published May 1 • 53

Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements

Paper • 2410.08968 • Published Oct 11, 2024 • 14

The Leaderboard Illusion

Paper • 2504.20879 • Published Apr 29 • 70