6 8 59

Yang

jacklanda

AI & ML interests

Language Modeling, Lexical Semantics

Recent Activity

upvoted a paper 4 days ago

RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling

new activity 9 days ago

RuleReasoner/rule-reasoning:Upload README.md with huggingface_hub

new activity 9 days ago

RuleReasoner/rule-reasoning:Upload folder using huggingface_hub

View all activity

Organizations

None yet

jacklanda's activity

upvoted a paper 4 days ago

RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling

Paper • 2506.08672 • Published 5 days ago • 30

New activity in RuleReasoner/rule-reasoning 9 days ago

Upload README.md with huggingface_hub

#2 opened 9 days ago by

jacklanda

Upload folder using huggingface_hub

#1 opened 9 days ago by

jacklanda

liked a dataset 11 days ago

bigai-nlco/ReflectionEvo

Viewer • Updated 11 days ago • 437k • 641 • 11

authored 2 papers 13 days ago

In-Context Meta LoRA Generation

Paper • 2501.17635 • Published Jan 29

ReflectEvo: Improving Meta Introspection of Small LLMs by Learning Self-Reflection

Paper • 2505.16475 • Published 24 days ago • 2

upvoted a paper 26 days ago

Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space

Paper • 2505.13308 • Published 27 days ago • 26

upvoted a paper about 1 month ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 170

liked a model 2 months ago

virtuoussy/Qwen2.5-7B-Instruct-RLVR

Updated May 4 • 36 • 12

liked a dataset 2 months ago

opendatalab/ProverQA

Preview • Updated 4 days ago • 100 • 5

upvoted a paper 2 months ago

OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts

Paper • 2503.22952 • Published Mar 29 • 18

liked a dataset 3 months ago

facebook/natural_reasoning

Viewer • Updated Feb 21 • 1.15M • 1.58k • 506

published a model 4 months ago

jacklanda/Qwen-2.5-1.5B-Simple-RL

Updated Feb 17

liked a dataset 6 months ago

peiyi9979/Math-Shepherd

Viewer • Updated Jan 3, 2024 • 445k • 315 • 97

liked a model 6 months ago

answerdotai/ModernBERT-large

Fill-Mask • Updated Jan 15 • 70.3k • 402

liked a model 7 months ago

openai/whisper-large-v3-turbo

Automatic Speech Recognition • Updated Oct 4, 2024 • 3.41M • • 2.43k

liked a dataset 7 months ago

O1-OPEN/OpenO1-SFT

Viewer • Updated Apr 22 • 77.7k • 764 • 376

liked a model 8 months ago

TencentBAC/Conan-embedding-v1

Updated Nov 27, 2024 • 40.5k • 158

liked a model 9 months ago

google/gemma-scope-2b-pt-res

Updated Jan 19 • 12

liked a dataset 9 months ago

google-research-datasets/nq_open

Viewer • Updated Mar 22, 2024 • 91.5k • 3.89k • 26