SlimMoE: Structured Compression of Large MoE Models via Expert Slimming and Distillation Paper • 2506.18349 • Published Jun 23 • 13
COSMOS: A Hybrid Adaptive Optimizer for Memory-Efficient Training of LLMs Paper • 2502.17410 • Published Feb 24
LLMs Can Generate a Better Answer by Aggregating Their Own Responses Paper • 2503.04104 • Published Mar 6 • 1
KOROL: Learning Visualizable Object Feature with Koopman Operator Rollout for Manipulation Paper • 2407.00548 • Published Jun 29, 2024
Flow-of-Options: Diversified and Improved LLM Reasoning by Thinking Through Options Paper • 2502.12929 • Published Feb 18 • 7
Training Socially Aligned Language Models in Simulated Human Society Paper • 2305.16960 • Published May 26, 2023 • 3
Confidence Calibration and Rationalization for LLMs via Multi-Agent Deliberation Paper • 2404.09127 • Published Apr 14, 2024 • 2
Improved Generation of Synthetic Imaging Data Using Feature-Aligned Diffusion Paper • 2410.00731 • Published Oct 1, 2024
Creative Problem Solving in Large Language and Vision Models -- What Would it Take? Paper • 2405.01453 • Published May 2, 2024
HelpSteer2-Preference: Complementing Ratings with Preferences Paper • 2410.01257 • Published Oct 2, 2024 • 25
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper • 2404.14219 • Published Apr 22, 2024 • 257
GeorgiaTech/0.0005_llama_nodpo_3iters_bs128_531lr_oldtrl_iter_3 Text Generation • 8B • Updated May 13, 2024 • 2