DInghao's picture

22 5

DInghao

HaoDuy

AI & ML interests

None yet

Recent Activity

upvoted a paper 21 days ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

liked a dataset 21 days ago

GAIR/LIMO

liked a dataset 21 days ago

HuggingFaceFW/fineweb

View all activity

Organizations

None yet

HaoDuy's activity

upvoted a paper 21 days ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published 24 days ago • 182

liked 2 datasets 21 days ago

GAIR/LIMO

Viewer • Updated 27 days ago • 817 • 7.49k • 127

HuggingFaceFW/fineweb

Viewer • Updated Jan 31 • 25B • 317k • 2.02k

liked 3 models 21 days ago

tencent/Hunyuan3D-2

Image-to-3D • Updated 9 days ago • 38.2k • 1.04k

deepseek-ai/DeepSeek-R1

Text Generation • Updated 13 days ago • 3.64M • • 11k

agentica-org/DeepScaleR-1.5B-Preview

Text Generation • Updated 14 days ago • 63.4k • • 512

upvoted 14 papers 21 days ago

The Semantic Hub Hypothesis: Language Models Share Semantic Representations Across Languages and Modalities

Paper • 2411.04986 • Published Nov 7, 2024 • 6

Golden Touchstone: A Comprehensive Bilingual Benchmark for Evaluating Financial Large Language Models

Paper • 2411.06272 • Published Nov 9, 2024 • 4

DELIFT: Data Efficient Language model Instruction Fine Tuning

Paper • 2411.04425 • Published Nov 7, 2024 • 10

Parameter-Efficient Fine-Tuning of Large Language Models for Unit Test Generation: An Empirical Study

Paper • 2411.02462 • Published Nov 4, 2024 • 10

CAD-MLLM: Unifying Multimodality-Conditioned CAD Generation With MLLM

Paper • 2411.04954 • Published Nov 7, 2024 • 9

Balancing Pipeline Parallelism with Vocabulary Parallelism

Paper • 2411.05288 • Published Nov 8, 2024 • 20

Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding

Paper • 2411.04282 • Published Nov 6, 2024 • 34

StdGEN: Semantic-Decomposed 3D Character Generation from Single Images

Paper • 2411.05738 • Published Nov 8, 2024 • 15

RetrieveGPT: Merging Prompts and Mathematical Models for Enhanced Code-Mixed Information Retrieval

Paper • 2411.04752 • Published Nov 7, 2024 • 17

GazeGen: Gaze-Driven User Interaction for Visual Content Generation

Paper • 2411.04335 • Published Nov 7, 2024 • 15

SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation

Paper • 2411.04989 • Published Nov 7, 2024 • 15

SVDQunat: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Paper • 2411.05007 • Published Nov 7, 2024 • 18

Thanos: Enhancing Conversational Agents with Skill-of-Mind-Infused Large Language Model

Paper • 2411.04496 • Published Nov 7, 2024 • 23

Needle Threading: Can LLMs Follow Threads through Near-Million-Scale Haystacks?

Paper • 2411.05000 • Published Nov 7, 2024 • 22