ChemDFM-R: An Chemical Reasoner LLM Enhanced with Atomized Chemical Knowledge
Abstract
A Chemical Reasoner LLM, ChemDFM-R, enhances chemical reasoning through a comprehensive dataset, mix-sourced distillation, and domain-specific reinforcement learning, achieving state-of-the-art performance with interpretable outputs.
While large language models (LLMs) have achieved impressive progress, their application in scientific domains such as chemistry remains hindered by shallow domain understanding and limited reasoning capabilities. In this work, we focus on the specific field of chemistry and develop a Chemical Reasoner LLM, ChemDFM-R. We first construct a comprehensive dataset of atomized knowledge points to enhance the model's understanding of the fundamental principles and logical structure of chemistry. Then, we propose a mix-sourced distillation strategy that integrates expert-curated knowledge with general-domain reasoning skills, followed by domain-specific reinforcement learning to enhance chemical reasoning. Experiments on diverse chemical benchmarks demonstrate that ChemDFM-R achieves state-of-the-art performance while providing interpretable, rationale-driven outputs. Further case studies illustrate how explicit reasoning chains significantly improve the reliability, transparency, and practical utility of the model in real-world human-AI collaboration scenarios.
Community
exciting work!
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- A Large Language Model for Chemistry and Retrosynthesis Predictions (2025)
- Boosting LLM's Molecular Structure Elucidation with Knowledge Enhanced Tree Search Reasoning (2025)
- Teaching LLM to Reason: Reinforcement Learning from Algorithmic Problems without Code (2025)
- ChemActor: Enhancing Automated Extraction of Chemical Synthesis Actions with LLM-Generated Data (2025)
- Training a Scientific Reasoning Model for Chemistry (2025)
- A Survey on Large Language Models for Mathematical Reasoning (2025)
- Reasoning-Table: Exploring Reinforcement Learning for Table Reasoning (2025)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper