Balancing Specialized and General Skills in LLMs: The Impact of Modern Tuning and Data Strategy Paper • 2310.04945 • Published Oct 7, 2023 • 1
ICE-GRT: Instruction Context Enhancement by Generative Reinforcement based Transformers Paper • 2401.02072 • Published Jan 4, 2024 • 11
A Self-enhancement Approach for Domain-specific Chatbot Training via Knowledge Mining and Digest Paper • 2311.10614 • Published Nov 17, 2023
Balancing Enhancement, Harmlessness, and General Capabilities: Enhancing Conversational LLMs with Direct RLHF Paper • 2403.02513 • Published Mar 4, 2024
Mistral-C2F: Coarse to Fine Actor for Analytical and Reasoning Enhancement in RLHF and Effective-Merged LLMs Paper • 2406.08657 • Published Jun 12, 2024 • 9
Mistral-C2F: Coarse to Fine Actor for Analytical and Reasoning Enhancement in RLHF and Effective-Merged LLMs Paper • 2406.08657 • Published Jun 12, 2024 • 9