List Model Training with CompeteSMoE
AI & ML interests
Various topics in AI and ML: large foundation model, computer vision, reinforcement learning, transformers, optimization, recommender system, etc.
Recent Activity
View all activity
-
LIBMoE: A Library for comprehensive benchmarking Mixture of Experts in Large Language Models
Paper • 2411.00918 • Published • 8 -
Fsoft-AIC/Phi3-CLIP-MoE
Image-Text-to-Text • Updated -
Fsoft-AIC/Phi3-SigLiP-MoE
Image-Text-to-Text • Updated -
Fsoft-AIC/Phi3.5-Siglip-MoE
Image-Text-to-Text • Updated
Transformer-based Comment Classifiers through Domain Post-training and Multi-level layer aggregation
-
Fsoft-AIC/dopamin-java-deprecation
Text Classification • 0.2B • Updated • 9 -
Fsoft-AIC/dopamin-java-ownership
Text Classification • 0.2B • Updated • 12 -
Fsoft-AIC/dopamin-java-pointer
Text Classification • 0.2B • Updated • 11 -
Fsoft-AIC/dopamin-java-rational
Text Classification • 0.2B • Updated • 12
A Series of Large Language Model for Mainframe Modernization
List Model Training with CompeteSMoE
-
LIBMoE: A Library for comprehensive benchmarking Mixture of Experts in Large Language Models
Paper • 2411.00918 • Published • 8 -
Fsoft-AIC/Phi3-CLIP-MoE
Image-Text-to-Text • Updated -
Fsoft-AIC/Phi3-SigLiP-MoE
Image-Text-to-Text • Updated -
Fsoft-AIC/Phi3.5-Siglip-MoE
Image-Text-to-Text • Updated
CodeMMLU: A Multi-Task Benchmark for Assessing Code Understanding Capabilities
Transformer-based Comment Classifiers through Domain Post-training and Multi-level layer aggregation
-
Fsoft-AIC/dopamin-java-deprecation
Text Classification • 0.2B • Updated • 9 -
Fsoft-AIC/dopamin-java-ownership
Text Classification • 0.2B • Updated • 12 -
Fsoft-AIC/dopamin-java-pointer
Text Classification • 0.2B • Updated • 11 -
Fsoft-AIC/dopamin-java-rational
Text Classification • 0.2B • Updated • 12
A Series of Large Language Model for Mainframe Modernization