FlagOpen

community

https://github.com/FlagOpen

AI & ML interests

None defined yet.

authored a paper 3 months ago

Towards Automated Kernel Generation in the Era of LLMs

Paper • 2601.15727 • Published Jan 22 • 19

submitted a paper to Daily Papers 3 months ago

Towards Automated Kernel Generation in the Era of LLMs

Paper • 2601.15727 • Published Jan 22 • 19

authored a paper 3 months ago

VQ-Seg: Vector-Quantized Token Perturbation for Semi-Supervised Medical Image Segmentation

Paper • 2601.10124 • Published Jan 15 • 4

authored a paper 7 months ago

FlagEval Findings Report: A Preliminary Evaluation of Large Reasoning Models on Automatically Verifiable Textual and Visual Questions

Paper • 2509.17177 • Published Sep 21, 2025 • 13

authored a paper 9 months ago

Trainable Dynamic Mask Sparse Attention

Paper • 2508.02124 • Published Aug 4, 2025 • 19

authored 2 papers 11 months ago

Infinity Instruct: Scaling Instruction Selection and Synthesis to Enhance Language Models

Paper • 2506.11116 • Published Jun 9, 2025 • 5

CCI4.0: A Bilingual Pretraining Dataset for Enhancing Reasoning in Large Language Models

Paper • 2506.07463 • Published Jun 9, 2025 • 12

authored a paper 11 months ago

MiMo-VL Technical Report

Paper • 2506.03569 • Published Jun 4, 2025 • 80

authored a paper 12 months ago

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12, 2025 • 83

updated 4 models about 1 year ago

flagopen/starcoderbase-1b-taco

Text Generation • Updated Mar 19, 2025 • 13

flagopen/starcoder-15b-taco

Text Generation • Updated Mar 19, 2025 • 15

flagopen/codegen25-mono-taco

Text Generation • Updated Mar 19, 2025 • 11

flagopen/CodeLlama-7b-Python-taco

Text Generation • Updated Mar 19, 2025 • 11

authored 3 papers over 1 year ago

Aquila2 Technical Report

Paper • 2408.07410 • Published Aug 14, 2024 • 15

AquilaMoE: Efficient Training for MoE Models with Scale-Up and Scale-Out Strategies

Paper • 2408.06567 • Published Aug 13, 2024 • 2

CCI3.0-HQ: a large-scale Chinese dataset of high quality designed for pre-training large language models

Paper • 2410.18505 • Published Oct 24, 2024 • 11

authored a paper over 1 year ago

Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data

Paper • 2410.18558 • Published Oct 24, 2024 • 19

authored a paper over 1 year ago

Infinity-MM: Scaling Multimodal Performance with Large-Scale and High-Quality Instruction Data

Paper • 2410.18558 • Published Oct 24, 2024 • 19

authored 2 papers over 1 year ago

TACO: Topics in Algorithmic COde generation dataset

Paper • 2312.14852 • Published Dec 22, 2023 • 4

UniTabE: A Universal Pretraining Protocol for Tabular Foundation Model in Data Science

Paper • 2307.09249 • Published Jul 18, 2023