Running 115 115 TxT360: Trillion Extracted Text π Create a large, deduplicated dataset for LLM pre-training
bunkalab/Phi-3-mini-128k-instruct-LinearBunkaScore-4.6k-DPO Text Generation β’ 4B β’ Updated May 30, 2024 β’ 19 β’ 2
OrdalieTech/Solon-embeddings-large-0.1 Feature Extraction β’ 0.6B β’ Updated Mar 26, 2024 β’ 27.6k β’ β’ 51
MoritzLaurer/deberta-v3-base-zeroshot-v1 Zero-Shot Classification β’ 0.2B β’ Updated Nov 29, 2023 β’ 1.86k β’ β’ 38