Progressive Growth Transformers (PGT)
[pretrain] Transformers grown layer-by-layer on frozen embeddings. Explores emergent capabilities with depth.
Collection by Bochkov, 1 day ago
- Bochkov/abs-bvv-6 • Text Generation • Updated 1 day ago • 10
- Bochkov/abs-bvv-5 • Text Generation • Updated 1 day ago • 8
- Bochkov/abs-bvv-4 • Text Generation • Updated 1 day ago • 8
- Bochkov/abs-bvv-3 • Text Generation • Updated 1 day ago • 10

Pro models
[pretrain] Frozen-embedding LMs for English, Russian, Chinese; demonstration & comparison with standard LM.
Collection by Bochkov, 1 day ago
- Bochkov/pro_bvv_en • Text Generation • Updated 1 day ago • 27
- Bochkov/pro_bvv_unfrozen • Text Generation • Updated 1 day ago • 22
- Bochkov/pro_bvv_ru • Text Generation • Updated 1 day ago • 13
- Bochkov/pro_bvv_zh • Text Generation • Updated 1 day ago • 11

Max models
[pretrain] Multilingual language model collection with frozen, unified Unicode-based embeddings. Includes Russian, Chinese, and their MoE fusion.
Collection by Bochkov, 1 day ago
- Bochkov/max_bvv_moe • Updated 1 day ago • 12
- Bochkov/max_bvv_ru • Updated 1 day ago • 22
- Bochkov/max_bvv_zh • Updated 1 day ago • 16
- Emergent Semantics Beyond Token Embeddings: Transformer LMs with Frozen Visual Unicode Representations • Paper • 2507.04886 • Published 5 days ago • 1

Best demo models
[pretrain] Frozen embedding LMs (en/ru/zh) & their MoE fusion. Baselines: frozen vs unfrozen embedding ablation.
Collection by Bochkov, 1 day ago
- Bochkov/best_bvv_moe • Updated 1 day ago • 15
- Bochkov/best_bvv_ru • Updated 1 day ago • 13
- Bochkov/best_bvv_unfrozen_ru • Updated 1 day ago • 16
- Bochkov/best_bvv_zh • Updated 1 day ago • 13

Nemo models
[pretrain] Proof of concept: SOTA tokenizers can be used to build compatible precomputed embeddings; industry can repeat this with its own tokenizers.
Collection by Bochkov, 1 day ago
- Bochkov/nemo_bvv_moe • Updated 1 day ago • 19
- Bochkov/nemo_bvv_ru • Updated 1 day ago • 17
- Bochkov/nemo_bvv_zh • Updated 1 day ago • 13
- Emergent Semantics Beyond Token Embeddings: Transformer LMs with Frozen Visual Unicode Representations • Paper • 2507.04886 • Published 5 days ago • 1

Tokenizers
This collection features frozen, precomputed token embedding tensors designed for experimentation with semantic emergence in language models.
Collection by Bochkov, 1 day ago
- Bochkov/bvv241-2-3 • Updated 1 day ago • 11
- Bochkov/bvv241-max • Updated 1 day ago • 12
- Bochkov/bvv241-nemo • Updated 1 day ago • 8
- Bochkov/bvv241-abs • Updated 1 day ago • 11