Andyrasika's Collections

Ankush Collection

updated Mar 15

Transformer Articles

  • DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention

    Paper • 2309.14327 • Published Sep 25, 2023 • 22

  • MambaVision: A Hybrid Mamba-Transformer Vision Backbone

    Paper • 2407.08083 • Published Jul 10, 2024 • 33

  • Memory^3: Language Modeling with Explicit Memory

    Paper • 2407.01178 • Published Jul 1, 2024 • 4

  • Teaching Transformers Causal Reasoning through Axiomatic Training

    Paper • 2407.07612 • Published Jul 10, 2024 • 2

  • mistralai/Mamba-Codestral-7B-v0.1

    Updated Aug 23, 2024 • 46.5k • 594

  • PoseLess: Depth-Free Vision-to-Joint Control via Direct Image Mapping with VLM

    Paper • 2503.07111 • Published Mar 10 • 3