Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
538.8
TFLOPS
1
60
51
Jaward Sesay
Jaward
Follow
sylas8765's profile picture
MohammedEltoum's profile picture
Parkerlambert123's profile picture
392 followers
·
24 following
https://github.com/Jaykef
JawardSesay_
Jaykef
AI & ML interests
Building Lectūra Labs | CS Grad Student @BIT | AI/ML Research: Autonomous Agents, LLMs | Building The Cursor for Learning | Role Model Karpathy
Recent Activity
posted
an
update
about 12 hours ago
Incredible work!! They claim this is the year of recursive language models (I hope so). As models get bigger and better managing their context windows to fit longer prompts has been a standing engineering problem. They propose an inference technique that allows the model to externally crunch down long prompts into snippets that it can recursively call itself on, instead of directly feeding the entire prompt into the transformer. This could make models cheaper and more efficient but I doubt if big tech will adopt it since they profit more with the current approach (bigger models = longer context windows = more expensive the model). Once again such work came from academia/oss community cuz I doubt big tech would have shared these findings lol. They probably have much better inference methods that we may never know of haha. Paper: https://arxiv.org/pdf/2512.24601
liked
a Space
about 2 months ago
HuggingFaceTB/smol-training-playbook
upvoted
a
paper
2 months ago
Emu3.5: Native Multimodal Models are World Learners
View all activity
Organizations
Jaward
's datasets
2
Sort: Recently updated
Jaward/krio_vision_qa
Viewer
•
Updated
Oct 4, 2025
•
200
•
7
Jaward/Krio-Corpus
Viewer
•
Updated
Sep 17, 2025
•
33
•
21
•
1