Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
144.5
TFLOPS
681
25
239
Arthur Zucker
ArthurZ
Follow
dlzwl's profile picture
Poorni555's profile picture
Cqcpzx's profile picture
420 followers
Β·
72 following
art_zucker
ArthurZucker
AI & ML interests
None yet
Recent Activity
reacted
to
danieldk
's
post
with π€
5 days ago
We have been working on a project called `kernels`. `kernels` makes it possible to load compute kernels directly from the Hub! π We plan to give kernels a more proper introduction soon. But for those who have been following along, we are happy to announce a new release: - New layer API with `torch.compile` support. - Experimental support for loading Apple Silicon Metal π€ Kernels. - Generate wheels from Hub kernels for legacy deployments. Full release notes here: https://github.com/huggingface/kernels/releases/tag/v0.6.0
reacted
to
danieldk
's
post
with π₯
5 days ago
We have been working on a project called `kernels`. `kernels` makes it possible to load compute kernels directly from the Hub! π We plan to give kernels a more proper introduction soon. But for those who have been following along, we are happy to announce a new release: - New layer API with `torch.compile` support. - Experimental support for loading Apple Silicon Metal π€ Kernels. - Generate wheels from Hub kernels for legacy deployments. Full release notes here: https://github.com/huggingface/kernels/releases/tag/v0.6.0
liked
a Space
12 days ago
nanotron/ultrascale-playbook
View all activity
Organizations
ArthurZ
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
mistralai/Devstral-Small-2505
about 1 month ago
Adding transformers tag for better tracking of library
π₯
π
3
1
#2 opened about 1 month ago by
reach-vb
New activity in
meta-llama/Llama-4-Scout-17B-16E-Instruct
about 2 months ago
No attribute `sliding_window`?
2
#59 opened 3 months ago by
farzadab
Does LLama4 have chunked attention in generation phase ?
2
#64 opened 3 months ago by
vanshils
New activity in
meta-llama/Llama-4-Maverick-17B-128E-Instruct
3 months ago
remove <|finetune_right_pad_id|> and change pad_token to <|finetune_right_pad|>
1
#25 opened 3 months ago by
wukaixingxp
New activity in
meta-llama/Llama-4-Scout-17B-16E-Instruct
3 months ago
pad error
π
β
7
8
#25 opened 3 months ago by
bobber
Bug in AutoModel
π
1
3
#26 opened 3 months ago by
random-checkin
New activity in
meta-llama/Llama-4-Scout-17B-16E
3 months ago
Cannot generate with BS > 1
1
#25 opened 3 months ago by
chenjiel
New activity in
meta-llama/Llama-4-Maverick-17B-128E-Instruct
3 months ago
change to spda
2
#14 opened 3 months ago by
wukaixingxp
New activity in
mistral-community/pixtral-12b
5 months ago
Fastest way for inference?
3
#28 opened 5 months ago by
psycy
New activity in
deepseek-ai/DeepSeek-R1
5 months ago
model-00078-of-000163.safetensors not marked safe?
2
#80 opened 5 months ago by
aborst
New activity in
mistralai/Pixtral-Large-Instruct-2411
8 months ago
Upload transformers version
10
#3 opened 8 months ago by
ArthurZ
New activity in
huggingface/documentation-images
8 months ago
Upload Meta-Llama-3-8B-Instruct, seqlen = 512, python, w_ compile.png
1
#392 opened 8 months ago by
kwen2501
New activity in
mistral-community/pixtral-12b
9 months ago
Update model weight
8
#13 opened 9 months ago by
nguyen-brat
Update hidden_act to silu
2
#14 opened 9 months ago by
ArthurZ
New activity in
rhymes-ai/Aria
9 months ago
llama.cpp support
π₯
π
11
9
#1 opened 9 months ago by
ayyylol
New activity in
google/gemma-2-2b-jpn-it
9 months ago
tokenizer_config.json is different from gemma-2-2b-it
2
#8 opened 9 months ago by
dahara1
New activity in
mistral-community/pixtral-12b
9 months ago
How can i use the full 24GB model instead of this separated safetensors files?
1
#8 opened 9 months ago by
Valadaro
New activity in
meta-llama/Llama-3.2-11B-Vision-Instruct
10 months ago
hidden_activation vs hidden_act in config.json
2
#10 opened 10 months ago by
heheda
New activity in
mistral-community/pixtral-12b-240910
10 months ago
How to use safetensors?
2
#13 opened 10 months ago by
prathi1729
New activity in
mistral-community/pixtral-12b
10 months ago
lamma cpp ht to gguf not working
4
#2 opened 10 months ago by
RameshRajamani
Load more