Jerrold's picture

Jerrold

JerroldK

AI & ML interests

None yet

Recent Activity

updated a model 20 days ago

JerroldK/H4-14b-contract-extractor-adapter

published a model about 1 month ago

JerroldK/H4-14b-contract-extractor-adapter

updated a model about 1 month ago

JerroldK/Hermes-4-14B-contract-extractor

View all activity

Organizations

None yet

updated a model 20 days ago

JerroldK/H4-14b-contract-extractor-adapter

Updated 20 days ago • 90

published a model about 1 month ago

JerroldK/H4-14b-contract-extractor-adapter

Updated 20 days ago • 90

updated a model about 1 month ago

JerroldK/Hermes-4-14B-contract-extractor

Text Generation • 425k • Updated May 18 • 313

published a model about 2 months ago

JerroldK/Hermes-4-14B-contract-extractor-loramerged-FP8

upvoted an article about 2 months ago

Article

Efficient LLM Pretraining: Packed Sequences and Masked Attention

sirluk

•

Oct 7, 2024

• 71

liked a model 2 months ago

NousResearch/Hermes-4-14B-FP8

Text Generation • 15B • Updated Sep 3, 2025 • 16k • 27

upvoted an article 2 months ago

Article

Continuous batching from first principles

+1

ror, ArthurZ, mcpotato

•

Nov 25, 2025

• 411

liked a model 2 months ago

NousResearch/Hermes-4-14B

Text Generation • 425k • Updated Jan 9 • 69.5k • • 163

liked 2 datasets 3 months ago

reuben256/contract-nli

Viewer • Updated Jun 17, 2025 • 10.3k • 10 • 1

kiddothe2b/contract-nli

Viewer • Updated Jul 27, 2022 • 20.1k • 367 • 18

commented on Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face 4 months ago

In your bitsandbytes config, why are you decompressing the weights to torch.float32, when the native format of phi3 is torch.bfloat16? This seems like a waste of memory

upvoted an article 6 months ago

Article

Design Patterns for Building Agentic Workflows

dcarpintero

•

Jul 14, 2025

• 11

New activity in agents-course/unit_1_quiz 7 months ago

Ai Agents course

#539 opened 7 months ago by