Commit History

Updating decription
0d61fe6
Running

luca-peric commited on

local block causal when cuda avail
a37fec7

luca-peric commited on

xformers when cuda available
661d10b

luca-peric commited on

Xformers pkg version update
86969f4

luca-peric commited on

Removing sphinx version spec
4747368

luca-peric commited on

Adding spaces GPU decorator
4ec9e88

luca-peric commited on

Removing specified CUDA version
d026e7c

luca-peric commited on

Changing HF Space python version
75e90d9

luca-peric commited on

Adding HF ZeroGPU torch version compatible
587f6ed

luca-peric commited on

Revert "Removing CUDA deps"
7642f0e

luca-peric commited on

cleaning things up via gemini 2.5 pro
f2f927b

luca-peric commited on

more finishing touches
ad774a9

luca-peric commited on

adding patch counts and cleaning up
545bc06

luca-peric commited on

tiktoken & llama both plotted
b074257

luca-peric commited on

Working locally, TBD HF space
2af55e5

luca-peric commited on

Removing CUDA deps
d52b754

luca-peric commited on

Not sure what happened, more deps
847b7ee

luca-peric commited on

Adding again
c318efd

luca-peric commited on

Removing gcc and gnu deps from req file
9825f33

luca-peric commited on

Init for HF space
570eaa9

luca-peric commited on

Visualisation working on CPU via CUDA_VISIBLE_DEVICE=-1 python demo_patcher.py 'Daenerys Targaryen is in Game of Thrones, a fantasy epic by George R.R. Martin.'
41ea791

luca-peric commited on

Improve HF integration (#98)
1b67cbe
unverified

NielsRogge commited on

Open source weights! (#97)
96d51b5
unverified

par-meta commited on

Cast int sample id to str (#96)
e299427
unverified

Srinivasan Iyer sviyer commited on

Init distributed when loading model (#94)
138c2f3
unverified

Srinivasan Iyer sviyer commited on

Fix eval mask (#93)
19a3f75
unverified

Srinivasan Iyer sviyer commited on

remove selective activation checkpointing (#92)
8c1b1a7
unverified

Srinivasan Iyer sviyer commited on

update (#91)
1e78a49
unverified

par-meta commited on

Get generation working for BLT (#86)
b79eb3e
unverified

par-meta commited on

Fix in-place addition of patch_embds (#85)
2dcf48b
unverified

Hanna commited on

Some fixes for entropy model predictions (#83)
fc946a1
unverified

Srinivasan Iyer sviyer commited on

Update ppl evals to work with blt model, in addition to entropy model (#82)
083656c
unverified

par-meta commited on

Update iterate_data (#81)
f84ee63
unverified

par-meta commited on

Add way to call consolidate (#80)
c110f6b
unverified

Srinivasan Iyer sviyer commited on

When merging configs, do not merge data sources (#79)
a5ceaaa
unverified

Srinivasan Iyer sviyer commited on

Get evals working again. (#46)
7517ac2
unverified

par-meta commited on

Reduce per file resources arrow uses (#77)
63913e4
unverified

par-meta commited on

Let process start before yielding preloaded prefetch buffer, avoid needlessly losing buffer in edge cases (#75)
8f2cf88
unverified

par-meta commited on

Add approximate state persistence (#73)
ea1fc75
unverified

par-meta commited on

Fix rsync to not preserve original permissions, instead use destination (#76)
9bd51df
unverified

par-meta commited on

Correctly reset batch iterator at each arrow create_iter call. (#74)
c727844
unverified

par-meta commited on

Pass mask in packing_iterator, correctly handle last batch, fix masking (#65)
08b8c7c
unverified

par-meta commited on

Initialize rope embeddings properly for the entropy model (#72)
0da051f
unverified

Srinivasan Iyer sviyer commited on

Remove byte tokenizer and add config args to switch between byte/patch packing (#68)
aeb95f1
unverified

par-meta commited on

Add vocab and seq len abstract fields (#66)
ff36aa8
unverified

par-meta commited on

Fix: Correct model_args usage in parallelize_model call (#69)
a6ed14f
unverified

Bocheng Li commited on

Update iterator inheritance, pass file format args, limit iterator (#63)
fc3399e
unverified

par-meta commited on

Make apex logs less noisy (#60)
b0956bd
unverified

par-meta commited on

Make it possible to specify multiple config files (#54)
82ab593
unverified

par-meta commited on