Commit History

Working locally, TBD HF space
2af55e5

luca-peric commited on

Removing CUDA deps
d52b754

luca-peric commited on

Not sure what happened, more deps
847b7ee

luca-peric commited on

Adding again
c318efd

luca-peric commited on

Removing gcc and gnu deps from req file
9825f33

luca-peric commited on

Init for HF space
570eaa9

luca-peric commited on

Visualisation working on CPU via CUDA_VISIBLE_DEVICE=-1 python demo_patcher.py 'Daenerys Targaryen is in Game of Thrones, a fantasy epic by George R.R. Martin.'
41ea791

luca-peric commited on

Improve HF integration (#98)
1b67cbe
unverified

NielsRogge commited on

Open source weights! (#97)
96d51b5
unverified

par-meta commited on

Cast int sample id to str (#96)
e299427
unverified

Srinivasan Iyer sviyer commited on

Init distributed when loading model (#94)
138c2f3
unverified

Srinivasan Iyer sviyer commited on

Fix eval mask (#93)
19a3f75
unverified

Srinivasan Iyer sviyer commited on

remove selective activation checkpointing (#92)
8c1b1a7
unverified

Srinivasan Iyer sviyer commited on

update (#91)
1e78a49
unverified

par-meta commited on

Get generation working for BLT (#86)
b79eb3e
unverified

par-meta commited on

Fix in-place addition of patch_embds (#85)
2dcf48b
unverified

Hanna commited on

Some fixes for entropy model predictions (#83)
fc946a1
unverified

Srinivasan Iyer sviyer commited on

Update ppl evals to work with blt model, in addition to entropy model (#82)
083656c
unverified

par-meta commited on

Update iterate_data (#81)
f84ee63
unverified

par-meta commited on

Add way to call consolidate (#80)
c110f6b
unverified

Srinivasan Iyer sviyer commited on

When merging configs, do not merge data sources (#79)
a5ceaaa
unverified

Srinivasan Iyer sviyer commited on

Get evals working again. (#46)
7517ac2
unverified

par-meta commited on

Reduce per file resources arrow uses (#77)
63913e4
unverified

par-meta commited on

Let process start before yielding preloaded prefetch buffer, avoid needlessly losing buffer in edge cases (#75)
8f2cf88
unverified

par-meta commited on

Add approximate state persistence (#73)
ea1fc75
unverified

par-meta commited on

Fix rsync to not preserve original permissions, instead use destination (#76)
9bd51df
unverified

par-meta commited on

Correctly reset batch iterator at each arrow create_iter call. (#74)
c727844
unverified

par-meta commited on

Pass mask in packing_iterator, correctly handle last batch, fix masking (#65)
08b8c7c
unverified

par-meta commited on

Initialize rope embeddings properly for the entropy model (#72)
0da051f
unverified

Srinivasan Iyer sviyer commited on

Remove byte tokenizer and add config args to switch between byte/patch packing (#68)
aeb95f1
unverified

par-meta commited on

Add vocab and seq len abstract fields (#66)
ff36aa8
unverified

par-meta commited on

Fix: Correct model_args usage in parallelize_model call (#69)
a6ed14f
unverified

Bocheng Li commited on

Update iterator inheritance, pass file format args, limit iterator (#63)
fc3399e
unverified

par-meta commited on

Make apex logs less noisy (#60)
b0956bd
unverified

par-meta commited on

Make it possible to specify multiple config files (#54)
82ab593
unverified

par-meta commited on

fix(README): correct typo in quickstart instructions (#62)
9f29e0d
unverified

CharlesCNorton commited on

using apex rmsnorm (#57)
f3e8125
unverified

Srinivasan Iyer sviyer commited on

Update README.md (#58)
c49e251
unverified

Srinivasan Iyer commited on

Fix multiprocessing dataloader checkpointing and use it in the train script (#50)
8c61ab5
unverified

par-meta commited on

Test first batch matches (#53)
85c2f28
unverified

par-meta commited on

disable reshard after forward (#56)
9d907fe
unverified

Srinivasan Iyer sviyer commited on

make sure max_encoder_seq_length matches (#55)
48e4ad0
unverified

Srinivasan Iyer sviyer commited on

fix save and reload model state (#49)
22c7fe1
unverified

Srinivasan Iyer sviyer commited on

Add bpb and n_bytes to metric logging (#41)
fe45f69
unverified

par-meta commited on

Fix init and repro (#48)
aebdc48
unverified

Srinivasan Iyer sviyer commited on

Allow ArrowIterator to read from json (#45)
936d943
unverified

par-meta commited on

Update checkpointing to use fsspec (#39)
afedb16
unverified

par-meta commited on

Add rope fp32 (#43)
739dc71
unverified

Srinivasan Iyer sviyer commited on

fix stool (#44)
6fbaf72
unverified

Srinivasan Iyer sviyer commited on

Fix wandb logging (#42)
7cf8fab
unverified

Srinivasan Iyer sviyer commited on