blt-entropy-patcher / bytelatent

Commit History

local block causal when cuda avail
a37fec7

lucalp committed on

xformers when cuda available
661d10b

lucalp committed on

more finishing touches
ad774a9

lucalp committed on

Working locally, TBD HF space
2af55e5

lucalp committed on

Visualisation working on CPU via CUDA_VISIBLE_DEVICE=-1 python demo_patcher.py 'Daenerys Targaryen is in Game of Thrones, a fantasy epic by George R.R. Martin.'
41ea791

lucalp committed on

Improve HF integration (#98)
1b67cbe
unverified

NielsRogge committed on

Open source weights! (#97)
96d51b5
unverified

par-meta committed on

Cast int sample id to str (#96)
e299427
unverified

Srinivasan Iyer sviyer committed on

Init distributed when loading model (#94)
138c2f3
unverified

Srinivasan Iyer sviyer committed on

Fix eval mask (#93)
19a3f75
unverified

Srinivasan Iyer sviyer committed on

remove selective activation checkpointing (#92)
8c1b1a7
unverified

Srinivasan Iyer sviyer committed on

Get generation working for BLT (#86)
b79eb3e
unverified

par-meta committed on

Fix in-place addition of patch_embds (#85)
2dcf48b
unverified

Hanna committed on

Some fixes for entropy model predictions (#83)
fc946a1
unverified

Srinivasan Iyer sviyer committed on

Update ppl evals to work with blt model, in addition to entropy model (#82)
083656c
unverified

par-meta committed on

Update iterate_data (#81)
f84ee63
unverified

par-meta committed on

Add way to call consolidate (#80)
c110f6b
unverified

Srinivasan Iyer sviyer committed on

When merging configs, do not merge data sources (#79)
a5ceaaa
unverified

Srinivasan Iyer sviyer committed on

Get evals working again. (#46)
7517ac2
unverified

par-meta committed on

Reduce per file resources arrow uses (#77)
63913e4
unverified

par-meta committed on

Let process start before yielding preloaded prefetch buffer, avoid needlessly losing buffer in edge cases (#75)
8f2cf88
unverified

par-meta committed on

Add approximate state persistence (#73)
ea1fc75
unverified

par-meta committed on

Fix rsync to not preserve original permissions, instead use destination (#76)
9bd51df
unverified

par-meta committed on

Correctly reset batch iterator at each arrow create_iter call. (#74)
c727844
unverified

par-meta committed on

Pass mask in packing_iterator, correctly handle last batch, fix masking (#65)
08b8c7c
unverified

par-meta committed on

Initialize rope embeddings properly for the entropy model (#72)
0da051f
unverified

Srinivasan Iyer sviyer committed on

Remove byte tokenizer and add config args to switch between byte/patch packing (#68)
aeb95f1
unverified

par-meta committed on

Add vocab and seq len abstract fields (#66)
ff36aa8
unverified

par-meta committed on

Fix: Correct model_args usage in parallelize_model call (#69)
a6ed14f
unverified

Bocheng Li committed on

Update iterator inheritance, pass file format args, limit iterator (#63)
fc3399e
unverified

par-meta committed on

Make apex logs less noisy (#60)
b0956bd
unverified

par-meta committed on

Make it possible to specify multiple config files (#54)
82ab593
unverified

par-meta committed on

using apex rmsnorm (#57)
f3e8125
unverified

Srinivasan Iyer sviyer committed on

Fix multiprocessing dataloader checkpointing and use it in the train script (#50)
8c61ab5
unverified

par-meta committed on

Test first batch matches (#53)
85c2f28
unverified

par-meta committed on

disable reshard after forward (#56)
9d907fe
unverified

Srinivasan Iyer sviyer committed on

make sure max_encoder_seq_length matches (#55)
48e4ad0
unverified

Srinivasan Iyer sviyer committed on

fix save and reload model state (#49)
22c7fe1
unverified

Srinivasan Iyer sviyer committed on

Add bpb and n_bytes to metric logging (#41)
fe45f69
unverified

par-meta committed on

Fix init and repro (#48)
aebdc48
unverified

Srinivasan Iyer sviyer committed on

Allow ArrowIterator to read from json (#45)
936d943
unverified

par-meta committed on

Update checkpointing to use fsspec (#39)
afedb16
unverified

par-meta committed on

Add rope fp32 (#43)
739dc71
unverified

Srinivasan Iyer sviyer committed on

fix stool (#44)
6fbaf72
unverified

Srinivasan Iyer sviyer committed on

Fix wandb logging (#42)
7cf8fab
unverified

Srinivasan Iyer sviyer committed on

Fix distributed all reduce grad norm (#40)
c79b1fd
unverified

par-meta committed on

This includes fixes that make checkpointing and reloading work correctly. (#35)
7044771
unverified

par-meta committed on

Initial codes and scripts for training entropy model (#34)
7622d28
unverified

par-meta committed on

Use load_async flag to not start MP iterator (#33)
a809259
unverified

par-meta committed on

Update file check script to check sizes (#32)
bc42ceb
unverified

par-meta committed on