Spaces:
Runtime error
Runtime error
Commit History
feat: scan layers + gradient checkpointing (#161)
07a6f9a
unverified
Merge branch 'main' of https://github.com/borisdayma/dalle-mini into main
bcd360f
feat(text): support emojis (#154)
7ef7bd9
unverified
fix: smelu
7f2f8ed
fix: sinkformer
2c583b3
fix: support smelu
a2dcee4
feat: allow relative position (#156)
769d20a
unverified
feat: sinkhorn in lse mode (#155)
00d4661
unverified
fix: sinkformer gradient
eed4896
feat(model): allow bias (#152)
361a994
unverified
feat: add sinkformer + custom final ln + pre-ln (#151)
f139b0b
unverified
feat: placeholders for more config
69bcbeb
feat: force final ln in encoder
32f4ba5
feat: allow more configurations
5bd4c20
fix: DeepNet doesn't scale weights of embedding/output layers (#150)
503d6b4
unverified
Shuming Ma
Shuming Ma
commited on
feat: remove unecessary LN
02824a7
feat: add cogview
472c4cc
fix(textnormalizer): consider utf8 on windows (#148)
3b8d8cb
unverified
illtellyoulater
commited on
feat: implement transformer variants (#144)
542378c
unverified
feat(data): super conditioning (#141)
7939874
unverified
feat: support pod (#139)
803ccbf
unverified
feat: handle gradient checkpointing
5173ec7
feat: load from bucket
1c4e839
feat: reduce artifact space + offset step
34cf91c
feat: restore weights on CPU
5f954fc
fix: position embedding for generate method
ebac379
fix: typo
68cc185
fix: load from checkpoint
44b7c3e
feat(modeling): simplify abstract_init
fa72aa7
feat: handle model parallel
1bb3269
style(tokenizer): remove unused variables
605df32
feat: use fast tokenizer
767d78a
feat(train): distributed_shampoo with pjit
cc34d07
feat(model): clean way to load on cpu
12f323d
feat(train): no batch dimension with pjit
df1fe19
feat: use_artifact if run existing
a5ed112
Style (isort).
f9d51f7
Pedro Cuenca
commited on
Tokenizer, config, model can be loaded from wandb.
7e48337
Pedro Cuenca
commited on
Change import order again.
2b2be9b
Pedro Cuenca
commited on
Fix import order to make isort happy.
64d99b2
Pedro Cuenca
commited on
Accept changes suggested by linter.
9f522b8
Pedro Cuenca
commited on
Never consider local dirs as remote wandb references.
08dd098
Pedro Cuenca
commited on
Store resolved path after loading model.
55a631d
Pedro Cuenca
commited on
Override from_pretrained to support wandb artifacts.
1023afa
Pedro Cuenca
commited on