LoRE / scripts /train.py

Commit History

change save limit to 1
5af07d9

Charlie81 commited on

add better small expert debugs and del checkpoints
01ec808

Charlie81 commited on

delete checkpoints
5b4ec54

Charlie81 commited on

keeping save steps at 2k
53278e0

Charlie81 commited on

save steps to 5
6154f2d

Charlie81 commited on

changes to training script
3ed7a55

Charlie81 commited on

delete checkpoints all epochs to 3
d4a6b93

Charlie81 commited on

trying changes to git stuff
be580d6

Charlie81 commited on

save functionality
e785830

Charlie81 commited on

YOLO run setup
20c7ba3

Charlie81 commited on

attempt new distribution experts
834ad70

Charlie81 commited on

batch 2 is biggest
f03d4f8

Charlie81 commited on

syntax
dd5034c

Charlie81 commited on

batch 3
85d731b

Charlie81 commited on

batch size 2
d55ddf7

Charlie81 commited on

batch 4
0d64997

Charlie81 commited on

16batch
7d3ca95

Charlie81 commited on

train agaaa
45d6e50

Charlie81 commited on

train aga
580eff8

Charlie81 commited on

fix train
1f3825f

Charlie81 commited on

debugging missing grad
325d2d0

Charlie81 commited on

update training script
f9596a0

Charlie81 commited on

reorder small experts
7050cb6

Charlie81 commited on

unfreeze only gate and experts
356573e

Charlie81 commited on

1 batch size
6b0e19d

Charlie81 commited on

batch size back to 2
14b2125

Charlie81 commited on

batch size 3 lol
f078b84

Charlie81 commited on

batch size to 4
5f8bb1e

Charlie81 commited on

batch size 8
9fc70e4

Charlie81 commited on

modify batch and fix tensor issue
2a594f6

Charlie81 commited on

tokenize fn
5c05368

Charlie81 commited on

tokenize function
1e0b293

Charlie81 commited on

key value
7ab89f2

Charlie81 commited on

restore
7abbd62

Charlie81 commited on

claude attempt 2 dataset
3db4e2e

Charlie81 commited on

cache diagnostics
dd2e997

Charlie81 commited on

claudeattempt dataset
5b01886

Charlie81 commited on

alternative dataset load
d7f70e5

Charlie81 commited on

dataset keep in memory
1182794

Charlie81 commited on

ignore mismatches
e039ec3

Charlie81 commited on

fix import
78b85e8

Charlie81 commited on

overhaul
c4785c5

Charlie81 commited on

reset modeling file
36acce3

Charlie81 commited on

attempt fix and more prints
a82f934

Charlie81 commited on

init expanded model after config change
438a56a

Charlie81 commited on