Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Charlie81
/
LoRE
like
0
TensorBoard
Safetensors
License:
mit
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
main
LoRE
/
scripts
/
train.py
Commit History
change save limit to 1
5af07d9
Charlie81
commited on
Sep 2, 2025
add better small expert debugs and del checkpoints
01ec808
Charlie81
commited on
Sep 2, 2025
delete checkpoints
5b4ec54
Charlie81
commited on
Aug 12, 2025
keeping save steps at 2k
53278e0
Charlie81
commited on
Jul 15, 2025
save steps to 5
6154f2d
Charlie81
commited on
Jul 14, 2025
changes to training script
3ed7a55
Charlie81
commited on
Jul 14, 2025
delete checkpoints all epochs to 3
d4a6b93
Charlie81
commited on
Jul 12, 2025
trying changes to git stuff
be580d6
Charlie81
commited on
Jul 12, 2025
save functionality
e785830
Charlie81
commited on
Jul 12, 2025
YOLO run setup
20c7ba3
Charlie81
commited on
Jul 12, 2025
attempt new distribution experts
834ad70
Charlie81
commited on
Jul 11, 2025
batch 2 is biggest
f03d4f8
Charlie81
commited on
Jul 11, 2025
syntax
dd5034c
Charlie81
commited on
Jul 11, 2025
batch 3
85d731b
Charlie81
commited on
Jul 11, 2025
batch size 2
d55ddf7
Charlie81
commited on
Jul 11, 2025
batch 4
0d64997
Charlie81
commited on
Jul 11, 2025
comma
d2653ee
Charlie81
commited on
Jul 11, 2025
16batch
7d3ca95
Charlie81
commited on
Jul 11, 2025
train agaaa
45d6e50
Charlie81
commited on
Jul 11, 2025
train aga
580eff8
Charlie81
commited on
Jul 11, 2025
fix train
1f3825f
Charlie81
commited on
Jul 11, 2025
debugging missing grad
325d2d0
Charlie81
commited on
Jul 10, 2025
update training script
f9596a0
Charlie81
commited on
Jul 10, 2025
reorder small experts
7050cb6
Charlie81
commited on
Jul 7, 2025
unfreeze only gate and experts
356573e
Charlie81
commited on
Jul 7, 2025
1 batch size
6b0e19d
Charlie81
commited on
Jul 7, 2025
batch size back to 2
14b2125
Charlie81
commited on
Jul 6, 2025
batch size 3 lol
f078b84
Charlie81
commited on
Jul 6, 2025
batch size to 4
5f8bb1e
Charlie81
commited on
Jul 6, 2025
batch size 8
9fc70e4
Charlie81
commited on
Jul 6, 2025
modify batch and fix tensor issue
2a594f6
Charlie81
commited on
Jul 6, 2025
tokenize fn
5c05368
Charlie81
commited on
Jul 6, 2025
fix
52bdc02
Charlie81
commited on
Jul 6, 2025
add
d6ffab2
Charlie81
commited on
Jul 6, 2025
tokenize function
1e0b293
Charlie81
commited on
Jul 6, 2025
debugs
6d21fca
Charlie81
commited on
Jul 6, 2025
key value
7ab89f2
Charlie81
commited on
Jul 6, 2025
restore
7abbd62
Charlie81
commited on
Jul 6, 2025
claude attempt 2 dataset
3db4e2e
Charlie81
commited on
Jul 6, 2025
cache diagnostics
dd2e997
Charlie81
commited on
Jul 6, 2025
sanity
8e88ea1
Charlie81
commited on
Jul 6, 2025
claudeattempt dataset
5b01886
Charlie81
commited on
Jul 6, 2025
alternative dataset load
d7f70e5
Charlie81
commited on
Jul 6, 2025
dataset keep in memory
1182794
Charlie81
commited on
Jul 6, 2025
ignore mismatches
e039ec3
Charlie81
commited on
Jul 6, 2025
fix import
78b85e8
Charlie81
commited on
Jul 6, 2025
overhaul
c4785c5
Charlie81
commited on
Jul 6, 2025
reset modeling file
36acce3
Charlie81
commited on
Jul 6, 2025
attempt fix and more prints
a82f934
Charlie81
commited on
Jul 5, 2025
init expanded model after config change
438a56a
Charlie81
commited on
Jul 5, 2025
Previous
1
2
Next