arxiv:2410.04612
Jonathan Chang
jdchang
AI & ML interests
None yet
Organizations
models
95
jdchang/test_rm_8b
Feature Extraction
•
8B
•
Updated
•
13
jdchang/patch_14b
Text Generation
•
15B
•
Updated
•
9
jdchang/norm_test_400
Text Generation
•
15B
•
Updated
•
7
jdchang/norm_test_200
Text Generation
•
15B
•
Updated
•
8
jdchang/norm_test
Text Generation
•
15B
•
Updated
•
8
jdchang/bt-model-lr-7e-06-step-955
2B
•
Updated
•
8
jdchang/bt-model-lr-7e-06-step-954
2B
•
Updated
•
5
jdchang/bt-model-lr-3e-05-step-955
2B
•
Updated
•
8
jdchang/bt-model-lr-1e-05-step-955
2B
•
Updated
•
7
jdchang/bt-model-lr-3e-05-step-954
2B
•
Updated
•
7
datasets
60
jdchang/distill-llama70-n16-rollin-llama-t2s
Viewer
•
Updated
•
302k
•
12
jdchang/distill-qwen32-n16-rollin-llama-t2s
Viewer
•
Updated
•
302k
•
7
jdchang/distill-qwen14-n16-rollin-llama-t2s
Viewer
•
Updated
•
302k
•
42
jdchang/distill-qwen7-n16-rollin-llama-t2s
Viewer
•
Updated
•
302k
•
23
jdchang/distill-llama70-n16-rollin-t2s
Viewer
•
Updated
•
302k
•
31
jdchang/distill-qwen32-n16-rollin-t2s
Viewer
•
Updated
•
302k
•
18
jdchang/distill-qwen14-n16-rollin-t2s
Viewer
•
Updated
•
302k
•
21
jdchang/distill-qwen7-n16-rollin-t2s
Viewer
•
Updated
•
302k
•
35
jdchang/qsharp-bt-mixture
Viewer
•
Updated
•
27.2k
•
15
jdchang/qsharp-bt-32b
Viewer
•
Updated
•
31.9k
•
35