Asap7772/Qwen3-4B-second-stage-DPO-lr-1e-7-beta-0.1-loss-sigmoid-rpo-1.0-ckpt-135
Tags: Safetensors, qwen3, trl, dpo, rlhf, alignment
License: apache-2.0
Branch: main
Total size: 8.06 GB
1 contributor
History: 2 commits
Latest commit: 884089d (verified), "Upload checkpoint from checkpoint-135", by Asap7772, about 1 month ago
Files (every file was last changed by the commit "Upload checkpoint from checkpoint-135", about 1 month ago; "-" means the page shows no value):

File | Scan | Size | Storage
.gitattributes | Safe | 1.57 kB | -
README.md | - | 7.18 kB | -
added_tokens.json | Safe | 707 Bytes | -
chat_template.jinja | Safe | 4.04 kB | -
config.json | Safe | 1.54 kB | -
generation_config.json | Safe | 187 Bytes | -
latest | Safe | 14 Bytes | -
merges.txt | Safe | 1.67 MB | -
model-00001-of-00002.safetensors | - | 4.97 GB | xet
model-00002-of-00002.safetensors | - | 3.08 GB | xet
model.safetensors.index.json | Safe | 32.9 kB | -
rng_state_0.pth | pickle | 16.4 kB | xet
rng_state_1.pth | pickle | 16.4 kB | xet
rng_state_2.pth | pickle | 16.4 kB | xet
rng_state_3.pth | pickle | 16.4 kB | xet
rng_state_4.pth | pickle | 16.4 kB | xet
rng_state_5.pth | pickle | 16.4 kB | xet
rng_state_6.pth | pickle | 16.4 kB | xet
rng_state_7.pth | pickle | 16.4 kB | xet
scheduler.pt | pickle | 1.47 kB | xet
special_tokens_map.json | Safe | 613 Bytes | -
tokenizer.json | Safe | 11.4 MB | xet
tokenizer_config.json | Safe | 5.4 kB | -
trainer_state.json | - | 72.9 kB | -
training_args.bin | pickle | 8.4 kB | xet
vocab.json | Safe | 2.78 MB | -
zero_to_fp32.py | Safe | 33.3 kB | -

Detected Pickle imports (7), identical for rng_state_0.pth through rng_state_7.pth:
"torch._utils._rebuild_tensor_v2", "numpy.dtype", "_codecs.encode", "numpy.ndarray", "numpy._core.multiarray._reconstruct", "torch.ByteStorage", "collections.OrderedDict"

Pickle imports, scheduler.pt:
No problematic imports detected

Detected Pickle imports (15), training_args.bin:
"accelerate.utils.dataclasses.DistributedType", "accelerate.state.PartialState", "transformers.trainer_utils.SaveStrategy", "torch.device", "torch.bfloat16", "transformers.integrations.deepspeed.HfDeepSpeedConfig", "transformers.integrations.deepspeed.HfTrainerDeepSpeedConfig", "transformers.trainer_utils.HubStrategy", "transformers.training_args.OptimizerNames", "trl.trainer.dpo_config.DPOConfig", "trl.trainer.dpo_config.FDivergenceType", "accelerate.utils.dataclasses.DeepSpeedPlugin", "transformers.trainer_pt_utils.AcceleratorConfig", "transformers.trainer_utils.IntervalStrategy", "transformers.trainer_utils.SchedulerType"
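
The "Detected Pickle imports" entries in the listing name the modules and attributes that each pickle file would import when loaded. As a rough illustration of how such a list can be obtained without unpickling anything, the sketch below walks the opcode stream with the standard-library `pickletools` module. The function name `list_pickle_imports` is hypothetical, this is not the scanner the Hub itself uses, and the handling of `STACK_GLOBAL` is a simplifying heuristic that can miss or misattribute names when the pickle reuses memoized strings.

```python
import pickle
import pickletools
from collections import OrderedDict


def list_pickle_imports(data: bytes) -> set[str]:
    """Return the 'module.name' globals referenced by a pickle stream.

    The stream is only parsed, never executed, so no code from it runs.
    """
    imports = set()
    strings = []  # string operands seen so far, used to resolve STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Protocols 0-3: the argument is "module name", joined by one space.
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: module and name are taken from the stack. Heuristic:
            # assume they are the two most recently pushed strings.
            if len(strings) >= 2:
                imports.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return imports


# Small self-contained demo on an in-memory pickle (not a file from this repo).
demo = pickle.dumps(OrderedDict(a=1), protocol=4)
print(sorted(list_pickle_imports(demo)))
```

To apply this to a file from the listing, the raw pickle bytes have to be extracted first; checkpoint files written by recent PyTorch versions are typically zip archives that wrap the pickle, so passing the `.pth` file's bytes directly is not expected to work.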