Uploaded model
- Developed by: robust-rlhf
- License: apache-2.0
- Finetuned from model : unsloth/Llama-3.3-70B-Instruct-bnb-4bit
This llama model was trained 2x faster with Unsloth and Huggingface's TRL library.
	Inference Providers
	NEW
	
	
	This model isn't deployed by any Inference Provider.
	๐
			
		Ask for provider support
Model tree for robust-rlhf/Llama-3.3-70B-Instruct-bnb-4bit_docs30k_r64_lr1e-5_epochs1
Base model
meta-llama/Llama-3.1-70B
				Finetuned
	
	
meta-llama/Llama-3.3-70B-Instruct
						
				Quantized
	
	
unsloth/Llama-3.3-70B-Instruct-bnb-4bit
						