shivank21/dpo_Llama-3-8B-9455-1883 • Text Generation • 3
shivank21/dpo_Llama-3-8B-9455-1800 • Text Generation • 1
shivank21/dpo_Qwen2-7B-9455-1883 • Text Generation
shivank21/dpo_Qwen2-7B-9455-1800 • Text Generation • 2
shivank21/dpo_deepseek-llm-7b-9455-1883 • Text Generation
shivank21/dpo_deepseek-llm-7b-9455-1800 • Text Generation • 8
shivank21/diag_agent_Qwen2-7B-9455 • 8B • 3
shivank21/diag_agent_Qwen2-7B-7500 • 8B • 1
shivank21/diag_agent_Qwen2-7B • Text Generation • 333k • 1
shivank21/diag_agent_deepseek-llm-7b-9455 • 7B • 10
shivank21/diag_agent_deepseek-llm-7b-3500 • 7B • 1
shivank21/diag_agent_Meta-Llama-3-8B • Text Generation • 8B • 9
shivank21/diag_agent_Meta-Llama-3-8B-9000 • 8B
shivank21/diag_agent_deepseek-llm-7b-base • Text Generation • 250k
shivank21/smolvlm-instruct_base_2k_random • 2B • 2
shivank21/smolvlm-instruct_mix_2k_5ep_0_1_only_text • 2B • 2
shivank21/smolvlm-instruct_mix_2k_5ep_0_1 • 2B • 3
shivank21/smolvlm-instruct_mix_2k • 2B • 2
shivank21/mistral_dpo_reward_full
shivank21/Mistral_dpo_reward_code • Text Generation • 8B • 2
shivank21/llama_dpo_reward_full
shivank21/llama_dpo_reward • Text Generation • 1B
shivank21/llama_dpo_self_full
Text Generation • 1B • 1
shivank21/llama_dpo_ours_full
shivank21/mistral_dpo_self_full
shivank21/mistral_dpo_self