David Fu

debisoft

https://www.tenatch.com

AI & ML interests

AI/ML

debisoft's models (59)

debisoft/smol-course-SmolVLM2-2.2B-Instruct-trl-sft-ChartQA

Updated Nov 19, 2025

debisoft/smollm3-dpo-aligned-peft

Updated Nov 9, 2025

debisoft/smollm3-dpo-aligned

Updated Nov 9, 2025

debisoft/rl_course_vizdoom_health_gathering_supreme

Reinforcement Learning • Updated Oct 25, 2025

debisoft/ppo-LunarLander-v2

Reinforcement Learning • Updated Oct 24, 2025 • 3

debisoft/ppo-CartPole-v2

Reinforcement Learning • Updated Oct 24, 2025

debisoft/ppo-CartPole-v1

Reinforcement Learning • Updated Oct 24, 2025

debisoft/poca-SoccerTwos-2

Reinforcement Learning • Updated Oct 21, 2025 • 575

debisoft/poca-SoccerTwos

Reinforcement Learning • Updated Oct 21, 2025 • 680

debisoft/a2c-PandaPickAndPlace-v3

Reinforcement Learning • Updated Oct 18, 2025 • 7

debisoft/sac-PandaPickAndPlace-v3

Reinforcement Learning • Updated Oct 18, 2025 • 10

debisoft/tqc-PandaPickAndPlace-v3

Reinforcement Learning • Updated Oct 18, 2025 • 9

debisoft/a2c-PandaReachDense-v3

Reinforcement Learning • Updated Oct 17, 2025 • 7

debisoft/Reinforce-Pixelcopter-PLE-v0-checkpt

Reinforcement Learning • Updated Oct 13, 2025

debisoft/ppo-Pyramids

Reinforcement Learning • Updated Oct 13, 2025 • 1

debisoft/ppo-SnowballTarget

Reinforcement Learning • Updated Oct 13, 2025

debisoft/Reinforce-Pixelcopter-PLE-v0

Reinforcement Learning • Updated Oct 11, 2025

debisoft/Reinforce-Cartpole-v1

Reinforcement Learning • Updated Oct 5, 2025

debisoft/ppo-LunarLander-v3-x

Reinforcement Learning • Updated Sep 28, 2025 • 5

debisoft/SpaceInvadersNoFrameskip-v4

Reinforcement Learning • Updated Sep 23, 2025 • 8

debisoft/Taxi-v3-5x5-noRain

Reinforcement Learning • Updated Sep 21, 2025

debisoft/q-FrozenLake-v1-4x4-noSlippery

Reinforcement Learning • Updated Sep 20, 2025

debisoft/ppo-Huggy

Reinforcement Learning • Updated Aug 5, 2025

debisoft/ppo-LunarLander-v2-x

Reinforcement Learning • Updated Aug 4, 2025

debisoft/mistral-nemo-12b-instruct-thinking-function_calling-logic-capturing-V0

Updated Jun 19, 2025

debisoft/mistral-nemo-12b-base-thinking-function_calling-logic-capturing-V0

Updated Jun 19, 2025

debisoft/mistral-nemo-minitron-8b-base-thinking-function_calling-logic-capturing-V0

Updated Jun 17, 2025

debisoft/mistral-nemo-minitron-8b-instruct-thinking-function_calling-logic-capturing-V0

Updated Jun 17, 2025

debisoft/mistral-nemo-minitron-8B-Instruct-thinking-function_calling-V0

Updated Jun 17, 2025

debisoft/openmath-mistral-7b-thinking-function_calling-logic-capturing-V0

Updated Jun 17, 2025