Somehow this is the only version of gpt coder that works. The results are still weak and it starts looping after 60k tokens, but given the speed/accuracy ratio it is a useful model. Can you do it with full weights? What about the 120b instead?