Not Uncensored

#5
by ares2324 - opened

The model still refuses a lot of harmful prompts. The KL divergence may be lower than with other abliteration techniques, but that doesn't matter much if, at the end of the day, the model refuses so many things. Still great though: achieving this kind of KL divergence is a huge success. I just wish it were more uncensored.

The tool is intended to be used with your own prompts: you need to engineer a set of prompts you want to pass and a set you don't mind failing. The trial that passes the most prompts from your pass list with the lowest divergence is your keeper.
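As a rough illustration of that selection rule, here is a minimal Python sketch. It is not the tool's actual code or output format: the `Trial` record, its fields, and `pick_keeper` are hypothetical names made up for this example, and it assumes you have already recorded, for each trial, its KL divergence and which prompts the model answered instead of refusing. It also assumes ties on pass count are broken by lower KL divergence.

```python
from dataclasses import dataclass, field


@dataclass
class Trial:
    # Hypothetical record for one trial; not the tool's real data structure.
    name: str
    kl_divergence: float
    answered: set = field(default_factory=set)  # prompts answered, not refused


def pick_keeper(trials, pass_list):
    """Return the trial that answers the most pass-list prompts.

    Ties on pass count are broken by the lower KL divergence (an assumption).
    Prompts outside the pass list are ignored entirely.
    """
    must_pass = set(pass_list)
    return max(
        trials,
        key=lambda t: (len(t.answered & must_pass), -t.kl_divergence),
    )


# Made-up example data: three trials scored against a three-prompt pass list.
pass_list = ["p1", "p2", "p3"]
trials = [
    Trial("trial_a", 0.05, {"p1"}),
    Trial("trial_b", 0.20, {"p1", "p2", "p3"}),
    Trial("trial_c", 0.12, {"p1", "p2", "p3", "x1"}),
]

keeper = pick_keeper(trials, pass_list)
print(keeper.name, keeper.kl_divergence)
```

Here `trial_a` has the lowest divergence but only passes one of the three prompts, so it loses to the two trials that pass all three; between those, the one with the lower divergence wins.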

Ohhh, that's why I was stuck. Thank you!
