Have you tried any abliterated models?

rabi_molar · on Aug 16, 2024

Hadn't heard about the abliteration before, thanks for bringing it up! Here's a HF walkthrough [1] of the concept for anyone else interested in learning more.

[1] https://huggingface.co/blog/mlabonne/abliteration

phren0logy · on Aug 15, 2024

Yes, with mixed results.

ustad · on Aug 15, 2024

Any recommendations?

chpatrick · on Aug 15, 2024

LMStudio + https://huggingface.co/mlabonne/Llama-3.1-70B-Instruct-lorab...

reissbaker · on Aug 16, 2024

Yup, 3.1-70B-Instruct-lorablated is the one I currently recommend too for anti-rejection models — it seems roughly as anti-rejection as the original failspy "abliterated" model, but it works with 128k context since it's based on 3.1 instead of 3 (which only had 8k context). It's currently our second-most popular model on glhf.chat, behind Llama-3.1-405B-Instruct.

pizza · on Aug 15, 2024

failspy’s or mlabonne’s models. Or just look for any model with ‘abliterated’ in the title. Eg try failspy/meta-llama-3-8b-instruct-abliterated-v3 though of course bigger models will probably be better

stavros · on Aug 15, 2024

No specific ones, but there are some abliteration LoRas for Llama (8B and 70B, I think). Those should be good for what you want.