@wassname, is the most recent version of the notebook broken? Running it as-is doesn't seem to ablate the refusals: the intervention generations still refuse, but the fully "merged" orthogonalized ones are more helpful.
wing lian (winglian)