Repositories
Showing 7 of 7 repositories
- steering Public
  Official repo for the paper: "Selective Steering: Norm-Preserving Control Through Discriminative Layer Selection"
- redeval Public
  Official repo for the paper: "RedBench: A Universal Dataset for Comprehensive Red Teaming of Large Language Models"
- rainbowplus Public
  Official repo for the paper: "RainbowPlus: Enhancing Adversarial Prompt Generation via Evolutionary Quality-Diversity Search"
- open-rs Public
  Official repo for the paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"
- mod Public
  Codebase for the paper: "MoD: A Distribution-Based Approach for Merging Large Language Models"
- mod-evaluate Public
- neurips-llm-2023 Public