Ethos: Rectifying Language Models in Orthogonal Parameter Space

Code repository for the paper:

"Ethos: Rectifying Language Models in Orthogonal Parameter Space"

Code Structure

For each unlearning task, there is a dedicated folder named after the task. These folders contain all necessary datasets and training/evaluation code for conducting experiments related to that specific task. Within each task folder, there is also a collection of scripts for different models.

Getting Started

Create and activate a virtual environment:
```
conda create -n ethos python=3.9
conda activate ethos
```
Install the required Python packages:
```
pip install -r requirements.txt
```
Navigate to the corresponding folder and run the provided script. For instance, to perform the toxification unlearning task for the OPT-1.3B model:
```
cd toxification
sh opt-1.3b.sh
```

Each script runs three steps. It first fine-tunes the model on task-specific datasets to generate task vectors, then produces the task vector used for unlearning, and finally evaluates the model's unlearning performance after the task vector has been applied. Please refer to the scripts for the exact commands.
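
The task-vector idea behind these steps can be illustrated with a minimal sketch. This is not the code in `unlearn.py`: it shows only plain task-vector negation on toy weights, whereas the paper's method additionally processes task vectors in an orthogonal parameter space before applying them. The names `task_vector` and `apply_task_vector`, the `Weights` type, and the toy values are hypothetical and chosen for illustration only.

```python
from typing import Dict, List

# Toy stand-in for a model state dict: parameter name -> flattened values.
# (Hypothetical type for illustration; real checkpoints hold tensors.)
Weights = Dict[str, List[float]]


def task_vector(pretrained: Weights, finetuned: Weights) -> Weights:
    """Return the per-parameter difference: fine-tuned minus pretrained."""
    return {
        name: [f - p for p, f in zip(pretrained[name], finetuned[name])]
        for name in pretrained
    }


def apply_task_vector(pretrained: Weights, vector: Weights, scale: float) -> Weights:
    """Add a scaled task vector to the pretrained weights.

    A negative scale moves the model away from the fine-tuned behaviour,
    which is the basic negation step used for unlearning.
    """
    return {
        name: [p + scale * v for p, v in zip(pretrained[name], vector[name])]
        for name in pretrained
    }


# Toy example: one parameter with three values.
pretrained = {"layer.weight": [1.0, 2.0, 3.0]}
finetuned = {"layer.weight": [1.5, 2.0, 2.0]}  # e.g. after fine-tuning on undesired data

tv = task_vector(pretrained, finetuned)
unlearned = apply_task_vector(pretrained, tv, scale=-1.0)
print(tv)
print(unlearned)
```

In this sketch the scale factor controls how strongly the undesired direction is subtracted; the values actually used in the experiments are set in the task scripts.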

To run experiments on Llama models, make sure you have requested access on the official Meta Llama-2 page on HuggingFace and have logged in to your HuggingFace account with your access token:
```
huggingface-cli login
```

Acknowledgement

This repo refers to the following projects:

How to Cite

@inproceedings{
    gao2024ethos,
    title={Ethos: Rectifying Language Models in Orthogonal Parameter Space},
    author={Lei Gao and Yue Niu and Tingting Tang and Salman Avestimehr and Murali Annavaram},
    booktitle={2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics},
    year={2024},
    url={https://arxiv.org/abs/2403.08994}
}

Contact

Questions or comments can be sent to "leig AT usc.edu".
