The code and datasets of our paper "PTR: Prompt Tuning with Rules for Text Classification"
To clone the repository, please run the following command:
git clone https://github.com/thunlp/PTR.git --depth 1
If you use the code, please cite the following paper:
@article{han2021ptr,
title={PTR: Prompt Tuning with Rules for Text Classification},
author={Han, Xu and Zhao, Weilin and Ding, Ning and Liu, Zhiyuan and Sun, Maosong},
journal={arXiv preprint arXiv:2105.11259},
year={2021}
}
The model is implemented using PyTorch. The versions of packages used are shown below.
- numpy==1.18.0
- scikit-learn==0.22.1
- scipy==1.4.1
- torch==1.4.0
- tqdm==4.41.1
- transformers==4.0.0
To set up the dependencies, you can run the following command:
pip install -r requirements.txt
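For example, to keep these pinned versions isolated from your system Python packages, you could install them inside a virtual environment first (the environment name ptr-env below is only an illustration, not part of the repository):

python -m venv ptr-env            # create an isolated environment
source ptr-env/bin/activate       # activate it (Linux/macOS)
pip install -r requirements.txt   # install the pinned dependencies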
We provide a script to download all the datasets used in our paper. You can run the following command to download the datasets:
bash data/download.sh all
The above command will download all of the datasets, including
- Re-TACRED
- TACRED
- TACREV
- SemEval
If you only want to download specific datasets, you can run the following command:
bash data/download.sh $dataset_name1 $dataset_name2 ...
where each $dataset_nameX is one of retacred, tacred, tacrev, or semeval.
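For example, to fetch only the TACRED and SemEval datasets, following the pattern above:

bash data/download.sh tacred semeval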
Some baselines, especially those using entity markers, come from the project [RE_improved_baseline].
The experiments on the four datasets can then be run with the following scripts:
bash scripts/run_large_tacred.sh
bash scripts/run_large_tacrev.sh
bash scripts/run_large_retacred.sh
bash scripts/run_large_semeval.sh
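On a multi-GPU machine, the device visible to PyTorch can be restricted with the standard CUDA_VISIBLE_DEVICES environment variable, assuming the scripts launch a CUDA-enabled PyTorch process, for example:

CUDA_VISIBLE_DEVICES=0 bash scripts/run_large_tacred.sh   # run on GPU 0 only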