This repository contains the code and processed data for the paper "Cross-Model Comparative Loss for Enhancing Neuronal Utility in Language Understanding".
Comparative loss is a simple, task-agnostic loss function that improves neuronal utility without additional human supervision. It is essentially a pairwise ranking loss built on a comparison principle between the full model and its ablated variants: the less a model is ablated, the smaller its task-specific loss is expected to be. In principle, it applies to any dropout-compatible model and to any task whose inputs contain irrelevant content.
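As a rough illustration of this comparison principle (a minimal sketch, not the exact formulation or code used in this repository), the comparative term can be expressed as a pairwise hinge penalty over the task losses of models ordered from least to most ablated:

```python
import torch

def comparative_penalty(task_losses):
    """Pairwise hinge penalty over task losses ordered by ablation degree.

    task_losses[0] comes from the full model, task_losses[k] from a model that is
    ablated more than task_losses[k-1]. Whenever a less-ablated model incurs a
    larger task loss than a more-ablated one, the expected ordering is violated
    and the violation is penalized. Illustrative sketch only.
    """
    losses = torch.stack(list(task_losses))   # shape: (num_models,)
    penalty = losses.new_zeros(())
    for i in range(len(losses)):
        for j in range(i + 1, len(losses)):
            # expect losses[i] <= losses[j] (less ablation -> smaller task loss)
            penalty = penalty + torch.relu(losses[i] - losses[j])
    return penalty

# e.g. comparative_penalty([loss_full, loss_ablated_1, loss_ablated_2])
```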
@article{zhu2023cmp,
  title = {Cross-Model Comparative Loss for Enhancing Neuronal Utility in Language Understanding},
  author = {Zhu, Yunchang and Pang, Liang and Wu, Kangxi and Lan, Yanyan and Shen, Huawei and Cheng, Xueqi},
  year = {2023}
}
Our experiments were conducted in the following environment on V100 (32 GB) GPUs.
conda create -n pt11 python=3.8
conda activate pt11
conda install -c conda-forge jupyterlab=3.4.5 tensorboard=2.10.0 ipywidgets=8.0.2
conda install pytorch==1.11.0 torchvision==0.12.0 torchaudio==0.11.0 cudatoolkit=11.3 -c pytorch
conda install -c conda-forge scikit-learn=1.1.1 scipy=1.8.1
conda install -c huggingface -c conda-forge tokenizers=0.12.1 datasets=2.1.0 transformers=4.19.2
pip install python-Levenshtein==0.20.8 matplotlib==3.6.2
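Optionally, you can verify the installation (assuming the `pt11` environment is active) with a quick import check that prints the installed versions and whether a GPU is visible:
python -c "import torch, transformers; print(torch.__version__, transformers.__version__, torch.cuda.is_available())"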
# Unpack the large corpus file for HotpotQA
cd data/hotpot/ && tar -jxvf corpus.distractor.tsv.tar.bz2 && cd -
We have applied comparative loss to 14 datasets from 3 NLU tasks with distinct prediction types, on top of 4 widely used PLMs. The tasks include:
- Classification: language understanding
- Extraction: reading comprehension
- Ranking: pseudo-relevance feedback
If you want to train your models on other tasks with comparative loss, you can follow the algorithm below and refer to `CmpQA` in modeling.py and run_hotpot.py.
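For intuition only, here is a hedged sketch of what such a training step might look like; `model`, `batch`, `labels`, the dropout rates, and the cross-entropy task loss are all placeholder assumptions and do not mirror the actual implementation in modeling.py:

```python
import torch
import torch.nn.functional as F

def comparative_training_step(model, batch, labels, dropout_rates=(0.0, 0.1, 0.2)):
    """Illustrative training step following the comparison principle.

    The model is run several times under increasing dropout rates (a simple
    stand-in for progressively stronger ablation), and the task losses of the
    runs are constrained to be non-decreasing with the ablation degree.
    All names and hyperparameters here are assumptions for illustration.
    """
    task_losses = []
    for p in dropout_rates:  # larger p = stronger ablation
        for module in model.modules():
            if isinstance(module, torch.nn.Dropout):
                module.p = p  # assume ablation is realized via dropout
        logits = model(**batch)  # assume the model returns logits directly
        task_losses.append(F.cross_entropy(logits, labels))

    # pairwise hinge penalty: a less-ablated run should not lose to a more-ablated one
    penalty = sum(
        torch.relu(task_losses[i] - task_losses[j])
        for i in range(len(task_losses))
        for j in range(i + 1, len(task_losses))
    )
    # total objective: the full model's task loss plus the comparative penalty
    return task_losses[0] + penalty
```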
For any questions about the paper or the code, please contact the first author or open an issue.