
Truth-Aware Context Selection: Mitigating the Hallucinations of Large Language Models Being Misled by Untruthful Contexts

Tian Yu, Shaolei Zhang, Yang Feng*

Source code for the paper "Truth-Aware Context Selection: Mitigating the Hallucinations of Large Language Models Being Misled by Untruthful Contexts".

If you find this project useful, feel free to ⭐️ it and give it a citation!

Overview

Truth-Aware Context Selection (TACS) selects context according to its truthfulness: it discards the untruthful parts of the context and retains the truthful parts, protecting LLMs from being misled by untruthful context and thus avoiding hallucinations. TACS first performs truth detection on the context and then constructs a corresponding attention mask according to the truthfulness of each position to filter the context.
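
As a rough sketch of this pipeline (the helper below is illustrative and its names are assumptions, not the repo's actual API), the per-position truthfulness scores can be turned into a binary attention mask that hides untruthful positions from the model:

import torch

def truth_aware_attention_mask(truth_scores: torch.Tensor,
                               threshold: float = 0.5) -> torch.Tensor:
    """Keep positions whose truthfulness score exceeds the threshold;
    mask out the rest so the LLM never attends to them.
    truth_scores: (seq_len,) per-position truthfulness scores.
    Returns a (seq_len,) mask of 1s (attend) and 0s (ignore)."""
    return (truth_scores > threshold).long()

# Example: positions 3-4 of the context are judged untruthful.
scores = torch.tensor([0.9, 0.8, 0.2, 0.1, 0.7])
mask = truth_aware_attention_mask(scores)
# tensor([1, 1, 0, 0, 1]) -- combined with the usual attention mask and
# passed to the model, e.g. model.generate(input_ids, attention_mask=...).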

  • GUI interaction: We provide a GUI to intuitively compare an LLM's behavior with and without TACS. You can click the examples at the bottom of the page to quickly fill in the 'Question' and 'Information' fields. After clicking the 'Submit' button, you will see the truth detection results on the right, and the responses generated without TACS (bottom left) and with TACS (bottom right).

TACS first conducts truth detection on the contextual information and then constructs the corresponding attention mask according to the truthfulness of each position to filter the context.

Models Download

We provide trained classifiers for truth detection!

TruthfulQA Truth Detection Classifiers: Classifiers for Llama 2-Chat-7B. Classifiers for Honest Llama.

ConflictQA Truth Detection Classifiers: Classifiers for Llama 2-Chat-7B.

With these classifiers, TACS can perform truth (hallucination) detection on the contextual information based on the internal representations, evaluating the truthfulness of each position.
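
As a hedged illustration of what these classifiers do (a sketch under assumed tensor shapes, not the repository's loading code), a trained linear classifier can score the hidden state of each context position:

import torch

def truth_scores(hidden_states: torch.Tensor,
                 weight: torch.Tensor,
                 bias: torch.Tensor) -> torch.Tensor:
    """Score each context position with a linear (SVM-style) classifier
    trained on the LLM's internal representations.
    hidden_states: (seq_len, hidden_dim) representations from one layer.
    weight: (hidden_dim,) classifier weights; bias: a scalar tensor.
    Returns (seq_len,) scores -- higher means more likely truthful."""
    decision = hidden_states @ weight + bias   # signed distance to the decision boundary
    return torch.sigmoid(decision)             # squash to (0, 1) for thresholding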

Installation

  • Clone the TACS repo.
git clone https://github.com/ictnlp/TACS.git
cd TACS
export ROOT=$(pwd)
  • Environment requirements: Python 3.9, PyTorch 1.13.1. Install the dependencies:
pip install -r requirements.txt

GUI Interaction

To interact with TACS in your browser, first download the truth detection classifiers and place them at $ROOT/tfqa/svm, then run the following command:

cd $ROOT/webui
# model_name: e.g. Llama-2-7b-chat-hf
# token_svm_path: e.g. svm_single_evidence_Llama-2-7b-chat-hf_fold2.pt
# token_svm_acc: e.g. acc_single_evidence_Llama-2-7b-chat-hf_fold2.pt
# sentence_svm_path: e.g. mean_svm_single_evidence_Llama-2-7b-chat-hf_fold2.pt
# sentence_svm_acc: e.g. mean_acc_single_evidence_Llama-2-7b-chat-hf_fold2.pt
CUDA_VISIBLE_DEVICES=0,1 python webui.py \
    --model_name ${path_to_Llama-2-7b-chat} \
    --token_svm_path ${path_to_token_level_classifier} \
    --token_svm_acc ${path_to_token_level_classifier_acc} \
    --sentence_svm_path ${path_to_sentence_level_classifier} \
    --sentence_svm_acc ${path_to_sentence_level_classifier_acc} \
    --TACS_mode 'DEMO_token'

Tip

You can switch the truth detection granularity and adjust the classification threshold. Positions with scores above the threshold will be considered truthful.
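
As a rough sketch of the two granularities (illustrative only; function names and the sentence-boundary handling are assumptions), token-level detection keeps or masks each position on its own score, while sentence-level detection makes one decision per sentence and applies it to all of that sentence's tokens:

import torch

def token_level_mask(token_scores: torch.Tensor, threshold: float) -> torch.Tensor:
    # Each position is kept (1) or masked (0) on its own score.
    return (token_scores > threshold).long()

def sentence_level_mask(token_scores: torch.Tensor,
                        sentence_ids: torch.Tensor,
                        threshold: float) -> torch.Tensor:
    # One accept/reject decision per sentence, applied to all of its tokens.
    mask = torch.zeros_like(token_scores, dtype=torch.long)
    for sid in sentence_ids.unique():
        in_sentence = sentence_ids == sid
        if token_scores[in_sentence].mean() > threshold:
            mask[in_sentence] = 1
    return mask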

Evaluation

Note

Experimental results show that TACS can significantly alleviate the hallucinations caused by untruthful context and improve LLMs' adaptability in the face of information interference. More details can be found in the paper.


Truthful information Acceptance Rate (TA Rate), Untruthful information Resistance Rate (UR Rate) and Disturbance Adaptation Rate on TruthfulQA and ConflictQA.

TruthfulQA Evaluation

Generative Multiple-Choice

  • Generate using Llama 2-Chat 7B with TACS
# Generation
cd $ROOT/tfqa
export model_path={path_to_llm}
export CUDA_VISIBLE_DEVICES=0,1,2,3

bash gmc_infer.sh

Generation results can be found at $ROOT/tfqa/generative_multiple_choice_results. Our generation results are also provided in ./tfqa/generative_multiple_choice_results.

  • Evaluate using Llama 2-Chat 7B with TACS
# Evaluation
bash gmc_eval.sh

Open-ended Generation

  • Generate using Llama 2-Chat 7B with TACS
export model_path={path_to_llm}
export CUDA_VISIBLE_DEVICES=0,1,2,3

bash opg_infer.sh

Generation results can be found at $ROOT/tfqa/open_ended_generation_results. Our generation results are also provided in ./tfqa/open_ended_generation_results.

Probabilistic Multiple-Choice

  • Evaluate using Llama 2-Chat 7B with TACS
export model_path={path_to_llm}
export CUDA_VISIBLE_DEVICES=0,1,2,3

bash mc_eval.sh

Metrics can be found at ./tfqa/probabilistic_multiple_choice_results after running the above code.

ConflictQA Evaluation

Generative Multiple-Choice

  • Generate using Llama 2-Chat 7B with TACS
# Generation
cd $ROOT/conflictqa
export model_path={path_to_llm}
export CUDA_VISIBLE_DEVICES=0,1,2,3

bash infer.sh

Generation results can be found at $ROOT/conflictqa/generative_multiple_choice_results. Our generation results are also provided in ./conflictqa/generative_multiple_choice_results.

  • Evaluate using Llama 2-Chat 7B with TACS
# Evaluation
bash eval.sh

License

Model weights and the inference code are released under the GNU General Public License v3.0 (GPLv3).

Citation

If this repository is useful for you, please cite as:

@inproceedings{yu-etal-2024-truth,
    title = "Truth-Aware Context Selection: Mitigating Hallucinations of Large Language Models Being Misled by Untruthful Contexts",
    author = "Yu, Tian  and
      Zhang, Shaolei  and
      Feng, Yang",
    editor = "Ku, Lun-Wei  and
      Martins, Andre  and
      Srikumar, Vivek",
    booktitle = "Findings of the Association for Computational Linguistics ACL 2024",
    month = aug,
    year = "2024",
    address = "Bangkok, Thailand and virtual meeting",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.findings-acl.645",
    pages = "10862--10884",
    abstract = "Although Large Language Models (LLMs) have demonstrated impressive text generation capabilities, they are easily misled by untruthful contexts provided by users or knowledge augmentation tools, leading to hallucinations. To alleviate LLMs from being misled by untruthful context and take advantage of knowledge augmentation, we propose Truth-Aware Context Selection (TACS), a lightweight method to adaptively recognize and mask untruthful context from the inputs. TACS begins by performing truth detection on the input context, leveraging the parameterized knowledge within the LLM. Subsequently, it constructs a corresponding attention mask based on the truthfulness of each position, selecting the truthful context and discarding the untruthful context. Additionally, we introduce a new evaluation metric, Disturbance Adaption Rate, to further study the LLMs{'} ability to accept truthful information and resist untruthful information.Experimental results indicate that TACS can effectively filter untruthful context and significantly improve the overall quality of LLMs{'} responses when presented with misleading information.",
}

If you have any questions, feel free to contact [email protected].
