We currently have the following training scripts for the models:
- GPT2 Baseline Text + Fact (see the input sketch after this list)
- Knowledge Dependent Policy Driven Neural Response Generator using Mezza Tags
For any clarification related to the above code, please reach out to Rishi Rajasekaran ([email protected]).
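As a rough sketch of what "Text + Fact" conditioning looks like for GPT-2, the snippet below concatenates a fact and the dialogue history into a single context using the HuggingFace `transformers` library. The separator choice and prompt format are illustrative assumptions, not the exact scheme used by the training script.

```python
# Illustrative "Text + Fact" input assembly for a GPT-2 response generator.
# The separator tokens and prompt format are assumptions, not the exact
# scheme used by the training script in this repository.
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")

fact = "The Eiffel Tower can grow more than 15 cm during hot summers."
history = ["Have you ever been to Paris?", "Yes, I loved the Eiffel Tower!"]

# Condition the model on the fact followed by the dialogue history,
# separated by the end-of-sequence token.
context = fact + tokenizer.eos_token + tokenizer.eos_token.join(history) + tokenizer.eos_token
input_ids = tokenizer.encode(context, return_tensors="pt")

output = model.generate(
    input_ids,
    max_length=input_ids.shape[1] + 40,
    pad_token_id=tokenizer.eos_token_id,
)
print(tokenizer.decode(output[0][input_ids.shape[1]:], skip_special_tokens=True))
```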
Scripts to train Seq2Seq and Transformer models on the Amazon Topical-Chat Corpus. This code serves as the baseline for DSTC9 Track 3.
To train: `python3 train.py --use_knowledge --transformer --save_path transformer/`
To test: `python3 test.py --use_knowledge --transformer --save_path transformer/`
To serve an interactive model with TF-IDF based fact selection: `python3 dynamic.py --use_knowledge --transformer --save_path transformer/`
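For reference, fact selection in `dynamic.py` is TF-IDF based. The sketch below shows one way such a selector can work, using scikit-learn; the `select_fact` helper and the example fact pool are illustrative, not taken from the repository's code.

```python
# Illustrative TF-IDF fact selection, in the spirit of dynamic.py
# (an sklearn-based sketch, not the repository's actual implementation).
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

def select_fact(user_utterance, facts):
    """Return the fact whose TF-IDF vector is closest to the utterance."""
    vectorizer = TfidfVectorizer()
    fact_matrix = vectorizer.fit_transform(facts)      # one row per fact
    query = vectorizer.transform([user_utterance])     # 1 x vocabulary
    scores = cosine_similarity(query, fact_matrix)[0]  # similarity to each fact
    return facts[scores.argmax()]

facts = [
    "The Eiffel Tower is 330 m tall.",
    "Honey never spoils if stored properly.",
]
print(select_fact("How tall is the Eiffel Tower?", facts))
```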
The pre-processed data can be found in `data.zip`. If you would like to use a different pre-processing strategy, please download the original data from here.
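If you do write your own pre-processing, the original Topical-Chat release stores conversations as JSON. The sketch below assumes the layout of the public Topical-Chat repository; the `conversations/train.json` path and the `content`/`agent`/`message` keys are assumptions here.

```python
# Hedged sketch: loading the original Topical-Chat conversations for a
# custom pre-processing strategy. The file path and JSON keys below follow
# the public Topical-Chat release and are assumptions, not this repo's code.
import json

with open("Topical-Chat/conversations/train.json") as f:
    conversations = json.load(f)

# Print the turns of the first conversation as a quick structure check.
for conv_id, conv in list(conversations.items())[:1]:
    for turn in conv["content"]:
        print(turn["agent"], ":", turn["message"])
```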
The dataset preparation code is split between the `utils.py` and `tc_dataset.py` files: data loading and tokenization are handled in `utils.py`, while the data is prepared for input to the model in `tc_dataset.py`.
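As a rough illustration of that division of labor, the sketch below mirrors the same pattern: a tokenization helper (the `utils.py` role) feeding a PyTorch `Dataset` that packages examples for the model (the `tc_dataset.py` role). All names and the tensor layout are illustrative, not the repository's actual API.

```python
# Illustrative sketch of the utils.py / tc_dataset.py split. Function and
# field names here are assumptions, not the repository's actual API.
import torch
from nltk import word_tokenize
from torch.utils.data import Dataset

def tokenize_dialog(text, vocab):
    """utils.py-style step: tokenize and map words to ids (0 = unknown)."""
    return [vocab.get(w, 0) for w in word_tokenize(text.lower())]

class TopicalChatDataset(Dataset):
    """tc_dataset.py-style step: turn tokenized pairs into model tensors."""
    def __init__(self, pairs, vocab):
        self.examples = [
            (tokenize_dialog(src, vocab), tokenize_dialog(tgt, vocab))
            for src, tgt in pairs
        ]

    def __len__(self):
        return len(self.examples)

    def __getitem__(self, idx):
        src, tgt = self.examples[idx]
        return torch.tensor(src), torch.tensor(tgt)
```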
If you experience any issues with this code, please contact me at [email protected].
The code requires spaCy and NLTK, along with the following downloaded resources:
- spaCy model: `python -m spacy download en_core_web_lg`
- NLTK tokenizer: `nltk.download('punkt')`
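A quick way to sanity-check that both resources are in place (a sketch; the training scripts themselves may load these resources differently):

```python
# Quick sanity check that the spaCy model and NLTK tokenizer are installed
# (a sketch; the training scripts may load these resources differently).
import nltk
import spacy

nltk.download("punkt")              # no-op if already downloaded
nlp = spacy.load("en_core_web_lg")  # raises OSError if the model is missing

print([t.text for t in nlp("Setup looks good.")])
print(nltk.word_tokenize("Setup looks good."))
```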