Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

use pre-trained model on i2b2 2014 dataset #152

Open
InternetMedical opened this issue Oct 5, 2019 · 0 comments
Open

use pre-trained model on i2b2 2014 dataset #152

InternetMedical opened this issue Oct 5, 2019 · 0 comments

Comments

@InternetMedical
Copy link

Dear Franck
The NeuroNER is really a great work. You developers provided detailed answers to users, which helped me a lot when I encountered the same problem. But I still need your help.

In your paper, Transfer Learning for Named-Entity Recognition with Neural Network, you said:

'we apply transfer learning by training the parameters of the ANN model on the source dataset (MIMIC), and using the same ANN to retrain on the target dataset (i2b2 2014 or 2016) for fine-tuning.'

Can you tell me the details how you achieve it?

I used the pre-trained model, namely mimic_glove_spacy_bioes, to fine-tuning on i2b2 dataset.
A part of params used for fine-tuning are set as:
'--train_model=True --use_pretrained_model=True'

But I got error.
'AssertionError: The label B-BIOID does not exist in the pretraining dataset. Please ensure that only the following labels exist in the dataset: B-AGE, B-COUNTRY, B-DATE, B-DOCTOR, B-HOSPITAL, B-IDNUM, B-LOCATION_OTHER, B-PATIENT, B-PHONE, B-STATE, B-STREET, B-ZIP, E-AGE, E-COUNTRY, E-DATE, E-DOCTOR, E-HOSPITAL, E-IDNUM, E-LOCATION_OTHER, E-PATIENT, E-PHONE, E-STATE, E-STREET, E-ZIP, I-AGE, I-COUNTRY, I-DATE, I-DOCTOR, I-HOSPITAL, I-IDNUM, I-LOCATION_OTHER, I-PATIENT, I-PHONE, I-STATE, I-STREET, I-ZIP, O, S-AGE, S-COUNTRY, S-DATE, S-DOCTOR, S-HOSPITAL, S-IDNUM, S-LOCATION_OTHER, S-PATIENT, S-PHONE, S-STATE, S-STREET, S-ZIP'

It seems the labels in i2b2 dataset are different from the labels in MIMIC dataset.
Can you tell me how to fine-tuning the pre-trained model, namely mimic_glove_spacy_bioes, on i2b2 dataset?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant