Here are a few experiments in generating a quiz from a highly structured, IT-specific document. The flow is the following:
- Transform PDF into TXT;
- Load the text into a StringIO object;
- Remove parts of the document that are not relevant to the experiments;
- Perform experiments on cleaning;
- Perform experiments on Q&A generation using pre-trained models;
- Work is still in progress.
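
The StringIO and cleanup steps above could be sketched roughly as follows. This is a minimal illustration using only the standard library, not the actual code used in the experiments: it assumes the PDF has already been converted to plain text, and the helper names (`text_to_buffer`, `drop_irrelevant_lines`), the `PAGE_NUMBER` pattern, and the sample text are all hypothetical.

```python
import io
import re

# Hypothetical rule: a line holding only a page number ("42" or "Page 3")
# carries no meaning for quiz generation. Real documents will need other rules.
PAGE_NUMBER = re.compile(r"^\s*(Page\s+)?\d+\s*$", re.IGNORECASE)


def text_to_buffer(text: str) -> io.StringIO:
    """Wrap already-extracted text in a StringIO so it can be read line by line."""
    return io.StringIO(text)


def drop_irrelevant_lines(buffer: io.StringIO) -> str:
    """Return the buffer's text without blank lines and page-number lines."""
    kept = []
    for line in buffer:
        stripped = line.strip()
        if not stripped:
            continue  # skip empty lines
        if PAGE_NUMBER.match(stripped):
            continue  # skip page-number residue
        kept.append(stripped)
    return "\n".join(kept)


# Made-up sample standing in for text extracted from a PDF.
sample = (
    "1 Introduction\n"
    "\n"
    "TCP is a transport protocol.\n"
    "Page 3\n"
    "\n"
    "42\n"
    "UDP is connectionless.\n"
)

cleaned = drop_irrelevant_lines(text_to_buffer(sample))
print(cleaned)
```

Working on a StringIO buffer keeps the cleanup code identical whether the text comes from a file on disk or from an in-memory string, which is convenient while the cleaning rules are still being tried out.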