On the importance of diversity in question generation for QA.

Sultan, Md Arafat, Shubham Chandel, Ramón Fernandez Astudillo, and Vittorio Castelli.

In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 5651-5656. 2020.[arXiv]

Whats Unique Author demonstrates that question generation using top-p nucleus sampling gives generate more diverse questions, and it provides better QA training. And, also BLEU, METEOR, ROGUE are inversely correlated with diversity.

How It Works

Following example shows how different question can be created from an unique answer span.

Source: Author

Top-p nucleus method samples from re-normalised categorical distribution P_N of nucleus N, which is samllest subset of vocabolary items that has a commulative probability mass greater than p. and highest probability among all the subsets.
By restricting the pool to a high likelihood region of vocabolary instead of top-k sampling, NS reduces the chances of generating low-probability items.
First, 8 QG models are trained, 4 base and 4 large models. And, depending on the volume of training data.
Quetions on dev data were generated using QG models, which were used to train QA model, which were then tested on the test data.
We can see the results as below, which compares beam search and NS@p (neclues sampling at probability mass p)

Source: Author

NS@.95 on QG model trained with 100% of data has given much better results then the on with beam search.

Source: Author

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

QG_diversity.md

QG_diversity.md

On the importance of diversity in question generation for QA.

Sultan, Md Arafat, Shubham Chandel, Ramón Fernandez Astudillo, and Vittorio Castelli.

In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 5651-5656. 2020.[arXiv]

Files

QG_diversity.md

Latest commit

History

QG_diversity.md

File metadata and controls

On the importance of diversity in question generation for QA.

Sultan, Md Arafat, Shubham Chandel, Ramón Fernandez Astudillo, and Vittorio Castelli.

In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 5651-5656. 2020.[arXiv]