Automatic Q&A Generation from Text


Asking relevant and intelligent questions has always been an integral part of human learning, as it can help assess the learner’s understanding of a piece of text. However, compiling questions manually is arduous. Automated question generation (QG) systems can help, as they have the ability to generate questions quicker and on a larger scale.

A typical scenario is evaluating students on reading comprehension, where it becomes tedious for a teacher to manually create questions, find answers to these questions, and then evaluate answer papers after the test has been administered. All these complex tasks can now be automated using an automatic question and answer generation system.

This is why research scholar Vishwajeet Kumar’s project titled, ‘Automatic Question and Answer Generation from Text’ has tremendous potential in a scenario where scientists are closely examining the efficacy of neural network-based methods in question generation from text.

“A compact Question Generation system would be able to generate meaningful, syntactically correct, semantically sound, and natural questions from text. The questions that work best are those that have supporting answers present in the text,” explains Vishwajeet, who has enrolled for a PhD programme in the IITB-Monash Research Academy,

 Early attempts at automated question generation depended heavily on a strict, limited, ad-hoc, and hand-crafted set of rules. These rules focus mainly on the syntactic structure of the text and are limited only to sentences of simple structures. Recently, the success of sequence-to-sequence learning models has opened up possibilities of looking beyond a fixed set of rules for the task of question generation.

 An automatic question generation system has applications in areas as diverse as FAQ generation, intelligent tutoring systems, and virtual assistants. Question generation can be naturally applied in the educational setting such as online courses, automated help systems, and search engines. It can also be applied in a wide variety of other domains — including chatbot systems (e.g. for customer interaction) and health care for analysing mental health.

Explaining his work so far, Vishwajeet says, “We present a system to automatically generate question and answer from text. Our system follows a two-stage process to generate question-answer pairs from the text. In the first stage, we present alternatives for encoding the span of the pivotal answer in the sentence using Pointer Networks. In the next stage, we employ sequence-to-sequence models for question generation, enhanced with rich linguistic features. Finally, global attention and answer encoding are used for generating the question most relevant to the answer.”

The IITB-Monash Research Academy is a collaboration between India and Australia that endeavours to strengthen scientific relationships between the two countries. Graduate research scholars like Vishawajeet study for a dually-badged PhD from both IIT Bombay and Monash University, spending time at both institutions to enrich their research experience.

Says Prof. Murali Sastry, CEO of the IITB-Monash Research Academy, “More and more institutions are moving from classroom teaching to online courses using platforms like MOOC, EDX, etc. For those administering such online courses, generating meaningful questions manually is a tedious task. The work of researchers like Vishwajeet shows us that asking meaningful and intelligent questions will improve the ability to answer them!”

Research scholar: Vishwajeet Kumar, IITB-Monash Research Academy

Project title: Automatic Question and Answer Generation from Text

Supervisors: Prof.Ganesh Ramakrishnan and Prof. Yuan-Fang Li

Contact details: vishwajeet@cse.iitb.ac.in

The above story was written by Mr Krishna Warrier based on inputs from the research student, his supervisors, and IITB-Monash Research Academy. Copyright IITB-Monash Research Academy.