Publication No.: AU2021236965A1
Publication Date: 2022-09-01
Application No.: AU2021236965
Application Date: 2021-03-08
Applicant: IBM
Inventor: SULTAN M D ARAFAT, CASTELLI VITTORIO, CHANDEL SHUBHAM, FERNANDEZ ASTUDILLO RAMON
IPC: G06F16/332
Abstract: An artificial intelligence (AI) computer platform to incorporate synthetic data and ground truth data, and to promote diversity and accuracy in generating the synthetic data. Synthetic questions are generated by a question generator in response to semantically related ground truth passage and answer data. Each generated question is presented to an answer generator together with the semantically related ground truth passage. Each synthetic question is evaluated with respect to its diversity from previous synthetic questions generated for the same ground truth passage and answer data. Each synthetic question is also evaluated with respect to the accuracy of the answer generated by the answer generator. A reward function that captures both the accuracy and the diversity of each synthetic question is used to selectively modify the question generator, with the selective modification(s) directed at increasing textual diversity while maintaining the accuracy of the generated synthetic questions.
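
The reward described in the abstract combines an accuracy term and a diversity term for each synthetic question. The abstract does not state how either term is measured or how they are combined, so the following is only a minimal sketch under stated assumptions: accuracy is approximated by token-level F1 between the answer generator's output and the ground truth answer, diversity by one minus the highest Jaccard token overlap with earlier questions for the same passage and answer, and the two are mixed by a hypothetical `weight` parameter. All function names here are illustrative and do not come from the patent.

```python
from collections import Counter


def tokens(text: str) -> list[str]:
    """Lowercase whitespace tokenization (an assumption; any tokenizer would do)."""
    return text.lower().split()


def answer_f1(predicted: str, truth: str) -> float:
    """Token-level F1 between a predicted answer and the ground truth answer."""
    p, t = tokens(predicted), tokens(truth)
    # Multiset intersection counts tokens shared by both answers.
    overlap = sum((Counter(p) & Counter(t)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(p)
    recall = overlap / len(t)
    return 2 * precision * recall / (precision + recall)


def diversity(question: str, previous: list[str]) -> float:
    """One minus the highest Jaccard overlap with any earlier question."""
    if not previous:
        return 1.0  # the first question for a passage is treated as fully diverse
    q = set(tokens(question))

    def jaccard(a: set[str], b: set[str]) -> float:
        union = a | b
        return len(a & b) / len(union) if union else 1.0

    return 1.0 - max(jaccard(q, set(tokens(p))) for p in previous)


def reward(question: str, previous: list[str],
           predicted_answer: str, true_answer: str,
           weight: float = 0.5) -> float:
    """Weighted mix of accuracy and diversity (the weighting is an assumption)."""
    accuracy = answer_f1(predicted_answer, true_answer)
    return weight * accuracy + (1.0 - weight) * diversity(question, previous)
```

In a training loop, a value like this could serve as the scalar signal used to update the question generator, for example through a policy-gradient style objective. The abstract says only that the generator is selectively modified, so the update rule itself is left open here.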