Publication No.: US20230325426A1
Publication Date: 2023-10-12
Application No.: US18132224
Filing Date: 2023-04-07
Inventors: Hassan Sajjad, Fahim Dalvi, Firoj Alam, Nadir Durrani, Abdul Rafae Khan, Jia Xu
Abstract: A method of constructing a dataset for identifying a plurality of latent concepts in a Natural Language Processing model is provided. The method includes executing a clustering process on a first dataset, preparing a second dataset, defining a hierarchical concept tag-set from the second dataset, and annotating the hierarchical concept tag-set.
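The four steps named in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the patent: the toy 2-d token vectors, the greedy cosine-threshold clustering (a stand-in for whatever clustering process the method actually uses), the nested-dict tag-set, and the names `cosine`, `cluster`, `is_valid_tag`, `first_dataset`, `second_dataset`, `tag_set`, and `annotations` are all hypothetical.

```python
from math import sqrt


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def cluster(vectors, threshold=0.9):
    """Greedy single-pass clustering (illustrative stand-in, not the patented process).

    Each item joins the first cluster whose seed (first member) is at least
    `threshold` similar to it; otherwise it starts a new cluster.
    """
    clusters = []
    for key, vec in vectors.items():
        for members in clusters:
            if cosine(vec, vectors[members[0]]) >= threshold:
                members.append(key)
                break
        else:
            clusters.append([key])
    return clusters


def is_valid_tag(path, tree):
    """Check that a colon-separated tag path exists in the nested tag-set."""
    node = tree
    for part in path.split(":"):
        if part not in node:
            return False
        node = node[part]
    return True


# Step 1: cluster a (hypothetical) first dataset of token representations.
first_dataset = {
    "running": (1.0, 0.1),
    "jumping": (0.9, 0.2),
    "Paris": (0.1, 1.0),
    "Berlin": (0.2, 0.9),
}

# Step 2: the resulting clusters form the second dataset.
second_dataset = cluster(first_dataset)

# Step 3: define a hierarchical concept tag-set (hypothetical labels).
tag_set = {
    "LEX": {"suffix": {}},
    "SEM": {"named_entity": {"location": {}}},
}

# Step 4: annotate each cluster with a tag path from the hierarchy.
annotations = {0: "LEX:suffix", 1: "SEM:named_entity:location"}
annotated = [
    (members, annotations[i])
    for i, members in enumerate(second_dataset)
    if is_valid_tag(annotations[i], tag_set)
]
```

In this toy setup the annotation in step 4 is written by hand, mirroring the human labelling that a tag-set of this kind would normally require; only the tag-path validation is automated.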