Invention Grant
- Patent Title: Generation of training data for verbal harassment detection
-
Application No.: US17135015Application Date: 2020-12-28
-
Publication No.: US11620987B2Publication Date: 2023-04-04
- Inventor: Ying Lyu , Kun Han
- Applicant: Beijing DiDi Infinity Technology and Development Co., Ltd.
- Applicant Address: CN Beijing
- Assignee: Beijing DiDi Infinity Technology and Development Co., Ltd.
- Current Assignee: Beijing DiDi Infinity Technology and Development Co., Ltd.
- Current Assignee Address: CN Beijing
- Agency: Knobbe, Martens, Olson & Bear LLP
- Main IPC: G10L15/06
- IPC: G10L15/06 ; G06N20/00 ; G06N3/08 ; G10L25/63 ; G10L15/26 ; G10L15/08

Abstract:
In some cases, one or more heuristics can be automatically generated using a small dataset of segments previously labeled by one or more domain experts. The generated one or more heuristics along with one or more patterns can be used to assign training labels to a large unlabeled dataset of segments. A subset of segments representing an occurrence of verbal harassment can be selected using the assigned training labels. Randomly selected segments can be used as being indicative of a non-occurrence of verbal harassment. The selected subset of segments and randomly selected segments can be used to train one or more machine learning models for verbal harassment detection.
Public/Granted literature
- US20210201891A1 GENERATION OF TRAINING DATA FOR VERBAL HARASSMENT DETECTION Public/Granted day:2021-07-01
Information query