Invention Grant
- Patent Title: Method and system for classifying word as obscene word
-
Application No.: US17553800Application Date: 2021-12-17
-
Publication No.: US12026465B2Publication Date: 2024-07-02
- Inventor: Mikhail Borisovich Libman
- Applicant: YANDEX EUROPE AG
- Applicant Address: CH Lucerne
- Assignee: Direct Cursus Technology L.L.C
- Current Assignee: Direct Cursus Technology L.L.C
- Current Assignee Address: AE Dubai
- Agency: BCF LLP
- Priority: RU 2020142418 2020.12.22
- Main IPC: G06F40/232
- IPC: G06F40/232 ; G06F40/279 ; G06F40/40 ; G06V30/19

Abstract:
There is disclosed a method and system for classifying a word as an obscene word, the method comprising, at a training phrase: acquiring a first word, the first word corresponding to a given obscene word; generating a first set of misspelled words, the first set of misspelled words comprising a plurality of misspelled variations of the first word; generating a training pairs, the training pairs comprising: a set of positive training pairs comprising the first word paired with each misspelled variations of the first word; training a machine learning algorithm, the training comprising: determining, for each training pairs, a set of features representative of a property of the training pairs; generating an inferred function based on the set of features, the inferred function being configured to assign, in use, an indecency score, the decency score being indicative of a likelihood of the word being obscene.
Public/Granted literature
- US20220198143A1 METHOD AND SYSTEM FOR CLASSIFYING WORD AS OBSCENE WORD Public/Granted day:2022-06-23
Information query