Invention Grant
- Patent Title: Categorical data transformation and clustering for machine learning using natural language processing
-
Application No.: US15824382Application Date: 2017-11-28
-
Publication No.: US11531927B2Publication Date: 2022-12-20
- Inventor: Kourosh Modarresi , Abdurrahman Ibn Munir
- Applicant: Adobe Inc.
- Applicant Address: US CA San Jose
- Assignee: Adobe Inc.
- Current Assignee: Adobe Inc.
- Current Assignee Address: US CA San Jose
- Agency: FIG. 1 Patents
- Main IPC: G06F16/00
- IPC: G06F16/00 ; G06N20/00 ; G06F16/242 ; G06F16/28 ; G06F16/35

Abstract:
Categorical data transformation and clustering techniques and systems are described for machine learning using natural language processing. These techniques and systems are configured to improve operation of a computing device to support efficient and accurate use of categorical data, which is not possible using conventional techniques. In an example, categorical data is received by a computing device that includes a categorical variable having a non-numerical data type for a number of classes. The categorical data is then converted into numerical data using natural language processing. Data is then generated by the computing device that includes a plurality of latent classes. This is performed by clustering the numerical data into a number of clusters that is smaller than the number of classes in the categorical data.
Public/Granted literature
- US20190164083A1 Categorical Data Transformation and Clustering for Machine Learning using Natural Language Processing Public/Granted day:2019-05-30
Information query