Context and domain sensitive spelling correction in a database
Abstract:
A method of operating a health tracking system is disclosed. The method comprises: receiving a first data record comprising at least a first descriptive string regarding a consumable item, the first descriptive string having at least one word thereof incorrectly spelled; generating a vector using the first descriptive string using a machine learning model; identifying a second descriptive string which corresponds to the consumable item and which has a correct spelling of the at least one incorrectly spelled word by applying the machine learning model to the generated vector; calculating a confidence factor regarding the identified second descriptive string using the machine learning model; and when it is determined that the confidence factor exceeds a predetermined threshold, (i) modifying the first data record by replacing the first descriptive string with the second descriptive string, and (ii) storing the modified first data record in the database.
Public/Granted literature
Information query
Patent Agency Ranking
0/0