Mapping natural language and code segments
Abstract:
Techniques are provided for mapping natural language to code segments. In one embodiment, the techniques involve receiving a document and software code, wherein the document comprises a natural language description of a use of the code, generating, via a vectorization process performed on the document, at least one vector or word embedding, generating, via a natural language processing technique performed on the at least one vector or word embedding, a first label set, generating, via a machine learning analysis of the software code, a second label set, determining, based on a comparison of the first label set and the second label set, a match confidence between the document and the software code, wherein the match confidence indicates a measure of similarity between the first label set and the second label set, and upon determining that the match confidence exceeds a predefined threshold, mapping the document to the software code.
Public/Granted literature
Information query
Patent Agency Ranking
0/0