Invention Grant
- Patent Title: Long-range modeling of source code files by syntax hierarchy
- Application No.: US17507808
- Application Date: 2021-10-22
- Publication No.: US11656851B2
- Publication Date: 2023-05-23
- Inventor: Colin Bruce Clement, Neelakantan Sundaresan, Alexey Svyatkovskiy, Michele Tufano
- Applicant: MICROSOFT TECHNOLOGY LICENSING, LLC.
- Applicant Address: US WA Redmond
- Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC.
- Current Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC.
- Current Assignee Address: US WA Redmond
- Main IPC: G06F9/44
- IPC: G06F9/44; G06F8/33

Abstract:
The syntax elements of a source code program that represent the context of a focal method are selected according to a priority order. The selected syntax elements are placed into a fixed-size context window, which is used both to train a neural transformer model with attention to generate source code and, at inference time, by the trained model to generate source code. The context window holds prioritized token sequences that extend beyond the target focus, giving the model longer visibility back into the source code program from which to learn predictive patterns. The model thereby gets file-level context of the source code program without any increase in the size of the context window.
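
The selection step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `SyntaxElement` type, the element kinds, the `PRIORITY` table, the greedy skip-if-too-large rule, and the decision to restore source order after selection are all assumptions made for illustration, since the abstract states only that elements are chosen by a priority order and packed into a fixed-size window.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SyntaxElement:
    kind: str        # hypothetical category, e.g. "focal_method" or "import"
    position: int    # order of appearance in the source file
    tokens: tuple    # already-tokenized content of the element


# Hypothetical priority order (lower value = selected first).
# The abstract does not specify the actual ordering.
PRIORITY = {
    "focal_method": 0,
    "class_signature": 1,
    "import": 2,
    "other_method_signature": 3,
}


def build_context_window(elements, window_size):
    """Greedily pick elements by priority until the token budget is spent."""
    ranked = sorted(
        elements,
        key=lambda e: (PRIORITY.get(e.kind, len(PRIORITY)), e.position),
    )
    chosen = []
    remaining = window_size
    for element in ranked:
        # Assumption: an element that does not fit is skipped, not truncated.
        if len(element.tokens) <= remaining:
            chosen.append(element)
            remaining -= len(element.tokens)
    # Assumption: selected elements are emitted in their original file order.
    chosen.sort(key=lambda e: e.position)
    return [token for element in chosen for token in element.tokens]


elements = [
    SyntaxElement("import", 0, ("import", "os")),
    SyntaxElement("class_signature", 1, ("class", "A", ":")),
    SyntaxElement("other_method_signature", 2, ("def", "g", "(", ")", ":")),
    SyntaxElement("focal_method", 3, ("def", "f", "(", ")", ":", "pass")),
]
window = build_context_window(elements, window_size=11)
```

With a budget of 11 tokens, the sketch keeps the focal method, the class signature, and the import, and drops the lower-priority method signature, so the window spans the whole file while staying within the fixed size.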
Public/Granted literature
- US20230128200A1 LONG-RANGE MODELING OF SOURCE CODE FILES BY SYNTAX HIERARCHY (Public/Granted day: 2023-04-27)