Invention Grant
- Patent Title: Adaptive speculative decoding
-
Application No.: US16592465Application Date: 2019-10-03
-
Publication No.: US10911063B2Publication Date: 2021-02-02
- Inventor: Vikram B. Suresh , Sudhir K. Satpathy , Sanu K. Mathew
- Applicant: Intel Corporation
- Applicant Address: US CA Santa Clara
- Assignee: Intel Corporation
- Current Assignee: Intel Corporation
- Current Assignee Address: US CA Santa Clara
- Agency: Compass IP Law PC
- Main IPC: H03M7/30
- IPC: H03M7/30 ; H03M7/42

Abstract:
Examples herein relate to decoding tokens using speculative decoding operations to decode tokens at an offset from a token decoded by a sequential decoding operation. At a checkpoint, a determination is made as to whether tokens to be decoded by the sequential and speculative decoding operations align. If there is alignment, the speculatively decoded tokens after a discard window are committed and made available for access. If there is not alignment, the speculatively decoded tokens are discarded. A miss in alignment and a fullness level of a buffer that stores speculatively decoded tokens are assessed to determine a next offset level for a start of speculative decoding. A size of a discard window can be set using a relationship based on the offset level to improve buffer utilization and to attempt to improve changes of alignments.
Public/Granted literature
- US20200036389A1 ADAPTIVE SPECULATIVE DECODING Public/Granted day:2020-01-30
Information query
IPC分类: