Schedule-aware tensor distribution module

    公开(公告)号:US12288153B2

    公开(公告)日:2025-04-29

    申请号:US18408716

    申请日:2024-01-10

    Abstract: Methods and systems include a neural network system that includes a neural network accelerator. The neural network accelerator includes multiple processing engines coupled together to perform arithmetic operations in support of an inference performed using the deep neural network system. The neural network accelerator also includes a schedule-aware tensor data distribution circuitry or software that is configured to load tensor data into the multiple processing engines in a load phase, extract output data from the multiple processing engines in an extraction phase, reorganize the extracted output data, and store the reorganized extracted output data to memory.

Patent Agency Ranking