-
公开(公告)号:US20240244354A1
公开(公告)日:2024-07-18
申请号:US18561985
申请日:2022-05-20
Applicant: Massachusetts Institute of Technology
Inventor: Manya Ghobadi , Zhizhen Zhong , Weiyang Wang , Liane Sarah Beland Bernstein , Alexander Sludds , Ryan HAMERLY , Dirk Robert ENGLUND
IPC: H04Q11/00 , G06N5/04 , H04B10/524
CPC classification number: H04Q11/0005 , G06N5/04 , H04B10/524 , H04Q2011/0039 , H04Q2011/0041
Abstract: In-network Optical Inference (IOI) provides low-latency machine learning inference by leveraging programmable switches and optical matrix multiplication. IOI uses a transceiver module, called a Neuro Transceiver, with an optical processor to perform linear operations, such as matrix multiplication, in the optical domain. IOI's transceiver modules can be plugged into programmable packet switches, which are programmed to perform non-linear activations in the electronic domain and to respond to inference queries. Processing inference queries at the programmable packet switches inside the network, without sending them to cloud or edge inference servers, significantly reduces end-to-end inference latency experienced by users.