Patent search ap:("Samsung Electronics Co. Page Ltd.") AND inv:"Krishna Malladi"

1.

发明申请
RACK-LEVEL SCHEDULING FOR REDUCING THE LONG TAIL LATENCY USING HIGH PERFORMANCE SSDS 审中-公开

公开(公告)号：US20200225999A1

公开(公告)日：2020-07-16

申请号：US16828649

申请日：2020-03-24

Applicant: Samsung Electronics Co., Ltd.

Inventor： Qiumin Xu , Krishna Malladi , Manu Awasthi

IPC: G06F9/50

Abstract: A method for migrating a workload includes: receiving workloads generated from a plurality of applications running in a plurality of server nodes of a rack system; monitoring latency requirements for the workloads and detecting a violation of the latency requirement for a workload; collecting system utilization information of the rack system; calculating rewards for migrating the workload to other server nodes in the rack system; determining a target server node among the plurality of server nodes that maximizes the reward; and performing migration of the workload to the target server node.

2.

发明授权
Electronic system with memory management mechanism and method of operation thereof 有权

公开(公告)号：US09934154B2

公开(公告)日：2018-04-03

申请号：US15174986

申请日：2016-06-06

Applicant: Samsung Electronics Co., Ltd.

Inventor： Krishna Malladi , Uksong Kang , Hongzhong Zheng

IPC: G11C7/00 , G11C5/00 , G06F12/0897 , G06F13/16

CPC classification number: G06F12/0897 , G06F12/0811 , G06F12/0868 , G06F13/1673 , G06F2212/1016 , G06F2212/60

Abstract: An electronic system includes: a processor configured to access operation data; a local cache memory, coupled to the processor, configured to store a limited amount of the operation data; a memory controller, coupled to the local cache memory, configured to maintain a flow of the operation data; and a memory subsystem, coupled to the memory controller, including: a first tier memory configured to store the operation data, with critical timing, by a fast control bus, and a second tier memory configured to store the operation data with non-critical timing, by a reduced performance control bus.

3.

发明申请
METHOD AND APPARATUS FOR ENABLING LARGER MEMORY CAPACITY THAN PHYSICAL MEMORY SIZE 审中-公开

公开(公告)号：US20170286010A1

公开(公告)日：2017-10-05

申请号：US15498371

申请日：2017-04-26

Applicant: Samsung Electronics Co., Ltd.

Inventor： Dongyan Jiang , Changhui Lin , Krishna Malladi , Jongmin Gim , Hongzhong Zheng

IPC: G06F3/06 , G06F12/0802

Abstract: A dedupe module is provided. The dedupe module includes: a host interface; a dedupe engine to receive a data request from a host system via the host interface; a memory controller; a plurality of memory modules, each memory module being coupled to the memory controller; and a read cache for caching data from the memory controller for use by the dedupe engine.

4.

发明申请
ELECTRONIC SYSTEM WITH MEMORY CONTROL MECHANISM AND METHOD OF OPERATION THEREOF 审中-公开
Title translation: 具有存储器控制机制的电子系统及其操作方法

公开(公告)号：US20150363312A1

公开(公告)日：2015-12-17

申请号：US14563710

申请日：2014-12-08

Applicant: Samsung Electronics Co., Ltd.

Inventor： Hongzhong Zheng , Krishna Malladi , Steven Shrader

IPC: G06F12/08

CPC classification number: G06F12/0813 , G06F12/0833 , G06F12/0871 , G06F2212/1016 , G06F2212/1028 , Y02D10/13

Abstract: An electronic system includes: a second memory module; a first memory module coupled to the second memory module; and a multicast controller for managing a cache on the first memory module for the second memory module.

Abstract translation: 电子系统包括：第二存储器模块; 耦合到第二存储器模块的第一存储器模块; 以及用于管理第二存储器模块的第一存储器模块上的高速缓存的组播控制器。

5.

发明授权
Dataflow accelerator architecture for general matrix-matrix multiplication and tensor computation in deep learning 有权

公开(公告)号：US11100193B2

公开(公告)日：2021-08-24

申请号：US16388860

申请日：2019-04-18

Applicant: Samsung Electronics Co., Ltd.

Inventor： Peng Gu , Krishna Malladi , Hongzhong Zheng , Dimin Niu

IPC: G06F17/16 , G06F12/0877 , G06F12/0802 , G06N3/063 , G06N3/00 , G06N3/04 , G06N3/08

Abstract: A general matrix-matrix multiplication (GEMM) dataflow accelerator circuit is disclosed that includes a smart 3D stacking DRAM architecture. The accelerator circuit includes a memory bank, a peripheral lookup table stored in the memory bank, and a first vector buffer to store a first vector that is used as a row address into the lookup table. The circuit includes a second vector buffer to store a second vector that is used as a column address into the lookup table, and lookup table buffers to receive and store lookup table entries from the lookup table. The circuit further includes adders to sum the first product and a second product, and an output buffer to store the sum. The lookup table buffers determine a product of the first vector and the second vector without performing a multiply operation. The embodiments include a hierarchical lookup architecture to reduce latency. Accumulation results are propagated in a systolic manner.

6.

发明授权
Method and apparatus for enabling larger memory capacity than physical memory size 有权

公开(公告)号：US10678704B2

公开(公告)日：2020-06-09

申请号：US15476757

申请日：2017-03-31

Applicant: Samsung Electronics Co., Ltd.

Inventor： Dongyan Jiang , Changhui Lin , Krishna Malladi , Jongmin Gim , Hongzhong Zheng

IPC: G06F12/1018 , G06F3/06 , G06F12/0864

Abstract: A method of retrieving data stored in a memory associated with a dedupe module is provided. The method includes: identifying a logical address of the data; identifying a physical line ID of the data in accordance with the logical address by looking up at least a portion of the logical address in a translation table; locating a respective physical line, the respective physical line corresponding to the PLID; and retrieving the data from the respective physical line, the retrieving including copying a respective hash cylinder to the read cache, the respective hash cylinder including: a respective hash bucket, the respective hash bucket including the respective physical line; and a respective reference counter bucket, the respective reference counter bucket including a respective reference counter associated with the respective physical line.

7.

发明授权
Dedupe DRAM system algorithm architecture 有权

公开(公告)号：US09966152B2

公开(公告)日：2018-05-08

申请号：US15162512

申请日：2016-05-23

Applicant: Samsung Electronics Co., Ltd.

Inventor： Chaohong Hu , Hongzhong Zheng , Krishna Malladi , Bob Brennan

IPC: G06F3/06 , G11C29/00 , G06F12/0802

CPC classification number: G11C29/808 , G06F12/0802 , G11C29/74

Abstract: A deduplication memory module, which is configured to internally perform memory deduplication, includes a hash table memory for storing multiple blocks of data in a hash table array including hash tables, each of the hash tables including physical buckets and a plurality of virtual buckets each including some of the physical buckets, each of the physical buckets including ways, an address lookup table memory (ALUTM) including a plurality of pointers indicating a location of each of the stored blocks of data in a corresponding one of the physical buckets, and a buffer memory for storing unique blocks of data not stored in the hash table memory when the hash table array is full, a processor, and memory, wherein the memory has stored thereon instructions that, when executed by the processor, cause the memory module to exchange data with an external system.

8.

发明授权
Smart in-module refresh for DRAM 有权

公开(公告)号：US09761296B2

公开(公告)日：2017-09-12

申请号：US15299445

申请日：2016-10-20

Applicant: Samsung Electronics Co., Ltd.

Inventor： Mu-Tien Chang , Krishna Malladi , Dimin Niu , Hongzhong Zheng

IPC: G11C5/14 , G11C11/406 , G11C11/4076 , G11C5/04 , G06F13/16

CPC classification number: G11C11/40615 , G06F13/1636 , G11C5/04 , G11C11/40618 , G11C11/4076

Abstract: A memory (1205) is disclosed. The memory (1205) can includes a stack of dynamic Random Access Memory (DRAM) cores (1210, 1215, 1220, 1225) in a three-dimensional stacked memory architecture (1230). Each of the DRAM cores (1210, 1215, 1220, 1225) can include a plurality of banks (205-1, 205-2, 205-3, 205-4) to store data. The memory (1205) can also include logic layer (1235) which can include an interface (1305) to connect the memory (1205) with a processor (120). The logic layer (1235) can also include a refresh engine (115) that can be used to refresh one of the plurality of banks (205-1, 205-2, 205-3, 205-4) and a Smart Refresh Component (305) that can advise the refresh engine (115) which bank to refresh using an out-of-order per-bank refresh. The Smart Refresh Component (305) can use a logic (415) to identify a farthest bank in the pending transactions in the transaction queue (430) at the time of refresh.

9.

发明授权
Dataflow accelerator architecture for general matrix-matrix multiplication and tensor computation in deep learning 有权

公开(公告)号：US12164593B2

公开(公告)日：2024-12-10

申请号：US17374988

申请日：2021-07-13

Applicant: Samsung Electronics Co., Ltd.

Inventor： Peng Gu , Krishna Malladi , Hongzhong Zheng , Dimin Niu

IPC: G06F17/16 , G06F12/0802 , G06F12/0877 , G06N3/008 , G06N3/045 , G06N3/063 , G06N3/08

Abstract: A general matrix-matrix multiplication (GEMM) dataflow accelerator circuit is disclosed that includes a smart 3D stacking DRAM architecture. The accelerator circuit includes a memory bank, a peripheral lookup table stored in the memory bank, and a first vector buffer to store a first vector that is used as a row address into the lookup table. The circuit includes a second vector buffer to store a second vector that is used as a column address into the lookup table, and lookup table buffers to receive and store lookup table entries from the lookup table. The circuit further includes adders to sum the first product and a second product, and an output buffer to store the sum. The lookup table buffers determine a product of the first vector and the second vector without performing a multiply operation. The embodiments include a hierarchical lookup architecture to reduce latency. Accumulation results are propagated in a systolic manner.

10.

发明授权
Dataflow accelerator architecture for general matrix-matrix multiplication and tensor computation in deep learning 有权

公开(公告)号：US12130884B2

公开(公告)日：2024-10-29

申请号：US17374988

申请日：2021-07-13

Applicant: Samsung Electronics Co., Ltd.

Inventor： Peng Gu , Krishna Malladi , Hongzhong Zheng , Dimin Niu

IPC: G06F17/16 , G06F12/0802 , G06F12/0877 , G06N3/008 , G06N3/045 , G06N3/063 , G06N3/08

CPC classification number: G06F17/16 , G06F12/0802 , G06F12/0877 , G06N3/008 , G06N3/045 , G06N3/063 , G06F2212/1024 , G06F2212/1036 , G06F2212/22 , G06N3/08

Abstract: A general matrix-matrix multiplication (GEMM) dataflow accelerator circuit is disclosed that includes a smart 3D stacking DRAM architecture. The accelerator circuit includes a memory bank, a peripheral lookup table stored in the memory bank, and a first vector buffer to store a first vector that is used as a row address into the lookup table. The circuit includes a second vector buffer to store a second vector that is used as a column address into the lookup table, and lookup table buffers to receive and store lookup table entries from the lookup table. The circuit further includes adders to sum the first product and a second product, and an output buffer to store the sum. The lookup table buffers determine a product of the first vector and the second vector without performing a multiply operation. The embodiments include a hierarchical lookup architecture to reduce latency. Accumulation results are propagated in a systolic manner.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification