Link change decision-making using reinforcement learning based on tracked rewards and outcomes in a wireless communication system

Invention Grant

US11963047B2 Link change decision-making using reinforcement learning based on tracked rewards and outcomes in a wireless communication system 有权

Please log in to see more content

Patent Title: Link change decision-making using reinforcement learning based on tracked rewards and outcomes in a wireless communication system
Application No.: US17286065

Application Date: 2018-10-18
Publication No.: US11963047B2

Publication Date: 2024-04-16
Inventor: Athanasios Karapantelakis , Elena Fersman , Rafia Inam , Markus Andersson , David Lindero
Applicant: Telefonaktiebolaget LM Ericsson (publ)
Applicant Address: SE Stockholm
Assignee: Telefonaktiebolaget LM Ericsson (publ)
Current Assignee: Telefonaktiebolaget LM Ericsson (publ)
Current Assignee Address: SE Stockholm
Agency: Murphy, Bilak & Homiller, PLLC
International Application: PCT/EP2018/078509 2018.10.18
International Announcement: WO2020/078552A 2020.04.23
Date entered country: 2021-04-16
Main IPC: H04W36/00
IPC: H04W36/00 ; G06N20/00 ; H04W36/16 ; H04W36/30

Link change decision-making using reinforcement learning based on tracked rewards and outcomes in a wireless communication system

Abstract:

Decision-making equipment (22) is configured for link change decision-making using reinforcement learning. The decision-making equipment (22) is configured to track rewards (30-1, . . . 30-M) earned for, and outcomes (28-1, . . . 28-M) of, respective link change decisions (26-1, . . . 26-M). In some embodiments, possible outcomes of a link change decision to change a serving link of a wireless device to a target link include at least: a change of the serving link of the wireless device from the target link to another link; and a network-initiated disconnect of the wireless device from the target link. Regardless, the decision-making equipment (22) is also configured to make a link change decision (28-(M+1)) based on the tracked rewards (30-1, . . . 30-M) and outcomes (28-1, . . . 28-M).

Public/Granted literature

US20210377822A1 Link Change Decision-Making using Reinforcement Learning based on Tracked Rewards and Outcomes in a Wireless Communication System Public/Granted day:2021-12-02

Information query

Espacenet

IPC分类:

H	电学
H04	电通信技术
H04W	无线通信网络(广播通信入H04H;使用无线链路来进行非选择性通信的通信系统，如无线扩展入H04M1/72)
H04W36/00	切换或重选装置