Link change decision-making using reinforcement learning based on tracked rewards and outcomes in a wireless communication system
Abstract:
Decision-making equipment (22) is configured for link change decision-making using reinforcement learning. The decision-making equipment (22) is configured to track rewards (30-1, . . . 30-M) earned for, and outcomes (28-1, . . . 28-M) of, respective link change decisions (26-1, . . . 26-M). In some embodiments, possible outcomes of a link change decision to change a serving link of a wireless device to a target link include at least: a change of the serving link of the wireless device from the target link to another link; and a network-initiated disconnect of the wireless device from the target link. Regardless, the decision-making equipment (22) is also configured to make a link change decision (26-(M+1)) based on the tracked rewards (30-1, . . . 30-M) and outcomes (28-1, . . . 28-M).
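The tracking-then-deciding loop described in the abstract can be illustrated with a minimal sketch. The abstract does not name a specific learning algorithm, so an epsilon-greedy rule over per-target mean rewards is used here purely as a stand-in. The class name `LinkChangeTracker`, the outcome labels, the reward values, and the link names are all hypothetical and do not come from the patent.

```python
import random
from collections import defaultdict
from dataclasses import dataclass, field

# Hypothetical outcome labels. The last two mirror the outcomes named in the
# abstract; the first is an assumed "nothing went wrong" case.
OUTCOME_STAYED = "stayed_on_target"
OUTCOME_CHANGED_AGAIN = "changed_from_target_to_another_link"
OUTCOME_NETWORK_DISCONNECT = "network_initiated_disconnect"

# Assumed reward values, chosen only for illustration.
REWARD_BY_OUTCOME = {
    OUTCOME_STAYED: 1.0,
    OUTCOME_CHANGED_AGAIN: -0.5,
    OUTCOME_NETWORK_DISCONNECT: -1.0,
}


@dataclass
class LinkChangeTracker:
    """Tracks rewards and outcomes of past link change decisions."""

    epsilon: float = 0.1  # exploration probability (assumption)
    history: list = field(default_factory=list)  # (target, outcome, reward)
    totals: dict = field(default_factory=lambda: defaultdict(float))
    counts: dict = field(default_factory=lambda: defaultdict(int))

    def record(self, target, outcome):
        """Store the outcome of one decision and the reward earned for it."""
        reward = REWARD_BY_OUTCOME[outcome]
        self.history.append((target, outcome, reward))
        self.totals[target] += reward
        self.counts[target] += 1
        return reward

    def mean_reward(self, target):
        """Average tracked reward for a target link (0.0 if never tried)."""
        n = self.counts[target]
        return self.totals[target] / n if n else 0.0

    def decide(self, candidates, rng=random):
        """Make the next decision from the tracked rewards (epsilon-greedy)."""
        if rng.random() < self.epsilon:
            return rng.choice(candidates)  # explore
        return max(candidates, key=self.mean_reward)  # exploit


tracker = LinkChangeTracker(epsilon=0.0)  # no exploration, for a repeatable demo
tracker.record("link_A", OUTCOME_NETWORK_DISCONNECT)
tracker.record("link_B", OUTCOME_STAYED)
next_target = tracker.decide(["link_A", "link_B"])
```

In this sketch the tracked outcomes feed the decision only through the rewards they map to; a fuller agent could also condition on the outcome history directly, which the abstract leaves open.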