Thompson抽样算法-R

作者: 灵妍 | 来源:发表于2018-04-18 22:04 被阅读35次

Thompson抽样算法-R
Thompson抽样算法原理
Thompson抽样算法-Python
机器学习A-Z～Thompson抽样算法
蓄水池抽样算法（Reservoir Sampling）
分层抽样
借鉴水塘抽样算法的一种解决思想
推荐系统遇上深度学习(十三)--linUCB方法浅析及实现
水库抽样
R语言统计抽样

楔子：

Thompson抽样算法.PNG

贝叶斯推理.PNG

1、数据预处理

代码：

# Thompson Sampling

# Importing the dataset
dataset = read.csv('Ads_CTR_Optimisation.csv')

2、数据初始化

代码：

# Implementing Thompson Sampling
N = 10000
d = 10
ads_selected = integer(0)
numbers_of_rewards_1 = integer(d)
numbers_of_rewards_0 = integer(d)
total_reward = 0

3、ThompsonSampling

代码：

for (n in 1:N) {
  ad = 0
  max_random = 0
  for (i in 1:d) {
    random_beta = rbeta(n = 1,
                        shape1 = numbers_of_rewards_1[i] + 1,
                        shape2 = numbers_of_rewards_0[i] + 1)
    if (random_beta > max_random) {
      max_random = random_beta
      ad = i
    }
  }
  ads_selected = append(ads_selected, ad)
  reward = dataset[n, ad]
  if (reward == 1) {
    numbers_of_rewards_1[ad] = numbers_of_rewards_1[ad] + 1
  } else {
    numbers_of_rewards_0[ad] = numbers_of_rewards_0[ad] + 1
  }
  total_reward = total_reward + reward
}

4、数据可视化

代码：

# Visualising the results
hist(ads_selected,
     col = 'blue',
     main = 'Histogram of ads selections',
     xlab = 'Ads',
     ylab = 'Number of times each ad was selected')