思路
为最终比较划分好测试集和不存在边集【全集1-train-test-对角元】,根据集合中非零元【即边的数目】构建比较矩阵test_rd【比较n次,随机生成n个值右取整作为抽取的要比较的边】,下一步对test的邻接矩阵赋值,相似性指标给予一个分值存入test_pre,然后依据判断条件【test==1】只取其中非零边【为什么?降低内存?】,对应位置的test_pre存入test_data,再把最初的test_rd构建成稀疏矩阵【?为什么不直接比】,最后test_rd 和 non_rd比大小,分别给n',n''计算AUC
AUC计算方法
triu
triu Extract upper triangular part.提取上三角
function [ auc ] = CalcAUC( train, test, sim, n )
%% 计算AUC,输入计算的相似度矩阵
sim = triu(sim - sim.*train);%只保留上三角,因为同一条边取一次就足够了
对于CN来说
data:image/s3,"s3://crabby-images/3a7d1/3a7d19c5e47d2de0d9e90376561993a358bba0cc" alt=""
data:image/s3,"s3://crabby-images/c1449/c14494b6c3c508fa9659c5eac5c19f5cd26594f9" alt=""
data:image/s3,"s3://crabby-images/e4a8f/e4a8fa374661c71ee5e83842036a8527fad44637" alt=""
data:image/s3,"s3://crabby-images/a8df1/a8df1cbd7f6d5ee20df9f5568f339f65005eb51b" alt=""
data:image/s3,"s3://crabby-images/5a963/5a963e9da0ce44b22f830346248d9c0c264a3dce" alt=""
% 只保留测试集和不存在边集合中的边的相似度(自环除外)
non = 1 - train - test - eye(max(size(train,1),size(train,2)));
data:image/s3,"s3://crabby-images/f5221/f5221ab5618c864a602552457f662aa6d8b926ba" alt=""
%1代表全集,全连通图,减去边集,减去对角元,剩下的为不存在边集
data:image/s3,"s3://crabby-images/71f7c/71f7cdb72bcf43e5ec537313fb9f087e52639b01" alt=""
test = triu(test);
data:image/s3,"s3://crabby-images/8d6bc/8d6bcab6c612e570519e6a47a5d489f38f776f39" alt=""
non = triu(non);
data:image/s3,"s3://crabby-images/6492e/6492e698691e55abadef0d18a95c63fc4724eb18" alt=""
% 分别取测试集和不存在边集合的上三角矩阵,用以取出他们对应的相似度分值??【点乘sim,即后边的test_pre = sim .* test】
test_num = nnz(test);
non_num = nnz(non);
data:image/s3,"s3://crabby-images/00bb1/00bb17aec57218efbe55debd4c4aea84e0eb0e31" alt=""
test_rd = ceil( test_num * rand( 1, n));
data:image/s3,"s3://crabby-images/2050f/2050fdfb52ca2630450ef540c11d999b5567957e" alt=""
data:image/s3,"s3://crabby-images/1a82f/1a82f3ce7459fc2482027c1495046efc9359af9b" alt=""
data:image/s3,"s3://crabby-images/5d6b1/5d6b19a399eca03d67418404bf79da53a24aea8a" alt=""
% ceil是取大于等于的最小整数,n为抽样比较的次数
non_rd = ceil( non_num * rand( 1, n));
data:image/s3,"s3://crabby-images/30adc/30adccbeec442ab84caf294b59f8d1da8c30b7f0" alt=""
这里开始不太懂
test_pre = sim .* test;%具体某条边乘他的相似性作为预测分值
data:image/s3,"s3://crabby-images/a5980/a5980145ad2aaa830163df46a40f95998b511bb1" alt=""
non_pre = sim .* non;%具体某条不存在边乘他的相似性作为预测分值
data:image/s3,"s3://crabby-images/43031/430317267098f2d0833effc5e08ef99b1a5ca614" alt=""
这里不懂
test_data = test_pre( test == 1 )'; %????【将test中非零元对应位置的test_pre值取出存入test_data】
% 行向量,test 集合存在的边的预测值
data:image/s3,"s3://crabby-images/12eb4/12eb4b96a221d4bd2fa05361a28d63f8a8ad16cb" alt=""
non_data = non_pre( non == 1 )';
data:image/s3,"s3://crabby-images/a34cf/a34cfe9bc9351d7ba5e86fe4b7f5cc7f3e269ffe" alt=""
% 行向量,nonexist集合存在的边的预测值
data:image/s3,"s3://crabby-images/b1569/b156978c5d0fa74f2a46ef6876edb525c25c7136" alt=""
test_rd = test_data( test_rd );
data:image/s3,"s3://crabby-images/069ce/069ce1451a6c774f1177e7b97a53ac785adea6e5" alt=""
non_rd = non_data( non_rd );
data:image/s3,"s3://crabby-images/cad88/cad88e46f2b72047d472263c508bba86b55118e2" alt=""
clear test_data non_data;
n1 = length( find(test_rd > non_rd) );
n2 = length( find(test_rd == non_rd));
data:image/s3,"s3://crabby-images/a350c/a350c7fc93c938c79cce292e9b36c8987d4dce02" alt=""
auc = ( n1 + 0.5*n2 ) / n;
end
data:image/s3,"s3://crabby-images/8d7a7/8d7a755d770cf64ec1d6aef69aaeaee7f4954fad" alt=""
网友评论