美文网首页
2019-04-02 movielens数据集整理

2019-04-02 movielens数据集整理

作者: QQsoso | 来源:发表于2019-04-02 15:30 被阅读0次

    https://grouplens.org/datasets/movielens/

    ml100k:

    官网简介:Stable benchmark dataset. 100,000 ratings from 1000 users on 1700 movies. Released 4/1998.

    下载链接:https://grouplens.org/datasets/movielens/100k/

    数据简介:来自1000个用户对1700部电影的100000个评分

    稀疏度:5.88%    人均100个评分

    ml1m:

    官网简介:Stable benchmark dataset. 1 million ratings from 6000 users on 4000 movies. Released 2/2003.

    下载链接:https://grouplens.org/datasets/movielens/1m/

    数据简介:来自6000个用户对4000部电影的1000000个评分

    稀疏度:4.17%   人均166个评分

    ml10m:

    官网简介:Stable benchmark dataset. 10 million ratings and 100,000 tag applications applied to 10,000 movies by 72,000 users. Released 1/2009.

    下载链接:https://grouplens.org/datasets/movielens/10m/

    数据简介:来自72000个用户对10000部电影的10000000个评分,有tag

    稀疏度:1.39%   人均138个评分

    ml20m:

    官网简介:Stable benchmark dataset. 20 million ratings and 465,000 tag applications applied to 27,000 movies by 138,000 users. Includes tag genome data with 12 million relevance scores across 1,100 tags. Released 4/2015; updated 10/2016 to update links.csv and add tag genome data.

    下载链接:https://grouplens.org/datasets/movielens/20m/

    数据简介:来自138000个用户对27000部电影的20000000个评分,有tag

    稀疏度:0.537%   人均144个评分

    ml-latest:正在更新,不建议作为研究报告结果使用

    Book-Crossing:

    官网简介:The BookCrossing (BX) dataset was collected by Cai-Nicolas Ziegler in a 4-week crawl (August / September 2004) from the Book-Crossing community with kind permission from Ron Hornbaker, CTO of Humankind Systems. It contains 278,858 users (anonymized but with demographic information) providing 1,149,780 ratings (explicit / implicit) about 271,379 books.

    下载链接:https://grouplens.org/datasets/book-crossing/

    数据简介:来自278,858个用户对271,379部书籍的1,149,780个评分

    稀疏度:0.00152%   人均4个评分

    相关文章

      网友评论

          本文标题:2019-04-02 movielens数据集整理

          本文链接:https://www.haomeiwen.com/subject/jzdgbqtx.html