https://grouplens.org/datasets/movielens/
ml100k:
官网简介:Stable benchmark dataset. 100,000 ratings from 1000 users on 1700 movies. Released 4/1998.
下载链接:https://grouplens.org/datasets/movielens/100k/
数据简介:来自1000个用户对1700部电影的100000个评分
稀疏度:5.88% 人均100个评分
ml1m:
官网简介:Stable benchmark dataset. 1 million ratings from 6000 users on 4000 movies. Released 2/2003.
下载链接:https://grouplens.org/datasets/movielens/1m/
数据简介:来自6000个用户对4000部电影的1000000个评分
稀疏度:4.17% 人均166个评分
ml10m:
官网简介:Stable benchmark dataset. 10 million ratings and 100,000 tag applications applied to 10,000 movies by 72,000 users. Released 1/2009.
下载链接:https://grouplens.org/datasets/movielens/10m/
数据简介:来自72000个用户对10000部电影的10000000个评分,有tag
稀疏度:1.39% 人均138个评分
ml20m:
官网简介:Stable benchmark dataset. 20 million ratings and 465,000 tag applications applied to 27,000 movies by 138,000 users. Includes tag genome data with 12 million relevance scores across 1,100 tags. Released 4/2015; updated 10/2016 to update links.csv and add tag genome data.
下载链接:https://grouplens.org/datasets/movielens/20m/
数据简介:来自138000个用户对27000部电影的20000000个评分,有tag
稀疏度:0.537% 人均144个评分
ml-latest:正在更新,不建议作为研究报告结果使用
Book-Crossing:
官网简介:The BookCrossing (BX) dataset was collected by Cai-Nicolas Ziegler in a 4-week crawl (August / September 2004) from the Book-Crossing community with kind permission from Ron Hornbaker, CTO of Humankind Systems. It contains 278,858 users (anonymized but with demographic information) providing 1,149,780 ratings (explicit / implicit) about 271,379 books.
下载链接:https://grouplens.org/datasets/book-crossing/
数据简介:来自278,858个用户对271,379部书籍的1,149,780个评分
稀疏度:0.00152% 人均4个评分
网友评论