当你学习机器学习时,最好的就是处理实际数据,而不是人造数据,幸运的是,有成千上万个开放的数据集可供选择,涉及各种领域。
-
流行的开放数据仓库
UC Irvine Machine Learning Repositor
网址:http://archive.ics.uci.edu/ml/
Kaggle datasets
网址:https://www.kaggle.com/datasets
Amazon's AWS datasets
网址:https://registry.opendata.aws/ -
元数据门户(列出了开放数据仓库)
http://dataportals.org/
http://opendatamonitor.eu/
http://quandl.com/ -
其他开放数据仓库
Wikipedia’s list of Machine Learning datasets
网址:https://homl.info/9
Quora.com question
网址:https://homl.info/10
Datasets subreddit
网址:https://www.reddit.com/r/datasets -
参考文献
Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow Concepts, Tools, and Techniques to Build Intelligent Systems(Second Edition) -
内容来自微信公众号:叶开随笔
网友评论