Python库介绍

作者: ACphart | 来源:发表于2018-08-01 19:25 被阅读30次

    括号中的数字代表在GitHub上获得的star数量,里面为2018年8月的数据。
    主要参照GItHub上vinta的awesome-python项目 vinta/awesome-python

    机器学习 Machine Learning.

    1. scikit-learn (29647*)

    • The most popular Python library for Machine Learning.

    2. XGboost (12944*)

    • A scalable, portable, and distributed gradient boosting library.

    自然语言处理 Natural Language Processing.

    1. NLTK (6685*)

    • A leading platform for building Python programs to work with human language data.

    2. jieba (14511*)

    • Chinese text segmentation.

    3. gensim (7480*)

    • Topic Modelling for Humans.

    4. Pattern (6412*)

    • A web mining module for the Python.

    5. TextBlob (5335*)

    • Providing a consistent API for diving into common NLP tasks.

    6. spaCy (10178*)

    • A library for industrial-strength natural language processing in Python and Cython.

    7. SnowNLP (3344*)

    • A library for processing Chinese text.

    8. PyTorch-NLP (982*)

    • Supporting Rapid Prototyping with a Toolkit .

    推荐系统 Recommendation System.

    1. Surprise (2043*)

    • A scikit for building and analyzing recommender systems.

    2. spotlight (1122*)

    • Deep recommender models using PyTorch.

    3. lightFM (1858*)

    • A Python implementation of a number of popular recommendation algorithms.

    4. annoy (3857*)

    • Approximate Nearest Neighbors in C++/Python optimized for memory usage.

    科学计算 Scientific Computing.

    1. Numpy (7852*)

    • A fundamental package for scientific computing with Python.

    2. Sympy (5027*)

    • A computer algebra system written in pure Python

    3. Scipy (4755*)

    • A Python-based ecosystem of open-source software for mathematics, science, and engineering.

    4. statsmodels (3101*)

    • Statistical modeling and econometrics in Python.

    5. colour (570*)

    • A colour science package implementing a comprehensive number of colour theory transformations and algorithms.

    特殊格式处理 Specific Formats Processing.

    1. tablib (3069*)

    • A module for Tabular Datasets in XLS, CSV, JSON, YAML.

    2. PyPDF2 (2003*)

    • A library capable of splitting, merging and transforming PDF pages.

    3. pdfminer (2978*)

    • A tool for extracting information from PDF documents.

    爬虫 Web Crawling.

    1. Scrapy (28463*)

    • A fast high-level screen scraping and web crawling framework.

    2. Pyspider (11732*)

    • A powerful spider system.

    3. portia (6213*)

    • Visual scraping for Scrapy.

    4. MechanicalSoup (2785*)

    • A Python library for automating interaction with websites.

    计算机视觉 Computer Vision.

    1. OpenCV (26580*)

    • Open Source Computer Vision Library.

    2. SimpleCV (2158*)

    • An open source framework for building computer vision applications.

    数据分析 Data Analysis.

    1. Pandas (15335*)

    • A library providing high-performance, easy-to-use data structures and data analysis tools.

    2. blaze (2435*)

    • NumPy and Pandas interface to Big Data.

    3. orange3 (1395*)

    • Data mining, data visualization, analysis and machine learning through visual programming or scripts.

    数据可视化 Data Visualization.

    1. Matplotlib (7652*)

    • A Python 2D plotting library.

    2. Seaborn (5122*)

    • Statistical data visualization using Matplotlib.

    3. bokeh (7957*)

    • Interactive Web Plotting for Python.

    4. Altair (2614*)

    • Declarative statistical visualization library for Python.

    5. pygal (1902*)

    • A Python SVG Charts Creator.

    6. bqplot (1984*)

    • Interactive Plotting Library for the Jupyter Notebook.

    7. pyQtGraph (1178*)

    • Interactive and realtime 2D/3D/Image plotting and science/engineering widgets.

    8. geoplotlib (656*)

    • python toolbox for visualizing geographical data and making maps.

    深度学习 Deep Learning.

    1. caffe (25051*)

    • A fast open framework for deep learning.

    2. Keras (32096*)

    • A high-level neural networks library and capable of running on top of either TensorFlow or Theano.

    3. MXNet (14756*)

    • A deep learning framework designed for both efficiency and flexibility.

    4. TensorFlow (106421*)

    • The most popular Deep Learning framework created by Google.

    5. PyTorch (17515*)

    • Tensors and Dynamic neural networks in Python with strong GPU acceleration.

    6. Theano (8393*)

    • A library for fast numerical computation.

    7. SerpentAI (4458*)

    • Game agent framework. Use any video game as a deep learning sandbox.

    8. neupy (468*)

    • Running and testing different Artificial Neural Networks algorithms.

    地理位置 Geolocation.

    1. geopy (1977)

    • Python Geocoding Toolbox.

    GUI

    1. Tkinter

    • Tkinter is Python's de-facto standard GUI package.

    2. PyQt

    • Python bindings for the Qt cross-platform application and UI framework, with support for both Qt v4 and Qt v5 frameworks.

    3. Flexx (1666*)

    • Flexx is a pure Python toolkit for creating GUI's, that uses web technology for its rendering.

    游戏开发 Game Development.

    1. Cocos2d (460*)

    • graphic library for games and multimedia, for python language on PC-like hardware

    1. Cocos2d-x (12197*)

    • Cocos2d-x is a suite of open-source, cross-platform, game-development tools used by millions of developers all over the world.

    2. Panda3D (1162*)

    • 3D game engine developed by Disney and maintained by Carnegie Mellon's Entertainment Technology Center. Written in C++, completely wrapped in Python.

    3. Pygame (524*)

    • Pygame is a set of Python modules designed for writing games.

    4. Cocos2d-html5 (2418*)

    • Cocos2d-html5 is a cross-platform 2D game engine written in JavaScript, based on Cocos2d-X and licensed under MIT.

    图像处理 Image Processing.

    1. pillow (5164*)

    • Pillow is the friendly PIL fork.

    2. thumbor (6054*)

    • A smart imaging service. It enables on-demand crop, re-sizing and flipping of images.

    3. scikit-image (2427*)

    • A Python library for (scientific) image processing.

    文本处理 Text Processing.

    1. ftfy (2264*)

    • Makes Unicode text less broken and more consistent automagically.

    2. fuzzywuzzy (4471*)

    • Fuzzy String Matching.

    3. Pypinyin (1127*)

    • Convert Chinese hanzi to pinyin.

    4. textdistance (1200*)

    • Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.

    音频处理类 Audio Manipulating.

    1. eyeD3 (98*)

    • eyeD3 is a Python tool for working with audio files, specifically MP3 files containing ID3 metadata.

    2. pydub (2733*)

    • Manipulate audio with a simple and easy high level interface.

    3. dejavu (3946*)

    • Audio fingerprinting and recognition.

    4. pyAudioAnalysis (1668*)

    • Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications.

    5. m3u8 (429*)

    • A module for parsing m3u8 file.

    相关文章

      网友评论

        本文标题:Python库介绍

        本文链接:https://www.haomeiwen.com/subject/wtwrvftx.html