Python库介绍

作者: ACphart | 来源:发表于2018-08-01 19:25 被阅读30次

括号中的数字代表在GitHub上获得的star数量,里面为2018年8月的数据。
主要参照GItHub上vinta的awesome-python项目 vinta/awesome-python

机器学习 Machine Learning.

1. scikit-learn (29647*)

  • The most popular Python library for Machine Learning.

2. XGboost (12944*)

  • A scalable, portable, and distributed gradient boosting library.

自然语言处理 Natural Language Processing.

1. NLTK (6685*)

  • A leading platform for building Python programs to work with human language data.

2. jieba (14511*)

  • Chinese text segmentation.

3. gensim (7480*)

  • Topic Modelling for Humans.

4. Pattern (6412*)

  • A web mining module for the Python.

5. TextBlob (5335*)

  • Providing a consistent API for diving into common NLP tasks.

6. spaCy (10178*)

  • A library for industrial-strength natural language processing in Python and Cython.

7. SnowNLP (3344*)

  • A library for processing Chinese text.

8. PyTorch-NLP (982*)

  • Supporting Rapid Prototyping with a Toolkit .

推荐系统 Recommendation System.

1. Surprise (2043*)

  • A scikit for building and analyzing recommender systems.

2. spotlight (1122*)

  • Deep recommender models using PyTorch.

3. lightFM (1858*)

  • A Python implementation of a number of popular recommendation algorithms.

4. annoy (3857*)

  • Approximate Nearest Neighbors in C++/Python optimized for memory usage.

科学计算 Scientific Computing.

1. Numpy (7852*)

  • A fundamental package for scientific computing with Python.

2. Sympy (5027*)

  • A computer algebra system written in pure Python

3. Scipy (4755*)

  • A Python-based ecosystem of open-source software for mathematics, science, and engineering.

4. statsmodels (3101*)

  • Statistical modeling and econometrics in Python.

5. colour (570*)

  • A colour science package implementing a comprehensive number of colour theory transformations and algorithms.

特殊格式处理 Specific Formats Processing.

1. tablib (3069*)

  • A module for Tabular Datasets in XLS, CSV, JSON, YAML.

2. PyPDF2 (2003*)

  • A library capable of splitting, merging and transforming PDF pages.

3. pdfminer (2978*)

  • A tool for extracting information from PDF documents.

爬虫 Web Crawling.

1. Scrapy (28463*)

  • A fast high-level screen scraping and web crawling framework.

2. Pyspider (11732*)

  • A powerful spider system.

3. portia (6213*)

  • Visual scraping for Scrapy.

4. MechanicalSoup (2785*)

  • A Python library for automating interaction with websites.

计算机视觉 Computer Vision.

1. OpenCV (26580*)

  • Open Source Computer Vision Library.

2. SimpleCV (2158*)

  • An open source framework for building computer vision applications.

数据分析 Data Analysis.

1. Pandas (15335*)

  • A library providing high-performance, easy-to-use data structures and data analysis tools.

2. blaze (2435*)

  • NumPy and Pandas interface to Big Data.

3. orange3 (1395*)

  • Data mining, data visualization, analysis and machine learning through visual programming or scripts.

数据可视化 Data Visualization.

1. Matplotlib (7652*)

  • A Python 2D plotting library.

2. Seaborn (5122*)

  • Statistical data visualization using Matplotlib.

3. bokeh (7957*)

  • Interactive Web Plotting for Python.

4. Altair (2614*)

  • Declarative statistical visualization library for Python.

5. pygal (1902*)

  • A Python SVG Charts Creator.

6. bqplot (1984*)

  • Interactive Plotting Library for the Jupyter Notebook.

7. pyQtGraph (1178*)

  • Interactive and realtime 2D/3D/Image plotting and science/engineering widgets.

8. geoplotlib (656*)

  • python toolbox for visualizing geographical data and making maps.

深度学习 Deep Learning.

1. caffe (25051*)

  • A fast open framework for deep learning.

2. Keras (32096*)

  • A high-level neural networks library and capable of running on top of either TensorFlow or Theano.

3. MXNet (14756*)

  • A deep learning framework designed for both efficiency and flexibility.

4. TensorFlow (106421*)

  • The most popular Deep Learning framework created by Google.

5. PyTorch (17515*)

  • Tensors and Dynamic neural networks in Python with strong GPU acceleration.

6. Theano (8393*)

  • A library for fast numerical computation.

7. SerpentAI (4458*)

  • Game agent framework. Use any video game as a deep learning sandbox.

8. neupy (468*)

  • Running and testing different Artificial Neural Networks algorithms.

地理位置 Geolocation.

1. geopy (1977)

  • Python Geocoding Toolbox.

GUI

1. Tkinter

  • Tkinter is Python's de-facto standard GUI package.

2. PyQt

  • Python bindings for the Qt cross-platform application and UI framework, with support for both Qt v4 and Qt v5 frameworks.

3. Flexx (1666*)

  • Flexx is a pure Python toolkit for creating GUI's, that uses web technology for its rendering.

游戏开发 Game Development.

1. Cocos2d (460*)

  • graphic library for games and multimedia, for python language on PC-like hardware

1. Cocos2d-x (12197*)

  • Cocos2d-x is a suite of open-source, cross-platform, game-development tools used by millions of developers all over the world.

2. Panda3D (1162*)

  • 3D game engine developed by Disney and maintained by Carnegie Mellon's Entertainment Technology Center. Written in C++, completely wrapped in Python.

3. Pygame (524*)

  • Pygame is a set of Python modules designed for writing games.

4. Cocos2d-html5 (2418*)

  • Cocos2d-html5 is a cross-platform 2D game engine written in JavaScript, based on Cocos2d-X and licensed under MIT.

图像处理 Image Processing.

1. pillow (5164*)

  • Pillow is the friendly PIL fork.

2. thumbor (6054*)

  • A smart imaging service. It enables on-demand crop, re-sizing and flipping of images.

3. scikit-image (2427*)

  • A Python library for (scientific) image processing.

文本处理 Text Processing.

1. ftfy (2264*)

  • Makes Unicode text less broken and more consistent automagically.

2. fuzzywuzzy (4471*)

  • Fuzzy String Matching.

3. Pypinyin (1127*)

  • Convert Chinese hanzi to pinyin.

4. textdistance (1200*)

  • Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.

音频处理类 Audio Manipulating.

1. eyeD3 (98*)

  • eyeD3 is a Python tool for working with audio files, specifically MP3 files containing ID3 metadata.

2. pydub (2733*)

  • Manipulate audio with a simple and easy high level interface.

3. dejavu (3946*)

  • Audio fingerprinting and recognition.

4. pyAudioAnalysis (1668*)

  • Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications.

5. m3u8 (429*)

  • A module for parsing m3u8 file.

相关文章

网友评论

    本文标题:Python库介绍

    本文链接:https://www.haomeiwen.com/subject/wtwrvftx.html