关键词:Pandas | Python | 数据分析 | 数据处理
感觉目前 Pandas 的官方文档和官方推荐的一些教程比较散乱,所以我对自己常接触的一些点,结合已有文档和教程做了总结。是 Jupyter Notebook 的形式,即看即用。Github地址。有童鞋反映堆了太多例子、缺乏解释,我之后会更新一波给例子补充说明,包括可能的应用场景。
由于简书无法正确显示带HTML的.md,以下仅为目录:
- Design
- Primary Data Structures
- Mutability and copying of data
- Index
- Series
- DataFrame
- Conversion: Dict
- Conversion: Numpy Array
- Reset Index
- File IO
- Input
- Output
- Excel with Style
- Indexing / Selecting / Slicing
- []
- loc / iloc
- single index
- tuple index
- hierarchical index
- iloc
- Boolean Filtering
- sample
- Math
- Unary Operation
- Binary Operation
- Iteration
- Along axis
- Apply / Map
- Apply with Progress Bar
- Group
- Structure
- Iteration
- Grouped by Time Period
- Group + Apply
- Missing Data
- Performance
- Time
- Dependencies
- Use Wisely
- Parallelize
- Cython / Numba / pandas.eval
- Space
- Time
网友评论