如何使用python进行数据表的合并 !

作者: 14e61d025165 | 来源:发表于2019-06-29 15:26 被阅读2次

如何使用python进行数据表的合并 !
python编程练习5
Python基础(9) - 列表和元组合并成字典
第二章-第二三节
像Excel一样使用Python（二）
AVFoundation详细解析（一）视频合并与混音
myspl模块化
Python:网络编程
Python多进程与多线程编程及GIL详解！
使用python-docx生成Word文档

案例背景

<tt-image data-tteditor-tag="tteditorTag" contenteditable="false" class="syl1561793173999 ql-align-center" data-render-status="finished" data-syl-blot="image" style="box-sizing: border-box; cursor: text; text-align: left; color: rgb(34, 34, 34); font-family: "PingFang SC", "Hiragino Sans GB", "Microsoft YaHei", "WenQuanYi Micro Hei", "Helvetica Neue", Arial, sans-serif; font-size: 16px; font-style: normal; font-variant-ligatures: normal; font-variant-caps: normal; font-weight: 400; letter-spacing: normal; orphans: 2; text-indent: 0px; text-transform: none; white-space: pre-wrap; widows: 2; word-spacing: 0px; -webkit-text-stroke-width: 0px; background-color: rgb(255, 255, 255); text-decoration-style: initial; text-decoration-color: initial; display: block;">

image

假设我们的文件是放在G盘 python文件夹下单的projectFile文件夹中，具体的情况需根据读者文件位置进行设置

我们需要将下面两个文件，合并在一起

<tt-image data-tteditor-tag="tteditorTag" contenteditable="false" class="syl1561793174006 ql-align-center" data-render-status="finished" data-syl-blot="image" style="box-sizing: border-box; cursor: text; text-align: left; color: rgb(34, 34, 34); font-family: "PingFang SC", "Hiragino Sans GB", "Microsoft YaHei", "WenQuanYi Micro Hei", "Helvetica Neue", Arial, sans-serif; font-size: 16px; font-style: normal; font-variant-ligatures: normal; font-variant-caps: normal; font-weight: 400; letter-spacing: normal; orphans: 2; text-indent: 0px; text-transform: none; white-space: pre-wrap; widows: 2; word-spacing: 0px; -webkit-text-stroke-width: 0px; background-color: rgb(255, 255, 255); text-decoration-style: initial; text-decoration-color: initial; display: block;">

image

Python学习交流群：1004391443

合并前，data_1.csv的数据

<tt-image data-tteditor-tag="tteditorTag" contenteditable="false" class="syl1561793174014 ql-align-center" data-render-status="finished" data-syl-blot="image" style="box-sizing: border-box; cursor: text; text-align: left; color: rgb(34, 34, 34); font-family: "PingFang SC", "Hiragino Sans GB", "Microsoft YaHei", "WenQuanYi Micro Hei", "Helvetica Neue", Arial, sans-serif; font-size: 16px; font-style: normal; font-variant-ligatures: normal; font-variant-caps: normal; font-weight: 400; letter-spacing: normal; orphans: 2; text-indent: 0px; text-transform: none; white-space: pre-wrap; widows: 2; word-spacing: 0px; -webkit-text-stroke-width: 0px; background-color: rgb(255, 255, 255); text-decoration-style: initial; text-decoration-color: initial; display: block;">

image

合并前，data_2.csv的数据

<tt-image data-tteditor-tag="tteditorTag" contenteditable="false" class="syl1561793174020 ql-align-center" data-render-status="finished" data-syl-blot="image" style="box-sizing: border-box; cursor: text; text-align: left; color: rgb(34, 34, 34); font-family: "PingFang SC", "Hiragino Sans GB", "Microsoft YaHei", "WenQuanYi Micro Hei", "Helvetica Neue", Arial, sans-serif; font-size: 16px; font-style: normal; font-variant-ligatures: normal; font-variant-caps: normal; font-weight: 400; letter-spacing: normal; orphans: 2; text-indent: 0px; text-transform: none; white-space: pre-wrap; widows: 2; word-spacing: 0px; -webkit-text-stroke-width: 0px; background-color: rgb(255, 255, 255); text-decoration-style: initial; text-decoration-color: initial; display: block;">

image

实现代码如下：

#先导入需要的包

import pandas as pd

import csv

import sys

import glob

#定义一个文件存放位置变量。

input_path= 'G:\Python\projectFile'

#使用glob.glob的方法对所有data_开头的文件进行获取

<pre spellcheck="false" style="box-sizing: border-box; margin: 5px 0px; padding: 5px 10px; border: 0px; font-style: normal; font-variant-ligatures: normal; font-variant-caps: normal; font-variant-numeric: inherit; font-variant-east-asian: inherit; font-weight: 400; font-stretch: inherit; font-size: 16px; line-height: inherit; font-family: inherit; vertical-align: baseline; cursor: text; counter-reset: list-1 0 list-2 0 list-3 0 list-4 0 list-5 0 list-6 0 list-7 0 list-8 0 list-9 0; background-color: rgb(240, 240, 240); border-radius: 3px; white-space: pre-wrap; color: rgb(34, 34, 34); letter-spacing: normal; orphans: 2; text-align: left; text-indent: 0px; text-transform: none; widows: 2; word-spacing: 0px; -webkit-text-stroke-width: 0px; text-decoration-style: initial; text-decoration-color: initial;">all_files= glob.glob(os.path.join(input_path,'data_*'))

创建一个列表，用于接收所有读取的内容

all_data_frames=[]

对获取的所有文件进行遍历

for file in all_files:
#对遍历的内容以csv格式进行读取
data_frame = pd.read_csv(file,index_col=None)
#把读取到的内容，增加到all_data_frames列表中
all_data_frames.append(data_frame)

对放在列表中的内容进行拼接，axis参数为合并方向,0是纵向,1是横向

data_frame_concat=pd.concat(all_data_frames,axis=0,
ignore_index=True)

将合并后的文件，输出到新文件data_concat_output_file中

</pre>

data_frame_concat.to_csv('G:\Python\projectFile\data_concat_output_file.csv',index=False)

<tt-image data-tteditor-tag="tteditorTag" contenteditable="false" class="syl1561793174050" data-render-status="finished" data-syl-blot="image" style="box-sizing: border-box; cursor: text; color: rgb(34, 34, 34); font-family: "PingFang SC", "Hiragino Sans GB", "Microsoft YaHei", "WenQuanYi Micro Hei", "Helvetica Neue", Arial, sans-serif; font-size: 16px; font-style: normal; font-variant-ligatures: normal; font-variant-caps: normal; font-weight: 400; letter-spacing: normal; orphans: 2; text-align: left; text-indent: 0px; text-transform: none; white-space: pre-wrap; widows: 2; word-spacing: 0px; -webkit-text-stroke-width: 0px; background-color: rgb(255, 255, 255); text-decoration-style: initial; text-decoration-color: initial; display: block;">

image

设计思路，在这个案例中，我们将要合并的文件，读取后转化为列表的元素，再进行合并。

总结

这种方法也可以用于几百上千个文件需要合并到一起的情况。

如果需要合并的文件的文件名称并不规则，那么我们可以先修改文件名称（给文件名加一个统一的前缀），再进行以上操作。想了解更多操作技巧，可关注公众号，后期将会有更多内容与大家分享。

网友评论

本文标题：如何使用python进行数据表的合并 !

本文链接：https://www.haomeiwen.com/subject/ltepcctx.html

延伸阅读

深度阅读

您也可以注册成为美文阅读网的作者，发表您的原创作品、分享您的心情！

如何使用python进行数据表的合并 !

创建一个列表，用于接收所有读取的内容

对获取的所有文件进行遍历

对放在列表中的内容进行拼接，axis参数为合并方向,0是纵向,1是横向

将合并后的文件，输出到新文件data_concat_output_file中

相关文章

如何使用python进行数据表的合并 !

python编程练习5

Python基础(9) - 列表和元组合并成字典

第二章-第二三节

像Excel一样使用Python（二）

AVFoundation详细解析（一）视频合并与混音

myspl模块化

Python:网络编程

Python多进程与多线程编程及GIL详解！

使用python-docx生成Word文档

网友评论

延伸阅读

深度阅读

栏目导航

热点阅读

大数据爬虫Python AI Sql

Python小哥哥