Sum values of two CSV files and write the results to a new file

Joel Baumert 2019-07-03 20:37

Pandas has good support for reading and manipulating CSV files. Concatenate the two files, group the rows by link, and sum the rate columns:


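    import pandas as pd

    # Stack the rows of both files into a single DataFrame
    df = pd.concat([
        pd.read_csv('csv1.csv'),
        pd.read_csv('csv2.csv')
    ])

    # Sum per link; numeric_only=True leaves out the text column 'node'
    result = df.groupby('link', as_index=False).sum(numeric_only=True)
    result['node'] = 'allnode'

    # index=False keeps the row index out of the output file
    result.to_csv('result.csv', index=False)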

Where csv1.csv is:

    node,link,rate-in,rate-out
    node1,link1,10,20
    node1,link2,30,50
    node1,link3,40,60

And csv2.csv is:

    node,link,rate-in,rate-out
    node2,link1,20,10
    node2,link2,50,70
    node2,link3,80,40

The result.csv file will contain:


    link,rate-in,rate-out,node
    link1,30,30,allnode
    link2,80,120,allnode
    link3,120,100,allnode
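
To check this output without creating any files, here is a minimal sketch that feeds the same sample data through io.StringIO. The names csv1_text, csv2_text and output are illustrative and not part of the answer above. It selects the two rate columns explicitly, which is one way to keep the text column out of the sum.

```python
import io

import pandas as pd

# Same sample data as csv1.csv and csv2.csv, held in memory
csv1_text = """node,link,rate-in,rate-out
node1,link1,10,20
node1,link2,30,50
node1,link3,40,60
"""
csv2_text = """node,link,rate-in,rate-out
node2,link1,20,10
node2,link2,50,70
node2,link3,80,40
"""

df = pd.concat([
    pd.read_csv(io.StringIO(csv1_text)),
    pd.read_csv(io.StringIO(csv2_text)),
])

# Sum only the two rate columns per link
result = df.groupby('link', as_index=False)[['rate-in', 'rate-out']].sum()
result['node'] = 'allnode'

# to_csv returns the CSV text as a string when no path is given
output = result.to_csv(index=False)
print(output)
```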

If you want to reorder the columns, select them in the desired order before writing the file:


    result[[
        'node','link','rate-in','rate-out'
    ]].to_csv('result.csv', index=False)
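
As an alternative sketch, DataFrame.insert can place the constant column at position 0 directly, so no reordering is needed afterwards. The small frame below is only a stand-in for the grouped sums, built by hand for illustration.

```python
import pandas as pd

# Stand-in for the grouped sums (illustrative data, matches the sample output)
result = pd.DataFrame({
    'link': ['link1', 'link2', 'link3'],
    'rate-in': [30, 80, 120],
    'rate-out': [30, 120, 100],
})

# insert(loc, column, value) adds the column in place at the given position
result.insert(0, 'node', 'allnode')

columns = list(result.columns)
print(columns)
```

Note that insert modifies the frame in place and raises an error if a column with that name already exists.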

subok 2019-07-17 20:59:49

Small question: in result.csv the node column is put at the end. Is it possible to make it the first column?
