[Python] Normalize the data with Pandas

Posted Answer1215

tags:

篇首语:本文由小常识网(cha138.com)小编为大家整理,主要介绍了[Python] Normalize the data with Pandas相关的知识,希望对你有一定的参考价值。

import os
import pandas as pd
import matplotlib.pyplot as plt

def test_run():
    start_date=\'2017-01-01\'
    end_data=\'2017-12-15\'
    dates=pd.date_range(start_date, end_data)

    # Create an empty data frame
    df=pd.DataFrame(index=dates)

    symbols=[\'SPY\', \'AAPL\', \'IBM\', \'GOOG\', \'GLD\']
    for symbol in symbols:
        temp=getAdjCloseForSymbol(symbol)
        df=df.join(temp, how=\'inner\')

    return df   


def normalize_data(df):
    """ Normalize stock prices using the first row of the dataframe """
    df=df/df.ix[0, :]
    return df


def getAdjCloseForSymbol(symbol): 
    # Load csv file
    temp=pd.read_csv("data/{0}.csv".format(symbol), 
        index_col="Date", 
        parse_dates=True,
        usecols=[\'Date\', \'Adj Close\'],
        na_values=[\'nan\'])
    # rename the column
    temp=temp.rename(columns={\'Adj Close\': symbol})
    return temp

def plot_data(df, title="Stock prices"):
    ax=df.plot(title=title, fontsize=10)
    ax.set_xlabel("Date")
    ax.set_ylabel("Price")
    plt.show()


if __name__ == \'__main__\':
    df=test_run()
    # data=data.ix[\'2017-12-01\':\'2017-12-15\', [\'IBM\', \'GOOG\']]    
    df=normalize_data(df)
    plot_data(df)
    """
                       IBM         GOOG
    2017-12-01  154.759995  1010.169983
    2017-12-04  156.460007   998.679993
    2017-12-05  155.350006  1005.150024
    2017-12-06  154.100006  1018.380005
    2017-12-07  153.570007  1030.930054
    2017-12-08  154.809998  1037.050049
    2017-12-11  155.410004  1041.099976
    2017-12-12  156.740005  1040.479980
    2017-12-13  153.910004  1040.609985
    2017-12-15  152.500000  1064.189941
    """

It is easy to compare the data by normalize it.

 

以上是关于[Python] Normalize the data with Pandas的主要内容,如果未能解决你的问题,请参考以下文章

在 Python 2.7 中使用 unicodedata.normalize

更新 Python sklearn Lasso(normalize=True) 以使用管道

python normalize_phone.py

Python:json_normalize 一个熊猫系列给出了 TypeError

Pandas json_normalize 无法在 Python 中使用大型 JSON 文件

CodeForces - 950DA Leapfrog in the Array