Pandas is an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language.
Cookbook
Setup notebook for pandas + bokeh
import os
import pandas as pd
import numpy as np
from bokeh.io import output_notebook, show
# set up some globals we'll use as base paths
BASE_DIR = os.path.realpath(os.path.join(os.getcwd(), '..'))
DATA_DIR = os.path.join(BASE_DIR, 'data')
# tell bokeh to output graphs in the notebook
output_notebook()
# depending on the input data, it can be handy to see untruncated output when you have a lot of columns
pd.options.display.max_columns = 100
# output the computed paths
print('base:', BASE_DIR)
print('data:', DATA_DIR)
Reorder columns in a pandas dataframe
df = df[['this', 'is', 'the', 'new', 'order']]
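As a minimal sketch of the reordering above, the snippet below builds a small DataFrame and selects its columns in a new order. The sample data and the names `new_order` and `reordered` are assumptions made for illustration. Note that any column left out of the list is dropped from the result, and a name that does not exist in the frame raises a `KeyError`.

```python
import pandas as pd

# Hypothetical sample data; the column names are for illustration only.
df = pd.DataFrame({"b": [1, 2], "c": [3, 4], "a": [5, 6]})

# Selecting with a list of column names returns a new DataFrame
# whose columns follow the order of that list.
new_order = ["a", "b", "c"]
reordered = df[new_order]

print(list(reordered.columns))
```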
Construct a DataFrame from a list of dicts
import pandas as pd
# `users` is assumed to be an iterable of objects with
# `screen_name` and `name` attributes
ds = []
for user in users:
    ds.append({
        'screen_name': user.screen_name,
        'name': user.name
    })
df = pd.DataFrame(ds)
Create an index afterward
df = df.set_index('screen_name')
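Here is a self-contained sketch of the same pattern that can be run without an external data source. The `User` namedtuple and the two sample users are hypothetical stand-ins for whatever objects `users` holds in practice. The list comprehension is one possible alternative to the explicit loop.

```python
from collections import namedtuple

import pandas as pd

# Hypothetical stand-in for the user objects; any object exposing
# `screen_name` and `name` attributes would work the same way.
User = namedtuple("User", ["screen_name", "name"])
users = [User("alice01", "Alice"), User("bob_k", "Bob")]

# Build one dict per user; each dict becomes one row of the DataFrame.
records = [{"screen_name": u.screen_name, "name": u.name} for u in users]
df = pd.DataFrame(records)

# set_index returns a new DataFrame, so reassign the result.
df = df.set_index("screen_name")

print(df)
```

Once `screen_name` is the index, rows can be looked up by label, for example `df.loc["alice01", "name"]`.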