Processing large data with Pandas
Data sets can get large quickly. You can quickly go from looking at:
- a few 100 lines and a handful of columns to…
- a million lines and with hundred of columns.
Python Pandas (with smart use of Categories) can enable one to reduce the size of ones data in memory by up to 90%.
This repository contains a tutorial and supporting scripts to showcase the power of python pandas with categories.
Python Talk: Python Talk