skdata is a Python library of standard data sets for machine learning experiments. Its modules download data sets, load them as directly as possible into Python data structures, and provide convenient views for standard machine learning problems.
The recommended installation method is to clone the master branch and install the development head:
git clone https://github.com/jaberg/skdata
cd skdata
python setup.py develop
The goal of skdata is to standardize the representation of community benchmark data sets (including large and awkward ones) and to facilitate the development of broadly applicable machine learning algorithm implementations. It is meant to interoperate with other Python machine learning software such as sklearn and pandas.
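The download, load, and view layering described at the top can be illustrated with a minimal, self-contained sketch. This is not skdata's actual API: the names ToyDataset, ClassificationView, and RAW_CSV are hypothetical, the data is a tiny made-up table, and the download step is replaced by an in-memory string so the example runs without network access.

```python
import csv
import io

# Hypothetical stand-in for a downloaded file; a real data set module would
# fetch and cache this from the network instead.
RAW_CSV = """sepal_length,sepal_width,label
5.1,3.5,setosa
4.9,3.0,setosa
7.0,3.2,versicolor
6.4,3.2,versicolor
6.3,3.3,virginica
5.8,2.7,virginica
"""


class ToyDataset:
    """Hypothetical data set class: loads records as plain Python dicts."""

    def fetch(self):
        # Placeholder for the download step (no network access here).
        return RAW_CSV

    def load(self):
        # Stay close to the raw file: one dict per row, numbers as floats.
        rows = csv.DictReader(io.StringIO(self.fetch()))
        return [
            {
                "features": [float(r["sepal_length"]), float(r["sepal_width"])],
                "label": r["label"],
            }
            for r in rows
        ]


class ClassificationView:
    """Hypothetical view: exposes the records as a train/test classification task."""

    def __init__(self, dataset, n_test=2):
        records = dataset.load()
        # Deterministic split: hold out the last n_test records.
        self.train = self._split(records[:-n_test])
        self.test = self._split(records[-n_test:])

    @staticmethod
    def _split(records):
        # Separate feature vectors from labels.
        X = [r["features"] for r in records]
        y = [r["label"] for r in records]
        return X, y


view = ClassificationView(ToyDataset())
X_train, y_train = view.train
X_test, y_test = view.test
print(len(X_train), len(X_test))
```

Keeping the loader close to the raw records and putting the task-specific split in a separate view class means the same data set could back several problem formulations. The resulting plain lists are the kind of structure that libraries such as sklearn and pandas generally accept as input.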