Softpedia
 



    Gloo 0.1.2

    Developer: Trent Hauck
    License / Price: Other/Proprietary Li... / FREE
    Last Updated: July 11th, 2012, 02:53 GMT
    Category: ROOT / Utilities
    Downloads: 151
    User Rating: NOT RATED (0 users)

    Gloo description

    Project management for data analysis projects

    Gloo provides utilities and functions for managing data projects in Python. It requires IPython and Pandas.

    A quick workflow example:

    from gloo import interactive

    interactive.create_project("MyProject")

    # Now, if we have some scripts to use and some data in the data folder,
    # we can load the project.

    interactive.load_project()


    Introduction

    Gloo's goal is to tie together the data analysis tasks that come up regularly and make that process easy: automatically loading data into the IPython environment, running scripts, and making utility functions available. These things have to be done often, but they aren't the fun part.

    What Happens When You Call create_project("MyProject")

    create_project(project_name = "MyProject", **kwds)

    project_name: A string giving the name of your project.

    Current Config Options:

     full_structure: A boolean. If True, the full folder structure outlined below is created. Defaults to True.

     packages: A list of strings naming the Python packages to load when load_project() is called. Defaults to empty.

     logging: A boolean that controls whether logging starts when load_project() is called. Defaults to False.

     git: A boolean that controls whether a git repo is initialized. Defaults to False.

    Those options are saved to a JSON file called .config.json at the root of the project directory.
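
    To make the option handling concrete, here is a minimal sketch of how defaults and keyword overrides could be merged and saved to .config.json. It is not Gloo's actual implementation: the helper names write_config and read_config are hypothetical, and the JSON keys are assumed to mirror the option names listed above.

```python
import json
from pathlib import Path

# Defaults as described in the option list above.
DEFAULT_CONFIG = {
    "full_structure": True,
    "packages": [],
    "logging": False,
    "git": False,
}


def write_config(project_dir, **kwds):
    """Hypothetical helper: merge overrides into the defaults and save them."""
    unknown = set(kwds) - set(DEFAULT_CONFIG)
    if unknown:
        raise ValueError("unknown config options: %s" % sorted(unknown))
    config = dict(DEFAULT_CONFIG, **kwds)
    project_dir = Path(project_dir)
    project_dir.mkdir(parents=True, exist_ok=True)
    config_path = project_dir / ".config.json"
    config_path.write_text(json.dumps(config, indent=2))
    return config_path


def read_config(project_dir):
    """Hypothetical helper: load the saved options back into a dictionary."""
    return json.loads((Path(project_dir) / ".config.json").read_text())
```

    Calling write_config("MyProject", git=True) would store the three remaining defaults alongside the git override, and read_config("MyProject") would return them as a plain dictionary.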

    What Happens When You Call load_project()

    load_project()

    1. The config is loaded into a dictionary.
    2. Data in the data directory is loaded into the environment. This is done recursively, so you can have subdirectories. If you do, the name of the parent folder is prepended to the name of the data file, in the form folder_file. The plan is to make the prepending optional.
    3. Files in the munge directory are run. This folder is where you would put files necessary for preprocessing the data.
    4. Files in the lib directory are imported. This folder is where you would put files that you would like to load as a module.
    5. Packages specified in the config are loaded into the environment.
    6. Logging starts, if enabled in the config.
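
    Steps 2 and 3 above can be sketched roughly as follows. This is an illustration under stated assumptions, not Gloo's own code: the function names are hypothetical, the underscore separator follows the folder_file form mentioned in step 2, and running munge scripts in sorted filename order is an assumption. The sketch only works out the names; actually reading each file (for example with pandas.read_csv) is left out.

```python
import os
import runpy


def data_variable_names(data_dir):
    """Hypothetical helper: map environment names to data file paths.

    Files directly in data_dir keep their base name; files in a
    subdirectory get the parent folder prepended (folder_file).
    """
    names = {}
    top = os.path.abspath(data_dir)
    for root, dirs, files in os.walk(data_dir):
        dirs.sort()  # visit subdirectories in a stable order
        for filename in sorted(files):
            stem = os.path.splitext(filename)[0]
            if os.path.abspath(root) == top:
                name = stem
            else:
                name = os.path.basename(root) + "_" + stem
            names[name] = os.path.join(root, filename)
    return names


def run_munge_scripts(munge_dir, namespace):
    """Hypothetical helper: run each .py file in munge_dir in one namespace.

    Assumption: scripts run in sorted filename order.
    """
    for filename in sorted(os.listdir(munge_dir)):
        if not filename.endswith(".py"):
            continue
        script = os.path.join(munge_dir, filename)
        result = runpy.run_path(script, init_globals=namespace)
        # Keep only the names the script defined, not module internals.
        namespace.update(
            {k: v for k, v in result.items() if not k.startswith("__")}
        )
    return namespace
```

    With data/sales.csv and data/raw/sales.csv on disk, data_variable_names("data") yields the names sales and raw_sales, so the two files do not collide.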

    Folder Structure

    The full structure is as follows:

    data/ : data
    doc/ : documentation
    diagnostics/ : automatically check for data issues
    graphs/ : graph domicile
    lib/ : utility functions
    munge/ : preprocessing scripts
    profiling/ : benchmark performance
    reports/ : reports you'll produce
    tests/ : tests
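
    As a rough illustration of what creating the full structure involves, the sketch below makes each of the folders listed above under a project root. The function name make_structure is hypothetical, and this is not taken from Gloo's source.

```python
from pathlib import Path

# Folder names taken from the list above.
FULL_STRUCTURE = [
    "data", "doc", "diagnostics", "graphs", "lib",
    "munge", "profiling", "reports", "tests",
]


def make_structure(project_dir):
    """Hypothetical helper: create the project folders and return their paths."""
    project_dir = Path(project_dir)
    created = []
    for name in FULL_STRUCTURE:
        path = project_dir / name
        # exist_ok lets the function be re-run on an existing project.
        path.mkdir(parents=True, exist_ok=True)
        created.append(path)
    return created
```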


    Contributing

    Because this project is at such an early stage, I would love for anybody and everybody to contribute. I think this could be very valuable for those working with Python on data projects.




    TAGS:

    project management | data analysis | project | management | data
