Softpedia
 


LINUX CATEGORIES:



GLOBAL PAGES >>
NEWS ARCHIVE >>
SOFTPEDIA REVIEWS >>
MEET THE EDITORS >>
WEEK'S BEST
  • Linux Kernel 3.9.3 / 3....
  • LibreOffice 3.6.6 / 4.0.3
  • MPlayer 1.1.1
  • systemd 204
  • Arch Linux 2013.05.01
  • Blender 2.67a
  • KDE Software Compilatio...
  • CrunchBang Linux Stable...
  • Elementary OS 0.1 / 0.2...
  • SystemRescueCd 3.6.0
  • Home > Linux > Utilities

    data_hacks 0.2

    Download button

    No screenshots available
    Downloads: 211  View global page NEW!  Tell us about an update
    User Rating:
    Rated by:
    NOT RATED
    0 user(s)
    Developer:

    License / Price:

    Last Updated:

    Category:
    Jehiah Czebotar | More programs
    Other/Proprietary Li... / FREE
    October 25th, 2010, 09:20 GMT
    ROOT / Utilities

     Read user reviews (0)  Refer to a friend  Subscribe

    data_hacks description

    Command line utilities for data analysis

    The data_hacks package provides command-line utilities for data analysis.

    Installing: pip install data_hacks

    Installing form github pip install -e git://github.com/bitly/data_hacks.git#egg=data_hacks

    Installing from source python setup.py install

    data_hacks are friendly. Ask them for usage information with --help

    histogram.py

    A utility that parses input data points and outputs a text histogram

    Example:

    cat /tmp/data | histogram.py
    # NumSamples = 29; Max = 10.00; Min = 1.00
    # Mean = 4.379310; Variance = 5.131986; SD = 2.265389
    # each * represents a count of 1
     1.0000 - 1.9000 [ 1]: *
     1.9000 - 2.8000 [ 5]: *****
     2.8000 - 3.7000 [ 8]: ********
     3.7000 - 4.6000 [ 3]: ***
     4.6000 - 5.5000 [ 4]: ****
     5.5000 - 6.4000 [ 2]: **
     6.4000 - 7.3000 [ 3]: ***
     7.3000 - 8.2000 [ 1]: *
     8.2000 - 9.1000 [ 1]: *
     9.1000 - 10.0000 [ 1]: *


    ninety_five_percent.py

    A utility script that takes a stream of decimal values and outputs the 95% time.

    This is useful for finding the 95% response time from access logs.

    Example (assuming response time is the last column in your access log):

     cat access.log | awk '{print $NF}' | ninety_five_percent.py

    sample.py

    Filter a stream to a random sub-sample of the stream

    Example:

     cat access.log | sample.py 10% | post_process.py

    run_for.py

    Pass through data for a specified amount of time

    Example:

     tail -f access.log | run_for.py 10s | post_process.py

    bar_chart.py

    Generate an ascii bar chart for input data (this is like a visualization of uniq -c)

    cat data | bar_chart.py --sort-keys
    # each * represents a count of 2
    19:0 [ 1]
    19:1 [ 24] ************
    19:2 [ 3] *
    19:3 [ 9] ****
    19:4 [ 5] **
    19:5 [ 41] ********************
    20:0 [ 115] *********************************************************
    20:1 [ 181] ******************************************************************************************
    20:2 [ 136] ********************************************************************
    20:3 [ 155] *****************************************************************************
    20:4 [ 150] ***************************************************************************
    20:5 [ 79] ***************************************
    21:0 [ 64] ********************************
    21:1 [ 8] ****



    Product's homepage

    Requirements:

    · Python

      


    TAGS:

    data analysis | command-line utilities | data | analysis | analyzer

    Go to top

    WindowsGamesDriversMacLinuxScriptsMobileHandheldNews

    SUBMIT PROGRAM   |   ADVERTISE   |   GET HELP   |   SEND US FEEDBACK   |   RSS FEEDS   |   UPDATE YOUR SOFTWARE   |   ROMANIAN FORUM