Softpedia
 


LINUX CATEGORIES:



GLOBAL PAGES >>
NEWS ARCHIVE >>
SOFTPEDIA REVIEWS >>
MEET THE EDITORS >>
WEEK'S BEST
  • Linux Kernel 3.9.6 / 3....
  • Linux Kernel 3.0.82 LTS...
  • KDE Software Compilatio...
  • PulseAudio 4.0
  • Wireshark 1.10.0
  • NetworkManager 0.9.8.2
  • LibreOffice 3.6.6 / 4.0...
  • SystemRescueCd 3.7.0
  • Linux Kernel 3.10 RC6
  • Ubuntu Tweak 0.8.5
  • Home > Linux > Programming > Perl Modules

    Bio::SAGE::DataProcessing 1.20

    Download button

    No screenshots available
    Downloads: 402  View global page NEW!  Tell us about an update
    User Rating:
    Rated by:
    NOT RATED
    0 user(s)
    Developer:

    License / Price:

    Last Updated:

    Category:
    Scott Zuyderduyn | More programs
    Perl Artistic License / FREE
    July 15th, 2007, 03:05 GMT
    ROOT / Programming / Perl Modules

     Read user reviews (0)  Refer to a friend  Subscribe

    Bio::SAGE::DataProcessing description

    A module which processes raw serial analysis of gene expression (SAGE) data.

    Bio::SAGE::DataProcessing module processes raw serial analysis of gene expression (SAGE) data.

    SYNOPSIS

    use Bio::SAGE::DataProcessing;
    $sage = Bio::SAGE::DataProcessing->new();

    # open sequence and quality files
    open( READS, "library.fasta" );
    open( QUAL, "library.qual.fasta" );

    # collect ditags and statistics from reads
    $sage->process_library( *READS, *QUAL );

    # close files
    close( READS );
    close( QUAL );

    # output tags in descending order of expression
    my %tags = %{$sage->get_tagcounts()};
    open( TAGS, ">library.tags" );
    map { print TAGS join( "t", $_, $tags{$_} ) . "n" } sort { $tags{$b} $tags{$a} } keys %tags;
    close( TAGS );

    # tag AAACCGGGTT matches two different genes
    # so 15th base pair may help resolve this
    $sage->print_extra_base_calculation( $sage->get_extract_base_calculation( "AAACCGGGTT" ) );

    This module provides several tools for processing and analyzing serial analysis of gene expression (SAGE) libraries.

    Serial analysis of gene expression (SAGE) is a molecular technique for generating a near-global snapshot of a cell population’s transcriptome. Briefly, the technique extracts short sequences at defined positions of transcribed mRNA. These short sequences are then paired to form ditags. The ditags are concatamerized to form long sequences that are then cloned. The cloned DNA is then sequenced. Bioinformatic techniques are then employed to determine the original short tag sequences, and to derive their progenitor mRNA. The number of times a particular tag is observed can be used to quantitate the amount of a particular transcript. The original technique was described by Velculescu et al. (1995) and utilized an ~14bp sequence tag. A modified protocol was introduced by Saha et al. (2002) that produced ~21bp tags.

    PURPOSE

    This module facilitates the processing of SAGE data. Specifically:

    1. extracting ditags from raw sequence reads.
    2. extracting tags from ditags, with the option to
    exclude tags if the Phred scores (described by
    Ewing and Green, 1998a and Ewing et al., 1998b)
    do not meet a minimum cutoff value.
    3. calculating descriptive values
    4. statistical analysis to determine, where possible,
    additional nucleotides to extend the length of the
    SAGE tag (thus facilitating more accurate tag to
    gene mapping).

    Both regular SAGE (14mer tag) and LongSAGE (21mer tag) are supported by this module. Future protocols should be configurable with this module.


    Product's homepage

    Requirements:

    · Perl

      


    TAGS:

    gene analysis | SAGE analysis | Perl module | Bio | gene | expression

    Go to top

    WindowsGamesDriversMacLinuxScriptsMobileHandheldNews

    SUBMIT PROGRAM   |   ADVERTISE   |   GET HELP   |   SEND US FEEDBACK   |   RSS FEEDS   |   UPDATE YOUR SOFTWARE   |   ROMANIAN FORUM