Softpedia
 


    ccextractor 0.60


    Downloads: 1,968
    User Rating: Good (3.7/5)
    Rated by: 22 user(s)
    Developer: Carlos Fernandez
    License / Price: GPL / FREE
    Last Updated: January 25th, 2012, 07:20 GMT
    Category: ROOT / Multimedia / Video


    ccextractor description

    A fast closed captions extractor for MPEG files.


    ccextractor is mostly a mildly optimized C port of McPoodle's excellent but painfully slow Perl script SCC_RIP. It lets you rip the raw closed caption data (read: subtitles) from a number of sources, such as DVD or ReplayTV.

    As an added bonus compared to the original SCC_RIP, ccextractor can extract subtitles from the HDTV transport streams that are becoming more common.

    At this point ccextractor extracts the line 21 captions (which must legally be present for a number of years until the transition to digital is complete). Note that most .ts files you can find carry subtitle data for both analog (EIA-608) and digital (EIA-708) decoders. AFAIK there are no freely available EIA-708 rippers.
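As background on what the EIA-608 (line 21) data looks like: it travels as byte pairs, and each byte holds seven data bits plus an odd-parity bit in the top position. The Python sketch below checks and strips that parity bit. The function names are hypothetical, the byte pair is only sample input, and nothing here is taken from ccextractor's own source.

```python
def has_odd_parity(byte: int) -> bool:
    """Return True if the 8-bit value contains an odd number of 1 bits."""
    return bin(byte & 0xFF).count("1") % 2 == 1


def strip_parity(byte: int) -> int:
    """Drop the top (parity) bit and keep the seven data bits."""
    return byte & 0x7F


# Sample pair chosen for illustration only.
pair = (0x94, 0x20)
print([has_odd_parity(b) for b in pair])
print([hex(strip_parity(b)) for b in pair])
```

A byte that fails the parity check is usually best treated as corrupt rather than decoded.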

    Anyway, since line 21 captions will be available for some time, we have time to build a decent 708 ripper.

    Basic Usage:

    For details on CC, please go to McPoodle's page:

    http://www.geocities.com/mcpoodle43/SCC_TOOLS/DOCS/SCC_TOOLS.HTML

    You will need his tools to use ccextractor's output.

    The basic idea is that you get the raw closed caption dump from ccextractor.

    Then you need other tools (which vary depending on what you want to do) to continue processing.

    To get a .srt transcript from a .ts file (I assume this will be the most common use), do this:

    ccextractor -12 input_file

    -12 means "extract both subtitle tracks" (the technical name is fields, but tracks is easier to understand). 1 is almost always English. 2 is Spanish on HBO (at least in the few samples I've seen) but could be anything. Just extract both of them and check.

    Example: ccextractor -12 house315.ts

    ccextractor will create two files, called house315_1.bin and house315_2.bin.
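The "extract both and check" advice can be sketched in Python as a quick look at which of the two output files actually contain data. The helper name `nonempty_fields` is hypothetical, and it assumes the `<name>_1.bin` / `<name>_2.bin` naming shown above. A non-empty file does not guarantee useful captions, so this is only a first filter.

```python
from pathlib import Path


def nonempty_fields(input_file: str, directory: str = ".") -> list[int]:
    """Return the field numbers (1 and/or 2) whose .bin file exists and is not empty."""
    stem = Path(input_file).stem  # "house315.ts" -> "house315"
    found = []
    for field in (1, 2):
        candidate = Path(directory) / f"{stem}_{field}.bin"
        if candidate.is_file() and candidate.stat().st_size > 0:
            found.append(field)
    return found


# Lists the fields worth passing on to the next tool (empty if nothing was extracted yet).
print(nonempty_fields("house315.ts"))
```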

    Then use McPoodle's RAW2SCC to create a temporary SCC file (SCC stands for Scenarist Closed Caption, originally the native format of an authoring program; the details are not important here).

    raw2scc house315_1.bin

    This creates house315_1.scc
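The two steps so far can be collected in a small Python helper. This is a sketch that only builds the command lines exactly as written above and does not run them; the function name `rip_commands` is hypothetical, and it assumes both tools are invoked from the directory holding the files.

```python
from pathlib import Path


def rip_commands(input_file: str) -> list[list[str]]:
    """Build the ccextractor call and one raw2scc call per extracted field."""
    stem = Path(input_file).stem
    commands = [["ccextractor", "-12", input_file]]
    for field in (1, 2):
        commands.append(["raw2scc", f"{stem}_{field}.bin"])
    return commands


for argv in rip_commands("house315.ts"):
    print(" ".join(argv))
```

Keeping each command as a list makes it straightforward to hand to `subprocess.run` later without worrying about shell quoting.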

    From this .scc file, you can get the final .srt by using McPoodle's CCASDI:

    ccasdi -s house315_1.srt

    The resulting .srt file looks like this (just three sample entries shown):

    514
    00:24:07,400 --> 00:24:09,300
    They've got another trial
    going on at Duke.

    515
    00:24:09,367 --> 00:24:12,567
    15% extend their lives
    beyond five years.

    516
    00:24:12,634 --> 00:24:13,701
    If you're positive
    for protein PHF--
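To work with the timing lines in such a file, a minimal Python sketch can parse the `HH:MM:SS,mmm --> HH:MM:SS,mmm` pattern seen in the sample entries. The function name `parse_timing` is hypothetical, and the pattern is taken only from the entries shown above, not from a full SRT specification.

```python
import re
from datetime import timedelta

# Matches timing lines such as "00:24:07,400 --> 00:24:09,300".
TIMING = re.compile(
    r"(\d{2}):(\d{2}):(\d{2}),(\d{3}) --> (\d{2}):(\d{2}):(\d{2}),(\d{3})"
)


def parse_timing(line: str) -> tuple[timedelta, timedelta]:
    """Return (start, end) for one timing line, or raise ValueError."""
    match = TIMING.fullmatch(line.strip())
    if match is None:
        raise ValueError(f"not an SRT timing line: {line!r}")
    h1, m1, s1, ms1, h2, m2, s2, ms2 = (int(g) for g in match.groups())
    start = timedelta(hours=h1, minutes=m1, seconds=s1, milliseconds=ms1)
    end = timedelta(hours=h2, minutes=m2, seconds=s2, milliseconds=ms2)
    return start, end


# Entry 514 from the sample: how long the caption stays on screen, in seconds.
start, end = parse_timing("00:24:07,400 --> 00:24:09,300")
print((end - start).total_seconds())
```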



    What's New in This Release:

    · MP4 support has been added.
    · Fixed: the Windows version was writing text files with a double \r.
    · Fixed: closed caption blocks with no data could cause a crash.
    · Fixed: -noru (to generate files without duplicate lines in roll-up) was broken, with complete lines going missing.
    · Fixed: the bin format was not working as input.
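For text files written by an earlier Windows build, the double \r can be cleaned up after the fact. The Python sketch below assumes the damage shows up as the byte sequence CR CR LF at each line end, which is a guess about the symptom rather than something the changelog states; the function name `collapse_double_cr` is hypothetical.

```python
def collapse_double_cr(data: bytes) -> bytes:
    """Replace each CR CR LF sequence with a single CR LF."""
    return data.replace(b"\r\r\n", b"\r\n")


# Sample input built by hand for illustration.
broken = b"514\r\r\n00:24:07,400 --> 00:24:09,300\r\r\n"
print(collapse_double_cr(broken))
```

Working on bytes rather than decoded text avoids any further newline translation while the file is being repaired.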

      

