ccextractor is an open source, very fast closed captions extractor for MPEG files.
ccextractor lets you rip the raw closed captions (read: subtitles) data from a number of sources, such as DVDs or ReplayTV recordings.
What's New in This Release:
· Changed Windows GUI size (larger).
· Added Teletext options to GUI.
· Added -teletext to force teletext mode even if not detected.
· Added -noteletext to disable teletext detection. This can be needed for streams that carry both 608 data and teletext packets when you need to process the 608 data (otherwise, if teletext is detected, it takes precedence).
· Added -datapid to force a specific elementary stream to be used for data (bypassing detections).
· Added -ru2 and -ru3 to limit the number of visible lines in roll-up captions (bypassing whatever the broadcast says).
· Added support for a .hex (hexadecimal) dump of data.
· Added support for wtv in Windows. This is done by using a new program (wtvccdump.exe) and a new DirectShow filter (CCExtractorDump.dll) that process the .wtv using DirectShow's filters and export the line 21 data to a .hex file. The GUI calls wtvccdump.exe as needed.
· Added --nogoptime to force PTS timing even when CCExtractor would use GOP timing otherwise.
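As a rough illustration of how the new command-line options above might be combined, here is a minimal Python sketch that only assembles an argument list. The flag names come from the changelog. Everything else is an assumption made for illustration: the helper name build_ccextractor_args, the executable name "ccextractor", that -datapid is followed by the PID value, that -teletext and -noteletext are not meant to be used together, and that the input file goes last.

```python
def build_ccextractor_args(input_file, teletext=None, datapid=None,
                           rollup_lines=None, force_pts=False,
                           executable="ccextractor"):
    """Assemble a ccextractor argument list (hypothetical helper).

    teletext: True adds -teletext, False adds -noteletext, None adds neither.
    datapid: PID of the elementary stream to use for data (assumed to be
             passed as a separate argument after -datapid).
    rollup_lines: 2 or 3, mapped to -ru2 / -ru3.
    force_pts: True adds --nogoptime.
    """
    args = [executable]

    # Force or disable teletext mode; treated here as mutually exclusive.
    if teletext is True:
        args.append("-teletext")
    elif teletext is False:
        args.append("-noteletext")

    # Pin the data stream instead of relying on detection.
    if datapid is not None:
        args += ["-datapid", str(datapid)]

    # Limit the number of visible roll-up lines.
    if rollup_lines is not None:
        if rollup_lines not in (2, 3):
            raise ValueError("rollup_lines must be 2 or 3")
        args.append(f"-ru{rollup_lines}")

    # Use PTS timing even where GOP timing would otherwise be chosen.
    if force_pts:
        args.append("--nogoptime")

    # Assumption: the input file is given after the options.
    args.append(input_file)
    return args
```

For example, a stream that has both 608 data and teletext packets, where the 608 data is wanted, could be handled with build_ccextractor_args("recording.ts", teletext=False, rollup_lines=2). The resulting list can be passed to subprocess.run once the exact syntax has been checked against the program's own help output.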