Tabular data as published on the web is often not well formatted and structured. Messytables tries to use heuristical approaches to detect and fix errors in the data. Typical examples include:
- Guessing the type of columns in CSV data.
- Finding the header of a table when there are explanations and text fragments in the first few rows of the table.
messytables is a Python library that provides data structures and some heuristics to fix these problems and read a wide number of different tabular abominations.
See the full documentation at: http://messytables.readthedocs.org
Product's homepage
Requirements:
· Python