Data profiling is the process of examining the data available in existing data sources (e.g. databases, applications, files, etc.) and collecting statistics and information about this data. Data profiling enables the assessment of the quality level of the data contained in the information system, according to a defined set of metrics and goals.
Talend Open Profiler is a sophisticated, yet simple-to-use data profiling tool that defines the content, structure, and quality of highly complex data structures. It allows business users and data management staff to perform a large variety of analyses using a set of indicators, patterns and rules for each data element being analyzed or monitored. It analyzes data on an ongoing basis, and analyzes changes to source data over time to help improve data quality.
These indicators can range from simple or advanced statistics to text string analysis, including summary data and statistical distributions of records. The patterns are preset or customized expressions that define the expected form of data analyzed and the data quality rules help define custom business thresholds and value ranges.
Talend Open Profiler produces sophisticated reports and graphs that let users gauge at a glance the quality of the data, and the status of the predefined indicators. In addition an embedded data explorer allows users to directly drill down into the tables of the analyzed databases.
The Talend Open Profiler application connects to databases and files to introspect their structures and stores the description of their metadata in its Metadata Repository. The metadata is then used by data analysts to set up metrics and indicators.
· Java 2 Standard Edition Runtime Environment
What's New in This Release: [ read full changelog ]
· Numerous improvements, new features, and bugfixes.