Package org.apache.calcite.profile
Interface Profiler
All Known Implementing Classes: ProfilerImpl, SimpleProfiler
public interface Profiler
Analyzes data sets.
Nested Class Summary
Modifier and Type: Description
static class: Column.
static class: Value distribution, including cardinality and optionally values, of a column or set of columns.
static class: Functional dependency.
static class: The result of profiling, contains various statistics about the data in a table.
static class: Whole data set.
static interface: Statistic produced by the profiler.
static class: Unique key.
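To make the kinds of statistics listed above concrete, here is a minimal brute-force sketch that uses only the Java standard library. It is not Calcite's implementation: the class ProfileStatsSketch and all of its methods are hypothetical names chosen for illustration, and columns are identified by plain int ordinals.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Objects;
import java.util.Set;

/** Hypothetical helper, not part of Calcite: brute-force versions of the statistics above. */
class ProfileStatsSketch {

  /** Convenience factory for one row of values. */
  static List<Comparable> row(Comparable... values) {
    return Arrays.asList(values);
  }

  /** Projects a row onto the given column ordinals. */
  static List<Comparable> project(List<Comparable> row, int... ordinals) {
    List<Comparable> key = new ArrayList<>();
    for (int i : ordinals) {
      key.add(row.get(i));
    }
    return key;
  }

  /** Size of the whole data set. */
  static int rowCount(Iterable<List<Comparable>> rows) {
    int n = 0;
    for (List<Comparable> ignored : rows) {
      n++;
    }
    return n;
  }

  /** Cardinality: number of distinct value combinations in a set of columns. */
  static int cardinality(Iterable<List<Comparable>> rows, int... ordinals) {
    Set<List<Comparable>> distinct = new HashSet<>();
    for (List<Comparable> row : rows) {
      distinct.add(project(row, ordinals));
    }
    return distinct.size();
  }

  /** A set of columns is a unique key if no combination of values repeats. */
  static boolean isUnique(Iterable<List<Comparable>> rows, int... ordinals) {
    return cardinality(rows, ordinals) == rowCount(rows);
  }

  /** Functional dependency: each determinant value maps to exactly one dependent value. */
  static boolean determines(Iterable<List<Comparable>> rows, int[] determinant, int dependent) {
    Map<List<Comparable>, Comparable> seen = new HashMap<>();
    for (List<Comparable> row : rows) {
      List<Comparable> key = project(row, determinant);
      Comparable value = row.get(dependent);
      if (seen.containsKey(key)) {
        if (!Objects.equals(seen.get(key), value)) {
          return false;
        }
      } else {
        seen.put(key, value);
      }
    }
    return true;
  }

  public static void main(String[] args) {
    // Columns: 0 = DEPTNO, 1 = DNAME, 2 = EMPNO (made-up sample data).
    List<List<Comparable>> rows = Arrays.asList(
        row(10, "SALES", 1),
        row(10, "SALES", 2),
        row(20, "OPS", 3));
    System.out.println("rowCount = " + rowCount(rows));
    System.out.println("cardinality(DEPTNO) = " + cardinality(rows, 0));
    System.out.println("EMPNO unique? " + isUnique(rows, 2));
    System.out.println("DEPTNO -> DNAME? " + determines(rows, new int[] {0}, 1));
  }
}
```

Each statistic here costs a full pass over the rows, which is acceptable for a sketch but would not scale to many column combinations.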
Method Summary
Modifier and Type: Profiler.Profile
Method: profile(Iterable<List<Comparable>> rows, List<Profiler.Column> columns, Collection<ImmutableBitSet> initialGroups)
Description: Creates a profile of a data set.
Method Details
profile
Profiler.Profile profile(Iterable<List<Comparable>> rows, List<Profiler.Column> columns, Collection<ImmutableBitSet> initialGroups)
Creates a profile of a data set.
Parameters:
rows - List of rows. Can be iterated over more than once (maybe not cheaply)
columns - Column definitions
initialGroups - List of combinations of columns that should be profiled early, because they may be interesting
Returns:
A profile describing relationships within the data set
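The parameter contract above (rows that can be iterated more than once, and column combinations that are profiled early) can be illustrated with a small stand-alone sketch. Everything in it is an assumption made for illustration: MultiPassSketch is a hypothetical class, java.util.BitSet stands in for ImmutableBitSet, column names stand in for Profiler.Column, and the one-pass-per-column-set strategy is simply the easiest one to read, not what any Calcite implementation does.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.BitSet;
import java.util.Collection;
import java.util.Collections;
import java.util.HashSet;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;

/** Hypothetical stand-in for a profiler; it only mirrors the shape of the three parameters. */
class MultiPassSketch {

  /** Number of times the row source has been asked for a fresh iterator. */
  static int passes = 0;

  /** Wraps a list so that every pass is counted; a real source might re-read a file instead. */
  static Iterable<List<Comparable>> countingRows(List<List<Comparable>> data) {
    return () -> {
      passes++;
      return data.iterator();
    };
  }

  /** Maps each column set to its cardinality, visiting initialGroups before single columns. */
  static Map<BitSet, Integer> profile(Iterable<List<Comparable>> rows,
      List<String> columns, Collection<BitSet> initialGroups) {
    // Caller-supplied groups go first, then every single column not already listed.
    List<BitSet> order = new ArrayList<>(initialGroups);
    for (int i = 0; i < columns.size(); i++) {
      BitSet single = new BitSet();
      single.set(i);
      if (!order.contains(single)) {
        order.add(single);
      }
    }
    Map<BitSet, Integer> result = new LinkedHashMap<>();
    for (BitSet group : order) {
      Set<List<Comparable>> distinct = new HashSet<>();
      for (List<Comparable> row : rows) { // one full pass over the rows per column set
        List<Comparable> key = new ArrayList<>();
        for (int i = group.nextSetBit(0); i >= 0; i = group.nextSetBit(i + 1)) {
          key.add(row.get(i));
        }
        distinct.add(key);
      }
      result.put(group, distinct.size());
    }
    return result;
  }

  public static void main(String[] args) {
    List<List<Comparable>> data = Arrays.asList(
        Arrays.<Comparable>asList(10, "SALES", 1),
        Arrays.<Comparable>asList(10, "SALES", 2),
        Arrays.<Comparable>asList(20, "OPS", 3));
    BitSet deptAndName = new BitSet();
    deptAndName.set(0);
    deptAndName.set(1);
    Map<BitSet, Integer> result = profile(countingRows(data),
        Arrays.asList("DEPTNO", "DNAME", "EMPNO"),
        Collections.singletonList(deptAndName));
    System.out.println(result + " after " + passes + " passes");
  }
}
```

Passing an Iterable rather than an Iterator is what makes the repeated passes possible: each for-each loop asks the source for a fresh iterator, which may be expensive if the source has to regenerate its rows.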