abacusai.eda_data_consistency
Classes
Data Consistency for duplication within data |
|
Eda Data Consistency, contained the duplicates in the base version, Comparison version, Deletions between the base and comparison and feature transformations between the base and comparison data. |
Module Contents
- class abacusai.eda_data_consistency.DataConsistencyDuplication(client, totalCount=None, numDuplicates=None, sample={})
Bases:
abacusai.return_class.AbstractApiClassData Consistency for duplication within data
- Parameters:
client (ApiClient) – An authenticated API Client instance
totalCount (int) – Total count of rows in data.
numDuplicates (int) – Number of Duplicates based on primary keys in data.
sample (FeatureRecord) – A list of dicts enumerating rows the rows that contained duplications in primary keys.
- __repr__()
Return repr(self).
- class abacusai.eda_data_consistency.AbstractApiClass(client, id)
- __eq__(other)
Return self==value.
- _get_attribute_as_dict(attribute)
- class abacusai.eda_data_consistency.EdaDataConsistency(client, columnNames=None, primaryKeys=None, transformationColumnNames=None, baseDuplicates={}, compareDuplicates={}, deletions={}, transformations={})
Bases:
abacusai.return_class.AbstractApiClassEda Data Consistency, contained the duplicates in the base version, Comparison version, Deletions between the base and comparison and feature transformations between the base and comparison data.
- Parameters:
client (ApiClient) – An authenticated API Client instance
columnNames (list) – Name of all the features in the data
primaryKeys (list) – Name of the primary keys in the data
transformationColumnNames (list) – Name of all the features that are not the primary keys
baseDuplicates (DataConsistencyDuplication) – A DataConsistencyDuplication describing the number of duplicates within the data
compareDuplicates (DataConsistencyDuplication) – A DataConsistencyDuplication describing the number of duplicates within the data
deletions (DataConsistencyDuplication) – A DataConsistencyDeletion describing the number of deletion between two versions in the data
transformations (DataConsistencyTransformation) – A DataConsistencyTransformation the number of changes that occured per feature in the data
- __repr__()
Return repr(self).