Data Drift
Data drift occurs when the characteristics of the data encountered by a model in production deviate significantly from those of the dataset on which the model was trained.
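To make the definition concrete, here is a minimal sketch of one common way to quantify such a deviation for a single numeric feature: the two-sample Kolmogorov-Smirnov statistic, compared against a threshold. It is a generic illustration in plain Python, not AryaXAI's implementation. The function names, the sample values, and the 0.3 threshold are all assumptions chosen for the example.

```python
from bisect import bisect_right


def ks_statistic(baseline, current):
    """Two-sample Kolmogorov-Smirnov statistic: the largest gap between the
    empirical CDFs of the two samples (0 = identical, 1 = fully separated)."""
    if not baseline or not current:
        raise ValueError("both samples must be non-empty")
    base_sorted = sorted(baseline)
    curr_sorted = sorted(current)
    max_gap = 0.0
    # Both empirical CDFs are step functions that only change at observed
    # values, so evaluating the gap at every observed value is sufficient.
    for x in base_sorted + curr_sorted:
        cdf_base = bisect_right(base_sorted, x) / len(base_sorted)
        cdf_curr = bisect_right(curr_sorted, x) / len(curr_sorted)
        max_gap = max(max_gap, abs(cdf_base - cdf_curr))
    return max_gap


def has_drifted(baseline, current, threshold=0.3):
    """Flag drift when the KS statistic exceeds an (assumed) threshold."""
    return ks_statistic(baseline, current) > threshold


# Illustrative values only: a feature as seen in training vs. in production.
baseline_ages = [23, 25, 31, 35, 38, 41, 44, 52]
current_ages = [41, 44, 47, 52, 55, 58, 61, 66]

print(ks_statistic(baseline_ages, current_ages))
print(has_drifted(baseline_ages, current_ages))
```

In practice the threshold depends on the test and on sample size, which is why a monitoring setup usually lets you choose both the test and its threshold per dashboard.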
Basic concepts:
- Baseline: Users can define the baseline either by ‘Tag’ or as a segment of the data selected by ‘date’.
- Frequency: Users can define how frequently the monitoring metrics are calculated.
- Alerts frequency: Users can configure how frequently they are notified about alerts.
Get all dashboards created:
To retrieve any past dashboard, use the following function:
Set up a data drift dashboard in AryaXAI:
Users can set up Data Drift monitoring and diagnosis with the AryaXAI Python SDK. Fetching the default dashboard requires no additional payload; creating a new one requires passing the following parameters:
You can also use the help function to get all parameters and payloads:
To create a data drift dashboard, the config file must define the 'Baseline' tag, the 'Current Tag', the statistical test used to calculate the drift, the threshold for that test, and the features to run the drift test on. You can also restrict each tag to a date range over which the drift is calculated.
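The fields described above can be pictured as a small dictionary. The sketch below is illustrative only: every key name, the example values, and the `validate_drift_config` helper are hypothetical and are not taken from the AryaXAI SDK, so check the SDK's help output for the actual payload keys before using anything like this.

```python
from datetime import date

# Hypothetical key names, for illustration only (not the AryaXAI payload schema).
REQUIRED_KEYS = {"baseline_tag", "current_tag", "stat_test", "threshold", "features"}


def validate_drift_config(config):
    """Return a list of problems found in a drift config (empty list = no problems)."""
    problems = []
    missing = REQUIRED_KEYS - config.keys()
    if missing:
        problems.append("missing keys: " + ", ".join(sorted(missing)))
    # Assumption: a usable threshold is a positive number.
    threshold = config.get("threshold")
    if threshold is not None and threshold <= 0:
        problems.append("threshold must be positive")
    if "features" in config and not config["features"]:
        problems.append("features must list at least one column")
    # Optional date ranges: each is a (start, end) pair with start <= end.
    for key in ("baseline_dates", "current_dates"):
        if key in config:
            start, end = config[key]
            if start > end:
                problems.append(key + ": start date is after end date")
    return problems


example_config = {
    "baseline_tag": "training",
    "current_tag": "production",
    "stat_test": "ks",                 # placeholder test name
    "threshold": 0.05,
    "features": ["age", "income"],     # placeholder feature names
    "baseline_dates": (date(2024, 1, 1), date(2024, 3, 31)),
    "current_dates": (date(2024, 4, 1), date(2024, 4, 30)),
}

print(validate_drift_config(example_config))
```

Checking the config locally before submitting it can catch a missing tag or an inverted date range without waiting for a drift job to fail.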
Viewing the Data Drift report between Tags:
Drift Metrics:
AryaXAI offers various statistical tests to analyze data drift.
Available Statistical tests:
Compute selection
In addition to the configuration file, you need to specify the compute option where the drift analysis should be performed. You must also decide whether to run the drift analysis in the background or to run it interactively and view the results immediately. If you choose to run it in the background, the cell will initiate the drift analysis, and you can retrieve the results from the logs later.