Organize Expectations into an Expectation Suite
An Expectation Suite contains a group of Expectations that describe the same set of data. Combining all the Expectations that you apply to a given set of data into an Expectation Suite allows you to evaluate them as a group, rather than individually. All of the Expectations that you use to validate your data in production workflows should be grouped into Expectation Suites.
Prerequisites
- Python version 3.8 to 3.11.
- An installation of GX 1.0.
- Recommended. A preconfigured Data Context.
- Recommended. A preconfigured Data Source and Data Asset connected to your data.
- Procedure
- Sample code
-
Retrieve or create a Data Context.
In this procedure, your Data Context is stored in the variable
context
. For information on creating or connecting to a Data Context see Create a Data Context. -
Create an Expectation Suite.
To create a new Expectation Suite you first need to import the
ExpectationSuite
class:Python inputfrom great_expectations.core.expectation_suite import ExpectationSuite
Next, you will provide a descriptive name and instantiate the
ExpectationSuite
class. In the following code update the variablesuite_name
with a a name relevant to your data. Then execute the code:Python inputsuite_name = "my_expectation_suite"
suite = ExpectationSuite(name=suite_name) -
Add the Expectation Suite to your Data Context
Once you have finalized the contents of your Expectation Suite you should save it to your Data Context:
Python inputsuite = context.suites.add(suite)
With a File or GX Cloud Data Context your saved Expectation Suite will be available between Python sessions. You can retrieve your Expectation Suite from your Data Context with the following code:
Python inputexisting_suite_name = "my_expectation_suite" # replace this with the name of your Expectation Suite
suite = context.suites.get(name=existing_suite_name) -
Create an Expectation.
In this procedure, your Expectation is stored in the variable
expectation
. For information on creating an Expectation see Create an Expectation. -
Add the Expectation to the Expectation Suite.
An Expectation Suite's
add_expectation(...)
method takes in an instance of an Expectation and adds it to the Expectation Suite's configuraton:Python inputsuite.add_expectation(expectation)
If you have a configured Data Source, Data Asset, and Batch Definition you can test your Expectation before adding it to your Expectation Suite. To do this see Test an Expectation.
However, if you test and modify an Expectation after you have added it to your Expectation Suite you must explicitly save those modifications before they will be pushed to the Expectation Suite's configuration:
Python inputexpectation.column = "pickup_location_id"
expectation.save()Because the
save()
method of a modified Expectation updates its Expectation Suite's configuration, thesave()
method will only function if the Expectation Suite has been added to your Data Context. -
Continue to create and add additional Expectations
Repeat the process of creating, testing, and adding Expectations to your Expectation Suite until the Expectation Suite adequately describes your data's ideal state.
import great_expectations as gx
from great_expectations.core.expectation_suite import ExpectationSuite
import great_expectations.expectations as gxe
context = gx.get_context()
expectation = gxe.ExpectColumnValuesToNotBeNull(column="passenger_count")
# Create an Expectation Suite
suite_name = "my_expectation_suite"
suite = ExpectationSuite(name=suite_name)
# Add the Expectation Suite to the Data Context
suite = context.suites.add(suite)
# Add a previously created Expectation to the Expectation Suite
suite.add_expectation(expectation)
# Add an Expectation to the Expectation Suite when it is created
suite.add_expectation(gxe.ExpectColumnValuesToNotBeNull(column="pickup_datetime"))
# Update the configuration of an Expectation, then push the changes to the Expectation Suite
expectation.column = "pickup_location_id"
expectation.save()
# Retrieve an Expectation Suite from the Data Context
existing_suite_name = "my_expectation_suite" # replace this with the name of your Expectation Suite
suite = context.suites.get(name=existing_suite_name)