Data filtering is an operation on a dataset (such as a DataFrame) that defines which data to extract or remove from the dataset. For example, in a dataset containing all customers in the world, when creating training data, you might only want to read the training data for customers resident in a particular geographical region (e.g., North America, Japan,...), Here, you would define a filter on a region feature when creating training data as shown below.