Hi mentor,
I have a dataset that includes two columns of data. one is invoice_id, another one is product type. For one same invoice_id, it may include more than one product types. now, my task is to find out the product types that include a specific product called "AU pre-Registration" under the same invoice_id. The included product types with "AU pre-Registration" are not the same for different invoice_id.
here is the dataset looks like:

So which tool I can use to effectively sort out all results for big datasets? Thank you.