Hello
I would like some help to attack this need of finding duplicated rows that fall in a range of time.
For example, the dataset attached has 7 columns and each row is a transaction. I want to consider in the analysis rows that have the same IP, product_id and a transactio_date that falls in a range of +30s or -30s from the transaction_date.
So in the example below:
transaction_date | product_id | ip |
5/28/22 06:09:00 | EZSHARE-182252029-112 | 190.30.98.190 |
If there is the same combination of product_id and ip and the transaction_date is > 5/28/22 06:08:30 and transaction_date < 5/28/22 06:09:30
Is this possible on Alteryx?
Rodrigo
Hi @calloni
please see if the attached workflow is doing what you need to do.
1) The append tool is creating a cross join of your dataset vs your data set
2)The filter tool is keeping just the rows that match your requirements of combination. But with duplicates.
3)After that, i did some tricks to remove the duplicated combinations.
If the provided answer helped you to solve the problem/was correct, please accept it as a solution :)
Thanks.
Thanks!
I tried to plug my full dataset of 6M rows and it is still running today :)
I think I found an alternative way via Tableau. But thanks a lot for help!
Rodrigo