Do you use Alteryx in a language other than English? If so, we want to hear from you! Please help us improve the international experience of our products by participating in this 5 minute survey.

We are updating the requirements for Community registration. As of 7/21/21 all users will be required to register a phone number with their My Alteryx accounts. If you have already registered, you will be prompted on your next login to add your phone number.
alteryx Community

Alteryx Designer Discussions

SOLVED

Finding the Consecutive 3 or more zeros and Changing less than Consecutive 3 Zeros to 1

6 - Meteoroid

Hi,

Trust you all are well

I have a large dataset. That is with rows when transposed, have 26,874,045. A sample data is attached.

When we include Append option, the same is taking a lot of time. The workflow didn't even complete after 15 Hours.

The data have:

1. Brands

2. Selling Points

3. Date: from 01-10-2018 to 01-12-2020

4. Values: 1 - They ordered. 0- They Didn't ordered

Condition to take into Consideration:

To Calculate Lost Customer Rate per Month: Count of Lost Flag in that Particular Month / Count of Active Sell Points in the Previous Month

Explanation

1. Group by Brand then Group  by Selling Points.

2. After 1, if 3 or more consecutive zeros come it is considered as 1 Lost flag. The month at which the pattern of consecutive 3 or more zero started is the first Lost Month and only that month is taken for Lost calculation. So in the below example:

01/06/2020 is one Lost Flag and 01/02/2019 is another Lost Flag for the Same Brand (A2) in sell point (8). So in this example only 2 Lost Flag. We  are not taking 01/07/2020, 01/07/2020, for Lost Calculation but they are still inactive Sell Point.

3. If there are less than 3 consecutive zeros, it should be considered as sales active months.

For Example: Here the two zero months should be considered as active month for Sell Point:

4. Calculate Count of Active Sell Points in the Previous Month: If we are calculating for April, we should get the total count of One in March (Please refer the first screenshot.

Once this flags are achieved we can ignore the group by Sell Point and bring in Group By Date, so that we can calculate it for particular month.

Sample data is attached.

Is it possible to create a workflow without using the Append Function?

26 REPLIES 26
18 - Pollux

Intresting usecase. Here is how you can do it.

Workflow:

1. Using multirow formula to get count for consecutive 0's

2. Using 2nd multirow formula to check if the consecutive 0's >= 3times and flag it.

EDIT: @varun86vgopal  i have added the summarize by date part. I had missed it.

3. Using summarize tool groupby date and sum of flag.

Hope this helps 🙂

Alteryx

Yes this is indeed possible with a simple Multi-Row Formula.

With the multi-row formula, you look at the previous row and the current row. The logic is as follows:

- If the previous row value is 1 and the current is 0, then the flag will be set to 1

- If the previous row value is 0 and the current is 0, then the flag will increment (i.e. it counts how many consecutive 0 values there are)

- If neither of the above are true, the flag is reset to 0

Finally, using a formula tool, set the Active Flag to 1 if the counter is less than 3 (i.e. more than 2 consecutive rows will be flagged as inactive).

Please let me know if this sorts the issue!

Ben

6 - Meteoroid

Thanks a ton @bensilv

6 - Meteoroid

Thanks a ton @atcodedog05

Alteryx

No problem @varun86vgopal!

If I am understanding correctly, you're looking to essentially apply this logic but group it by Brands and Sell Point?

e.g. in the below image, the counter should restart when sell point goes from 14 to 20, therefore sell point 20 should be active? Some clarification would be great, if A9 20 is 0 for 01/10/2018, 01/11/2018 would they be classes as active or not?

18 - Pollux

Groupby is already been taken into consideration refer the multi-row formula tool.

Please check and let me know

Alteryx

If my understanding is correct, that has indeed been covered by @atcodedog05

If we have solved the problem, please remember to mark as solved 🙂

6 - Meteoroid

Hi @atcodedog05,

Only the zeros after the one should be taken into consideration. So below, it shouldn't count the first zeros from A1 21 since the first One didn't happen. So this is the reason, I was using two different flows and append the data but the workflow is taking more than 18 to 20 hours.

Thanks a lot for your help.

6 - Meteoroid

Hi @bensilv

Only the zeros after the one should be taken into consideration. So below, it shouldn't count the first zeros from A1 21 since the first One didn't happen.

Thanks a lot for your help.

Labels