Hi community, hope everyone is doing well.
I am stuck on a formatting and data extraction flow. The usual input is an Outlook email, but to illustrate the issue I have attached mock-up data formatted the same way Alteryx reads it when connected through the Outlook tool.
The incoming data contains company names, machine numbers, depot ID, stock ID and so on, all combined, and not always in the same order (as you can see in the body column of the attached Excel dataset). Each ticket can contain one or more machine numbers, which need to be extracted along with the depot ID and the stock ID and arranged as columns so they can be used as input to another process.
The attached flow first trims the leading and trailing text that I do not need, and then continues with the tools that read the machine numbers and place them in columns. Only machine numbers with 6-digit codes are valid to extract. The 5-digit codes are not valid, and some tickets will not have a 6-digit code at all because they are a different type of ticket. The extracted numbers are needed to enrich a number of fields in a SharePoint list. My key data item is the stock ID, as this is the value I look up in the SharePoint list to match the record and enrich the required fields (this happens in another flow).
Part of this flow uses regex formulas and the RegEx tool, which I am not too familiar with. The issue is that the expression I use in the RegEx tool to read all available 6-digit machine numbers also picks up the invalid 5-digit codes, the stock ID, or even the depot ID, so it does not give an accurate extraction of the 6-digit machine numbers I need (they start with 19, as you can see in the attached Excel dataset). A ticket can contain a single 6-digit machine number or several (between 2 and 15), so the RegEx tool has to handle both cases and return every 6-digit machine number listed. I also tried the Tokenize option in the RegEx tool with the expression \b\d{6}\b, but in most cases the result is null, and only a few records return one or some of the 6-digit machine numbers.
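To make the pattern problem concrete outside of Alteryx, here is a minimal Python sketch. It assumes, as described above, that valid machine numbers are exactly 6 digits and start with 19. The sample text and the function name extract_machine_numbers are made up for illustration and are not taken from the attached dataset. One thing it shows is that \b only matches between a word character (letter, digit or underscore) and a non-word character, so a number glued to a letter or an underscore is skipped by \b\d{6}\b. Whether that is what happens in my data I cannot say for sure.

```python
import re

# Assumption: a valid machine number is exactly 6 digits and starts with "19".
# The lookarounds only require that the neighbours are not digits, so a number
# that touches a letter or an underscore is still found.
MACHINE_RE = re.compile(r"(?<!\d)19\d{4}(?!\d)")

# The expression tried in the post, for comparison.
WORD_BOUNDARY_RE = re.compile(r"\b\d{6}\b")


def extract_machine_numbers(body):
    """Return every 6-digit code starting with 19, in order of appearance."""
    return MACHINE_RE.findall(body)


# Hypothetical ticket text, not real data.
sample = "ACME Ltd machine no. 191234, 19567 / M#192345_depot 4410 stock id 7700123"

print(WORD_BOUNDARY_RE.findall(sample))  # ['191234']
print(extract_machine_numbers(sample))   # ['191234', '192345']
```

If a stock ID or depot ID can itself be a 6-digit number starting with 19, a digit pattern alone cannot tell it apart from a machine number, and the surrounding label text would have to be used as well.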
The goal is to read, format and extract all 6-digit machine numbers, whether one or many (up to 15 if possible), and to list the stock ID alongside them so I can find the SharePoint list record ID and update the required fields in the other flow (not attached). Any advice or suggestions to improve the flow and reach this goal are welcome. Thanks.
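For reference, the target layout I have in mind looks roughly like the Python sketch below: one row per ticket, with the stock ID first and up to 15 machine number columns. This is only an illustration under assumptions. The "Stock ID" label pattern, the column names machine_1 to machine_15 and the function name ticket_to_row are hypothetical, and the real label format in the emails may differ.

```python
import re

# Assumption: valid machine numbers are exactly 6 digits and start with "19".
MACHINE_RE = re.compile(r"(?<!\d)19\d{4}(?!\d)")

# Hypothetical label format: the words "stock id", optional punctuation or
# spaces, then the digits of the ID. Adjust to the real email layout.
STOCK_RE = re.compile(r"stock\s*id\D*(\d+)", re.IGNORECASE)

MAX_MACHINES = 15  # upper bound mentioned in the post


def ticket_to_row(body):
    """Build one output row: stock_id plus machine_1 .. machine_15."""
    stock = STOCK_RE.search(body)
    machines = MACHINE_RE.findall(body)[:MAX_MACHINES]
    row = {"stock_id": stock.group(1) if stock else None}
    for i in range(MAX_MACHINES):
        # Unused machine columns stay empty (None).
        row[f"machine_{i + 1}"] = machines[i] if i < len(machines) else None
    return row


# Hypothetical ticket text, not real data.
body = "Depot ID: 4410 | Stock ID: 7700123 | Machines: 191234, 19567, 192345"
row = ticket_to_row(body)
print(row["stock_id"], row["machine_1"], row["machine_2"], row["machine_3"])
```

Tickets without any valid machine number would still produce a row with the stock ID and empty machine columns, which could then be filtered out before the SharePoint update.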
Ananias