Be sure to review our Idea Submission Guidelines for more information!
Submission GuidelinesThe tokenize would be more powerful if in addition to Drop Extra with Warning / Without Warning / Error, you could opt to have extra tokens concatenated with the final column.
Example: I have a values in a column like these:
3yd-A2SELL-407471
3vd-AAABORMI-3238738
3vd-RMLSFL-RX-10326049
In all 3 cases, I want to split to 3 columns (key, mlsid, mlsnumber), though I only care about the last two. But in the third example, the mlsnumber RX-10326049 actually contains a hyphen. (Yes, the source for this data picked a very bad delimiter for a concatenated value).
I can parse this a lot of different ways - here's how I do it in SQL:
MlsId = substr(substr(listingkey, instr(listingkey, '-')+1), 1, instr(substr(listingkey, instr(listingkey, '-')+1), '-')-1)
MlsNumber = substr(substr(listingkey, instr(listingkey, '-')+1), instr(substr(listingkey, instr(listingkey, '-')+1), '-')+1);
With Regex tokenize, I can split to 4 or more columns and then with a formula test for a 4th+ column and re-concatenate. BUT it would be awesome if in the Regex tokenize I could instead:
1. split to columns
2. # of columns 3
3. extra columns = ignore, add to final column
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.