This is tricky for me.
I need to fill down a set of values. There is only 1 value per [File]
How can I get it?
I tried summary tool and then sort down however I am unable to get it absolutely correctly.
Current example dataframe:
RecordID | File | RegExOut1 |
14391 | 1.pdf | |
14392 | 1.pdf | 574252 |
14393 | 1.pdf | |
14394 | 1.pdf | |
14395 | 1.pdf | |
14396 | 1.pdf | |
14397 | 1.pdf | |
14398 | 1.pdf | 574252 |
14399 | 1.pdf | |
14400 | 1.pdf | |
14401 | 2.pdf | |
14402 | 2.pdf | |
14403 | 2.pdf | |
14404 | 2.pdf | |
14405 | 2.pdf | 574578 |
Expected dataframe/output:
RecordID | File | RegExOut1 |
14391 | 1.pdf | |
14392 | 1.pdf | 574252 |
14393 | 1.pdf | 574252 |
14394 | 1.pdf | 574252 |
14395 | 1.pdf | 574252 |
14396 | 1.pdf | 574252 |
14397 | 1.pdf | 574252 |
14398 | 1.pdf | 574252 |
14399 | 1.pdf | 574252 |
14400 | 1.pdf | 574252 |
14401 | 2.pdf | 574578 |
14402 | 2.pdf | 574578 |
14403 | 2.pdf | 574578 |
14404 | 2.pdf | 574578 |
14405 | 2.pdf | 574578 |
How can I create a workflow to get here?
Solved! Go to Solution.
@HW1
Why the value for "14391" is empty? 😁
That's cause after parsing the PDF, the value appears at the second row in "this example".
Its not necessary that the first row has to be blank.. it can be filled.
However, I figured out (discovered 😁) that sort can be by multiple columns hence grouping and then filling down.
@HW1
I hope my interpretion is correct.
@HW1
Thank you for the accept mark, even though you have figured out by yourself. 😁