Advent of Code is back! Unwrap daily challenges to sharpen your Alteryx skills and earn badges along the way! Learn more now.

Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.
SOLVED

Parsing PDFs - concatenate indefinite number of multiple rows containing strings

Rachael_OS
8 - Asteroid

Hi folks,

 

I'm parsing a PDF in Alteryx and trying to put the data into a more usable format.

 

An issue I've run up against is that there is a description coming in over a few lines, which I would like to concatenate into one line. The first row contains a value which also matches a value in the last row of the description. Is there a multi-row formula I can use for this? I've had a go at a couple but not having much luck! 

 

Thanks in advance

3 REPLIES 3
DavidP
17 - Castor
17 - Castor

Could you post an example of the data as extracted from the PDF?

danilang
19 - Altair
19 - Altair

Hi @Rachael_OS 

 

If your input looks like this 

 

i.png

 

You can use a multi-row tool to fill in the missing marker values and then use a Summarize tool to concatenate the values 

 

w.png

If your input has the marker on all the rows, you can get rid of the Multi-row tool.  Either way, your output looks like this

 

r.png

 

Dan

 

 

Rachael_OS
8 - Asteroid

Worked like a charm - thanks a mill! 

Labels