I am trying to parse paragraphs of text that appear in a larger HTML document.
- The target paragraphs are in embedded in the middle of an html document. See example below.
- The target paragraphs are always preceded by a record that only contains <br />.
- The target paragraphs always end in <br />
I want to extract the target paragraphs as rows. I assume I need a multi-row formula tool. But, I don't know how to write the expression and then parse.
| other html. |
| <br /> |
| Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. <br /> |
| <br /> |
| Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur. <br /> |
| <br /> |
| Excepteur sint occaecat cupidatat non proident, sunt in culpa qui officia deserunt mollit anim id est laborum. <br /> |
| other html |
| |