Hi Bernardo,
Based on the fact that the return data contains newline characters and a few other things, a RegEx script is probably your best bet. Try this in a
Formula Tool:REGEX_Replace([DownloadData], '.*?(?:<h1>)(.*?)(?:</td>
).*', '$1')
If you want to also keep the <H1> tag, try this in your Formula Tool expression:
REGEX_Replace([DownloadData], '.*?((?:<h1>
.*?)(?:</td>
).*', '$1')
Huge thanks to Garth Miles for his help with this!
Chad
Follow me on Twitter! @AlteryxChad