I need to extract the below text strings hightlighted in yellow from the attached excel file into the the Output format below. The text strings needed are embeded, surrounded by html language. The pattern is that the text string that ends in a "%" symbol is followed by the text string in brackets [ ]
I've tried a combination of Regex, and Text to rows but keep on getting stuck.
Input
Output
Dispay Field | Field |
Management Fee % | [Management Fee] |
Trustee Fee % | [TrusteeFee] |
Service Provider Fee % | [ServiceProviderFee] |
Other Expenses % | [OtherFee] |
Underlying Funds Fees % | [AcquiredFee] |
Gross Ratio% | [TotalExpRatio] |
Fee Waiver % | [WaiverFee] |
Net Expense Ratio % | [NetExpRatio] |
Solved! Go to Solution.
Had another go at it @tbuenaflor, I followed the logic of word with % then [word]
Regex used:
([\w\s]+%|\[[\w]+\])
However this logic does not seem to hold here:
Hope the workflow helps, please ask if any questions.
@IraWatt and @PhilipMannering Thanks for the replies and solutions! Really appreciate it as I was spining my wheels.
No worries @tbuenaflor glad it helped!