I have been playing with this data set for quite some time and just can't get it right, so I wanted to reach out and see if I could get some assistance from a wiz in the community.
1. This is mocked-up data that roughly matches the characteristics of the real data I'm dealing with.
2. Not all tags start on a new line. In the sample file, for example, the first line of table data sits at the end of the line that holds the last table header.

3. The punctuation characters in the sample data text ( - , " ' . ~ & ; ) are the only ones I have found in the source. Because they occur in the data, find/replace can't be used on them, but ^ $ @ | do not occur in the source, so you can use those if you need to create generic break points.
4. All relevant table row tags have an explicit closing tag such as </tr>, and in the actual file they are never on a new line. For the sake of viewing, I have modified the layout in the sample so that the break points are easier to see and the parsing is easier to validate.
5. The <BR> tags are not necessary and can be removed entirely. The values they separate will go into one cell as a comma-separated string.
Thanks in advance for any assistance you can provide.
Ideally, I would like the TH values to become the table header (8 columns), with the data in each tr listed as values under those columns.
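
To make that target layout concrete, here is a minimal sketch in Python using only the standard-library html.parser module. It is an illustration under assumptions, not a tested solution for the real file: the class name TableRows and the two-column sample string are made up, it assumes a single table whose TH cells all belong to one header, and it joins the BR-separated values with ", ".

```python
from html.parser import HTMLParser


class TableRows(HTMLParser):
    """Hypothetical helper: <th> cells become the header, <td> cells the rows."""

    def __init__(self):
        super().__init__()
        self.header = []      # text of each <th> cell
        self.rows = []        # one list of cell strings per data <tr>
        self._row = None      # <td> cells of the <tr> currently open
        self._parts = None    # text fragments of the cell currently open

    def handle_starttag(self, tag, attrs):
        # html.parser lowercases tag names, so <BR> and <TH> arrive as br/th.
        if tag == "tr":
            self._row = []
        elif tag in ("th", "td"):
            self._parts = [""]
        elif tag == "br" and self._parts is not None:
            self._parts.append("")   # <BR> starts a new value in the same cell

    def handle_data(self, data):
        # Text outside a cell (such as newlines between tags) is ignored.
        if self._parts is not None:
            self._parts[-1] += data

    def handle_endtag(self, tag):
        if tag in ("th", "td") and self._parts is not None:
            # Collapse whitespace, drop empty fragments, join with commas.
            values = [" ".join(p.split()) for p in self._parts]
            cell = ", ".join(v for v in values if v)
            if tag == "th":
                self.header.append(cell)
            elif self._row is not None:
                self._row.append(cell)
            self._parts = None
        elif tag == "tr":
            if self._row:            # header-only rows hold no <td> cells
                self.rows.append(self._row)
            self._row = None


# Made-up sample: tags are not always on their own line, as in the real data.
sample = (
    "<table><tr><th>Name</th>\n<th>Tags</th></tr><tr><td>A &amp; B</td>"
    "<td>x<BR>y<BR>z</td></tr></table>"
)
parser = TableRows()
parser.feed(sample)
parser.close()
print(parser.header)   # ['Name', 'Tags']
print(parser.rows)     # [['A & B', 'x, y, z']]
```

Because the parser works on tags rather than on lines, it does not matter where the line breaks fall, and no ^ $ @ | break points are needed. Since commas can also occur inside the data, a different separator may be safer if the joined values ever have to be split again.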
Please let me know if you have any questions.