I tried to keep it simple while still making the final table flexible for analysis. I opted for a wildcard input to simplify the flow, and split tweets with multiple hashtags into separate rows, one row per hashtag. Because a tweet can now appear on several rows, we will just need to remember to use a count distinct on the IDs for any analysis that counts tweets, to avoid double counting. I also parsed the tweet content columns and the date-time field so they are easier to work with. Finally, I filtered out cases where the tweet content was only unrecognized characters (?????? ????).
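
For anyone who wants to see the row split and the count distinct point outside the workflow tool, here is a rough Python sketch. It is not the actual flow: the sample rows, the field names `id` and `text`, and the helper names `is_unrecognized` and `split_hashtags` are all made up for illustration, and it assumes that "unrecognized" means the text contains nothing but question marks and whitespace.

```python
import re

# Hypothetical sample rows; the field names "id" and "text" are assumptions.
tweets = [
    {"id": 1, "text": "Loving #data and #viz today"},
    {"id": 2, "text": "?????? ????"},
    {"id": 3, "text": "Just #data"},
]

def is_unrecognized(text):
    """True when the text has no characters other than '?' and whitespace."""
    stripped = "".join(text.split())
    return stripped != "" and set(stripped) == {"?"}

def split_hashtags(rows):
    """Return one row per (tweet, hashtag), skipping unrecognized tweets.

    Assumption: tweets with no hashtag produce no rows in this sketch.
    """
    out = []
    for row in rows:
        if is_unrecognized(row["text"]):
            continue
        for tag in re.findall(r"#\w+", row["text"]):
            out.append({"id": row["id"], "hashtag": tag.lower()})
    return out

rows = split_hashtags(tweets)
row_count = len(rows)                       # plain row count, repeats tweet 1
tweet_count = len({r["id"] for r in rows})  # count distinct on the tweet ID
```

With this sample, the plain row count comes out higher than the distinct ID count because tweet 1 carries two hashtags, which is exactly the duplication the count distinct guards against.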