Lots of the data we have resides in unstructured or semi-structured sources.
I would really love to see best-practice cases on parsing sources such as:
- Web pages, with or without tables, for competitor price monitoring
- PDF documents, such as a batch of CVs for HR analytics
- Lists of internal email communications for auditing
- Social media data, such as a Facebook timeline
For the unstructured case, getting the following into Alteryx:
- A series of photographs for analytics
- Radar, sonar, or similar sound data for preventive maintenance