Inspire 2017 | Tech Track Ideas

Shape the agenda. Submit your ideas for the Tech Track.

Best practices on parsing semi or unstructured data

Lot's of the data we have reside in unstructured or semi-structured sources...

 

Really would love to see best practice cases on parsing sources such as;

  • web pages with or without tables for competitior price monitoring
  • PDF documents like bunch of CV's for HR analytics
  • Lists of internal E-mail communications for auditing
  • Social media data like Facebook timeline

 

For the unstructured case;

  • Getting a series of photographs for analytics,
  • Radar, sonar or similar sound data for preventive maintenance into Alteryx
2 Comments
Atabarezz
13 - Pulsar

I'd like to provide some info from a TDWI research, a little bit old but provides an example list of sources;

Which Types of Data and Source Systems Feed Your Data Warehouse? (Numbers are based on 370 respondents.)

 

research-fig2.gif

 

 

 

 

LeahK
Alteryx Alumni (Retired)
Status changed to: Maybe Next Year

Thank you for the idea @Atabarezz! Unfortunately, we won't be offering this as part of the tech track this year.  The good news is that I confirmed that we will be offering beginner and intermediate/advanced parsing courses as part of our training offerings at Inspire. I'll make sure to provide an update with the details, as soon as the official training schedule is released.