I've been reading some posts on XML parsing, but just not finding what I'm looking for.
Basic use case: I receive XML files from several sources, they all contain the following pieces of data at a minimum:
Standard GUID
Grade Level
Subject
DOK
However, each file uses a different tag for the above data and each file has the tag in a different place, with different numbers of child elements involved. And from one vendor, this can change from package to package.
My immediate main concern was the Standad GUID, and just tonight it dawned on me to use RegEx Parse tool to extract all GUIDs. While this is definitely useful and can get me to end-of-job, the problem is here that the label isn't included. And there are some cases where they use GUIDs to identify things other than a Standard, or they have multiple standards and the label would be of help for me to determine the source that GUID came from.
But - the real question here is - given all the other data points in the XML file listed above, and the fact that the files will contain many additional data points that I'm not even aware of, but would like to consume. Is there a method in Alteryx to first supply a list of ALL tags used, no matter how deeply embedded, that I could then pass through some other tool like XML Pars to use that list of selected tags to pull out the actual data?
@andrewkim80916 - this was part of what we were discussing tonight, thought I'd throw it out there in writing in case you had any additional thoughts.