Hi All,
My team is currently in the process of implementing Collibra. Part of the initial upload, we are using the information schema of our database to upload and define the list of columns, tables and databases we have. Additionally, we are looking to get example values for each column (Ideally the top 10 most frequent occurring values for each field - so that we dont blow up the database when trying to query, like instances where we have a unique identifier).
So - The gist of the issue here is that I cant figure out how to iteratively change the field name, and table name of the query and then spit out a list of values for each column across the database. i.e. Grab all distinct values for column in table, move to next column, after all column values in that table are listed out, move to next table.
I've attempted to use a batch macro with update formula, but I keep encountering a schema related error as it runs. Has anyone attempted at fulfilling a similar kind of request? What would be the best way to generate the list iteratively.
Solved! Go to Solution.
First, can you confirm that the process works if you only feed in one table/column?
If it does, the below might be helpful:
Inside the batch macro, try going into the View->Interface Designer and selecting "Auto Configure by Name" or "Auto Configure by Position" in the properties tab of that screen (under the wrench icon).
You may also want to check off "output fields change based on macro's configuration or data input", but it is not required.
That should fix the schema error you are encountering.
I can confirm that the process works when i feed multiple columns and a singular table name, but cant seem how to incorporate getting it to switch tables. I've attached a screenshot for reference - Although I am no longer getting errors thrown, i just dont see anything in the output.
Heres a quick rundown of the below screen shot:
So I was able to figure it out.
Using this forum question/answer as inspiration:
Breakdown of how i did this: