This site uses different types of cookies, including analytics and functional cookies (its own and from other sites). To change your cookie settings or find out more, click here. If you continue browsing our website, you accept these cookies.
My team is currently in the process of implementing Collibra. Part of the initial upload, we are using the information schema of our database to upload and define the list of columns, tables and databases we have. Additionally, we are looking to get example values for each column (Ideally the top 10 most frequent occurring values for each field - so that we dont blow up the database when trying to query, like instances where we have a unique identifier).
So - The gist of the issue here is that I cant figure out how to iteratively change the field name, and table name of the query and then spit out a list of values for each column across the database. i.e. Grab all distinct values for column in table, move to next column, after all column values in that table are listed out, move to next table.
I've attempted to use a batch macro with update formula, but I keep encountering a schema related error as it runs. Has anyone attempted at fulfilling a similar kind of request? What would be the best way to generate the list iteratively.
I can confirm that the process works when i feed multiple columns and a singular table name, but cant seem how to incorporate getting it to switch tables. I've attached a screenshot for reference - Although I am no longer getting errors thrown, i just dont see anything in the output.
Heres a quick rundown of the below screen shot:
control parameter is set to the Column name
Using "replace a specific string" i update the input node with the control parameter that is being passed through
use the field info to get source info
Parse and filter source info to append the table name as a field
Filter out nulls
Add ranking to popularity of field value since it came out sorted via the input node