Alteryx Designer Desktop Discussions

Find answers, ask questions, and share expertise about Alteryx Designer Desktop and Intelligence Suite.

Differentiate Null and Blank cell in an IN-DB dataset

vaibhavmarathey
5 - Atom

Hello

 

I have an IN-DB dataset on HIVE (hdfs), in this certain cell are Blank as well Null. In an IN-DB workflow I am able to idenntify NULL cells but having trouble filtering out blank cell. 

 

Any pointers on solving this issue shall be great help. 

 

Thanks

Vaibhav

7 REPLIES 7
apathetichell
18 - Pollux

i believe '' (or empty value) is empty in Hive syntax so: where "column" =''  should work.

 

note -I had a recent situation in a Databricks query where I needed to drop the column name quotes to get it to run. And yes I do have field names quoted checked.

PanPP
Alteryx Alumni (Retired)

Hi @vaibhavmarathey,

 

In Designer, you can also utilize the formulas below to filter these out. 

 

IsEmpty([FieldName]) would identify the empty/blank values columns.

IsNull([FieldName]) would identify the null values in a column.

 

Hope this helps, if it does please mark it as a solution and like the post.

apathetichell
18 - Pollux

@PanPP  those don't work In-DB. Formula In-DB uses the native SQL format of the back end DB. Fields are (usually) denoted in quotes.

PanPP
Alteryx Alumni (Retired)

You can utilize the data stream out tool to utilize a normal formula tool with the expressions that were mentioned.

apathetichell
18 - Pollux

@PanPP - that's fine on 1000 records. or 10000 records. Have you tried to datastream out 10,000,000 or 100,000,000 or 1,000,000,000 records?

maygross
8 - Asteroid

Have you ever figured this out for the in DB filter tool? im currently having the same issue

apathetichell
18 - Pollux

are you using filter-in-db? What kind of SQL are you using? That would dicate your solution.

Labels