Alteryx Designer Desktop Discussions

johnyd01 · ‎01-04-2017

Good morning,

New to Alteryx (the product, and hence this community). After some beginners assistance please; I can't seem to solve an issue.

Overall objective: I want to take values from a spread sheet, that I can then use to retrieve data from a SQL server, and finally blend the two different sources. The spread sheet has unique identifiers in it, that I want to use to query the SQL server, because I don't want to return all the data.

For example:

Spread sheet has Unique IDs , 112233, 223344, 334455 - there will be thousands and they won't always be the same (i.e. I want to create a repeatable workflow that has different spread sheets as the source).

I then want to retrieve (select *) the 'product type' where the Unique ID is equal to 112233, 223344, 334455, and so on.

What is 'the best' way of taking all the unique IDs from the spread sheet and using those values to construct the select statement?

Appreciate the help and guidance.

Regards,

JD.

JonA · ‎01-04-2017

Hey JD. I'll briefly explain how I'd do it, then I'll point you to some helpful resources that show you other examples.

Start with your spreadsheet as an input data tool.
Next, drag a dynamic input tool downstream and connect your spreadsheet to it.
- Configure the dynamic input tool by selecting 'Edit...' (here you're defining the template so the values from your spreadsheet will update the query, you're not solidifying the end result just yet)

In 'Connect to file or database box, select the connection string to the db you'd like to query from
- Under options, box 3, select the box with 3 dots to edit the query
- Enter in a simple statement that will soon be replaced (I used this one as I have a unique list of Customer IDs: Select * From Demo.dbo.Transactions_Dates Where Customer_ID=3)
- Select the toggle button to 'Modify SQL Query'
- Then click the 'Add (down arrow)' button and choose SQL: Update WHERE clause - this will read your statement and populate most boxes for you
- Verify the claus you want to update, the text (or IDs in your case) from your template query you want to replace, and the field in your spreadsheet that will replace the value in your query

Click OK and run the workflow

Helpful resources:

Check out this virtual training from our product training page: http://www.alteryx.com/virtual-training
- Under 'Watch a Past Session' - find 'You Get What You Need - Dynamic Inputs'
- The link to playback the recording is here
Here's another link from the Community that will help explain the process in more detail: https://community.alteryx.com/t5/Alteryx-Knowledge-Base/Modifying-SQL-Query-using-the-Dynamic-Input-...
Lastly, the Community has an incredible 'Tool Mastery' series that has featured the dynamic input tool in the past.

johnyd01 · ‎01-05-2017

Absolutely spot on, thank you.

Got this working within minutes.

Thank you sir.

nbarai · ‎01-05-2017

Hi Jon

Thanks for the detailed explanation/instructions - very useful.

Just a quick follow up question... I see there may be a limit in the "SQL IN Clause". What would be the recommened solution if we exceed the limit.

King Regards and Thank You.

Naresh

RodL · ‎01-05-2017

@nbarai,

In regards to your question on the limitation for the "SQL IN Clause", this setting when checked will allow for a single query to be run (where your query actually contains an IN clause). It will basically combine a list of values brought into the Dynamic Input tool into a list for the IN clause. If the size limit is exceeded by the number of characters coming through the list, it will split up the query into multiple queries.

Check out Help for the details.

Not certain if this is the reason for this option, but there is a limitation (at least in MS SQL Server) of the characters allowed in a query statement...https://msdn.microsoft.com/en-us/library/ms143432.aspx

Ruchi · ‎08-30-2018

Hi John,

I implemented the same method as you have suggested below, but I can see in the output that the connection is establishing after every select query.

I use Hive connection(Cloud DB) and I see the below statement after every select query:

Dynamic Input (2) ODBC Driver version: 03.80

Dynamic Input (2) ODBC Driver version: 03.80

Is there any way to include loop or something so that we can get rid of establishing DB connection again and again.

nbarai · ‎08-31-2018

Hi Ruchi

I don't think you can just do one connection and subsequently use different queries using the initial connection.

I could be wrong but I believe you try to get as much information as you possibly can in the first connection (this way you reduce the number of odbc connection). I fully acknowledge that the performance may need to be balanced

Also..when you start the workflow, the connections are initialised at the beginning

Alteryx Designer Desktop Discussions

Using values from a spread sheet to query SQL server

Re: Date Time Function - Prioritization Base on Du...

Re: Running multiple alteryx workflows within alte...

Re: Selecting the columns coming after a specific ...

Re: Regex(?) formula to remove values matching the...

Re: Python ECC SAP Extract into Alteryx Workflow