Be sure to review our Idea Submission Guidelines for more information!
Submission GuidelinesHello,
After used the new "Image Recognition Tool" a few days, I think you could improve it :
> by adding the dimensional constraints in front of each of the pre-trained models,
> by adding a true tool to divide the training data correctly (in order to have an equivalent number of images for each of the labels)
> at least, allow the tool to use black & white images (I wanted to test it on the MNIST, but the tool tells me that it necessarily needs RGB images) ?
Question : do you in the future allow the user to choose between CPU or GPU usage ?
In any case, thank you again for this new tool, it is certainly perfectible, but very simple to use, and I sincerely think that it will allow a greater number of people to understand the many use cases made possible thanks to image recognition.
Thank you again
Kévin VANCAPPEL (France ;-))
Thank you again.
Kévin VANCAPPEL
At the moment, we are not able to use input data field names and its values in Output tool, mainly in the Pre-SQL and Post-SQL statement. I see some discussions on this in the community and in many scenarios we require that. It will be great if we have this option.
Idea:
A funcionality added to the Impute values tool for multiple imputation and maximum likelihood imputation of fields with missing at random will be very useful.
Rationale:
Missing data form a problem and advanced techniques are complicated. One great idea in statistics is multiple imputation,
filling the gaps in the data not with average, median, mode or user defined static values but instead with plausible values considering other fields.
SAS has PROC MI tool, here is a page detailing the usage with examples: http://www.ats.ucla.edu/stat/sas/seminars/missing_data/mi_new_1.htm
Also there is PROC CALIS for maximum likelihood here...
Same useful tool exists in spss as well http://www.appliedmissingdata.com/spss-multiple-imputation.pdf
Best
I've come to realize that the JOIN tool is case-sensitive by design but it would be helpful if you could turn that behavior on/off (via checkbox?) within the JOIN tool. For those of us that work predominantly in database environments that are not case-sensitive, this default behavior has caused me problems many times. Having to force the case to either upper or lower upstream of the JOIN on both flows in order to ensure a successful join is an extra step that would not be necessary if you could disable case-sensitive with a checkbox.
There is a web hosted trial that anyone can have a hands on experiance with alteryx tutorials without even downoading the tool.
That's awesome... http://goo.gl/dpSoe2
It may be a nice idea to;
1) either start seperate "Alteryx-kaggle" instances with data sets specific to each kaggle competition so that anyone want to try out may have a go with those well known examples thru the Alteryx site,
2) Or even better have a partnership with kaggle so that anyone can just have it's own Alteryx trial per specific competition on the kaggle website...
I'm sure this will draw a lot of attention...
Rationale;
You'll immediately have a greater reach in Kaggle community, some data hobbiyists and cs, ie students and acedemics (which will eventually end up doing lot's of data blending when ther are going to be hired by top notch firms...
I find it very difficult to read Warnings in the Messages palette because the text is a light yellow against a white backgroud.
I'd love to be able to change either the text color, the background color of the palette, or both.
I think it would be great to add metadata to a yxdb. For example, I was back tracking and trying to figure out which module/app I used to create an old yxdb. Now I use Notepad++ and do a "Find In Files" Search. Wouldn't it be great it the module path would be available when you look at the properties of a yxdb in Alteryx?
In v10, I am using the summarize tool a lot and getting tired of selecting one or more fields and doing a sum function and having to revisit each summary tool when you add a numeric field upstream... I was hoping there would be a more dynamic method, e.g. select all numeric fields and then doing a SUM on _currentfield_.
Then I remembered the Field Info tool. (on a side note, I'd bet this tool is overlooked a lot). This tool is great because for each numeric field you get Min, Max, Median, Std Dev, Percent Missing, Unique Values, Mean, etc.
The one thing that's missing is SUM. Can you add it?
Also, can you give the user option to turn off layouts and reports so it runs faster? I only care out the data side.
or is there another way to do sum on dynamically selected numeric fields? (include Sum on Unknown field)
It would be great to have a spatial function that could be used to evaluate whether two spatial objects are equal/identical. I see this being available in at least three places:
Would be nice if could use something like $Field rather than repeating the field name in the Condition and Loop expression within the Generate Rows tool
Currently there is no option to edit an existing macro search path from Options-> User Settings -> Macros. Only options are Add / Delete. Ideally we need the Edit option as well.
Existing Category needs to be deleted and created again with the correct path, if search path is changed from one location to another.
Tools within a workflow needs to be able to run in parallel whereever applicable.
For example: Extracting 10 million rows from one source, 12 million rows from a different source to perform blending.
currently the order of execution is the order in which tools are dragged into the canvas. Hence Source1 first, Source2 second and then the JOIN.
Here Source1 & Source2 are completely independent, hence can be run in parallel. Thus saving the workflow execution time.
Execution time is quite crucial when you have tight data loading window.
Hopefully alteryx considers this in the next release!
Idea:
A tool for encryption/decription of a column with multiple encrypiton options is the idea.
Both one way and two way encription should be possible.
Rationale:
Clients are in need of encrypting customers' personal identification data
before sharing it with a third party like consultants and analytics service providers etc.
When insights are provided back the data owner needs to quickly decrypt the ID field and get results or decide actions.
Clients:
This is especially an important case for banks, non bank financial institutions and telecom companies in EU countries and similar (Turkey has similar strict rules)
Best
The sum function is probably the one I use most in the summarize tool. It is a silly thing, but it would be nice for "Sum" to be in the single-click list, rather than in the "Numeric" category...
It would be cool if a connector line would turn red when you select it, making it easier to trace the path (similar to how the lines turn red when you click on a join tool).
This setting is currently in the Options menu under user settings, but I think it would be more intuitive and more consistent with the norm for most software if the check box were directly on the splash screen.
Idea:
As a method of deploying preprocessing and ML models it would be awesome to be able to convert a workflow to java...
Rationale:
models are needed to be deployed into Complex event processes or decision systems. Even for SAS there is a need to implement the datastep algorithms and procs to run in JVM.
Quickwin:
It is possible to convert a workflow into a PMML file and then use JMML package to convert that to Java. Yet the full workflow with all preprocessing alternatives and a series of ML methods may not be captured fully.
Competitor example:
For SAS case here is a similar solution: http://www.dullesresearch.com/carolina-features/
When you do a search for a tool, don't zoom to a blank area of the canvas (because that's where the tool would have been with an enabled container). So when a container is disabled, zoom to the actual container instead containing the tool you're looking for or temporarily enable the container so we can see the tool?
Idea:
In forecasting and in commercial/sme risk scoring there is a need for trying vast number of algebraic equations which is a very cumbersome prosess. Let's add symbolic regression as a new competitive capability.
Rationale:
Summations, ratios, power transforms and all combinations of a like are needed to be tested as new variables for a forecasting or prediction model. Doing this by hand manually is a though and long business... And there is always a possibility for one to skip a valuable combination.
Symbolic regression is a novel techinique for automatically generating algebraic equations with use of genetic programming,
In every evolution a variable is selected checked if the equation is discriminatitive of the target variable at hand. In every next step frequently observed variables will be selected more likely.
Benefit for clients:
This method produces variables mainly with nonlinear relationships. It is a technique that will help in corporate/commercial/sme risk modelling, such that powerful risk models are generated from a hort list of B/S and P/L based algebraic equations.
There is potential use cases in algorithmic trading as well...
There are 3 very interesting world problems solved with symbolic regression here.
A very relevant thesis by sean Wouter is attached as a pdf document for your reading pleasure...
R side of things:
I've found Rgp package for genetic programming, here is a link.
Competition:
I haven't seen something similar in SAS, SPSS but there is this; http://www.nutonian.com/products/eureqa/
Also there is Bruce Ratner's page
Is it possible to add some color coding to the InDB tool. I am building out models InDB and I end up with a sea of navy blue icons. Maybe they could generally correspond to the other tools. For example the summary would be orange. Etc Formula Lime Green.
User | Likes Count |
---|---|
7 | |
4 | |
4 | |
3 | |
3 |