This site uses different types of cookies, including analytics and functional cookies (its own and from other sites). To change your cookie settings or find out more, click here. If you continue browsing our website, you accept these cookies.
You could use the multirow formula with an if statement similar to if keyword then row-1 value +1 but i don't think thats pretty efficient since you would need to create one for each keyword. In the end it would result in a lot of coding for a little effort. If you don't want it to show splitted into rows you could always in the end transpose (or summarize) your answer to get the result showing everything on the same row.
If this post helps, then please consider Accept it as the solution to help the other members find it more quickly.
When performing lexical analysis, the accepted first step is to tokenize your input. This usually performed by some some method that is analogous to splitting the sentences in the input to words and then transposing the words into rows.
You can then analyze each word, extract summary statistics, etc.
Maybe there is a way to perform this without splitting to words, but it would probably be enormously complex.
Why does the client have an aversion to splitting?
Agreeing with @danilang, tokenizing would be your standard first step when performing text analysis. But if your client has an aversion to the splitting approach, would they be okay using a python script? Attached is a sample solution to help you get a word frequency count.
It is somewhat of a cheat approach, only from the standpoint that I am not using standard Alteryx tools, but it gets the work done.
from ayx import Alteryx import pandas as pd import nltk import numpy as np
#Alteryx data read data = Alteryx.read("#1")
word_dist = nltk.FreqDist(data['Concat_Field1']) data_series = pd.Series(np.concatenate([x.split() for x in word_dist])).value_counts()
#conversion of dataSeries to dataFrame df = data_series.to_frame().reset_index() Alteryx.write(df,1)
Please let us know if this solution is acceptable (if yes, please mark it as Accept).
Thank you all for your time and hard work in helping me with this question. I really appreciate it very much. I have a solution but the solution is not quite dynamic as the workflow needs to be altered when there is a new keyword. However, no split method was used in this workflow.
Here I attach the workflow that I did for reference.