Hi, I am doing the Kaggle competition on SF crime classification. We need to break down the address field so that it provides some useful variables for our decision tree. I was thinking of using Text to Columns to separate it into multiple parts, but I do not know where to go from there.
|
OAK ST / LAGUNA ST |
OAK ST / LAGUNA ST |
VANNESS AV / GREENWICH ST |
1500 Block of LOMBARD ST |
100 Block of BRODERICK ST |
0 Block of TEDDY AV |
AVALON AV / PERU AV |
|
OAK ST / LAGUNA ST |
OAK ST / LAGUNA ST |
VANNESS AV / GREENWICH ST |
1501 Block of LOMBARD ST |
101 Block of BRODERICK ST |
1 Block of TEDDY AV |
AVALON AV / PERU AV |
|
OAK ST / LAGUNA ST |
OAK ST / LAGUNA ST |
VANNESS AV / GREENWICH ST |
1502 Block of LOMBARD ST |
102 Block of BRODERICK ST |
2 Block of TEDDY AV |
AVALON AV / PERU AV |
|
Intersection
IIF(CONTAINS([Address],"/"),1,0)
Block
IIF(CONTAINS([Address],"block",1),1,0)
Just a suggestion for these two variables.
Another alternative would be to use the RegEx tool and match on the "/" or the word "block", or use some other, more flexible regex statement.
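If it helps to see the regex idea outside of Alteryx, here is a rough Python sketch of the same logic. It assumes the addresses only come in the two shapes shown in the question ("N Block of STREET" and "STREET / STREET"). The pattern names, the `parse_address` function, and the output field names are all made up for illustration, not anything from the competition or from Alteryx.

```python
import re

# Hypothetical patterns for the two address shapes in the sample data.
BLOCK_RE = re.compile(r"^\s*(\d+)\s+block\s+of\s+(.+?)\s*$", re.IGNORECASE)
INTERSECTION_RE = re.compile(r"^\s*(.+?)\s*/\s*(.+?)\s*$")


def parse_address(address):
    """Split an address string into simple features (illustrative only)."""
    features = {
        "is_intersection": 0,
        "is_block": 0,
        "block_number": None,
        "street_1": None,
        "street_2": None,
    }

    # Case 1: "1500 Block of LOMBARD ST"
    block = BLOCK_RE.match(address)
    if block:
        features["is_block"] = 1
        features["block_number"] = int(block.group(1))
        features["street_1"] = block.group(2).upper()
        return features

    # Case 2: "OAK ST / LAGUNA ST"
    cross = INTERSECTION_RE.match(address)
    if cross:
        features["is_intersection"] = 1
        features["street_1"] = cross.group(1).upper()
        features["street_2"] = cross.group(2).upper()

    # Anything else falls through with both flags left at 0.
    return features


if __name__ == "__main__":
    for sample in ["OAK ST / LAGUNA ST", "1500 Block of LOMBARD ST"]:
        print(sample, "->", parse_address(sample))
```

The two flags correspond to the Intersection and Block formulas above. The block number and street names are extra columns that may or may not be useful to the tree, so treat them as candidates to test rather than a recommendation.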