Hi, I am doing the Kaggle competition on SF crime classification. We need to break down the Address field so that it provides some useful variables for our decision tree. I was thinking of using Text to Columns to separate it into multiple parts, but I do not know where to go from there.
OAK ST / LAGUNA ST
OAK ST / LAGUNA ST
VANNESS AV / GREENWICH ST
1500 Block of LOMBARD ST
100 Block of BRODERICK ST
0 Block of TEDDY AV
AVALON AV / PERU AV

OAK ST / LAGUNA ST
OAK ST / LAGUNA ST
VANNESS AV / GREENWICH ST
1501 Block of LOMBARD ST
101 Block of BRODERICK ST
1 Block of TEDDY AV
AVALON AV / PERU AV

OAK ST / LAGUNA ST
OAK ST / LAGUNA ST
VANNESS AV / GREENWICH ST
1502 Block of LOMBARD ST
102 Block of BRODERICK ST
2 Block of TEDDY AV
AVALON AV / PERU AV

Intersection
IIF(CONTAINS(Address,"/"),1,0)
Block
IIF(CONTAINS(Address,"block",1),1,0)
Just a suggestion for these two variables.
Another alternative would be to use the RegEx tool and provide a match on the "/" or the word "block", or some other flexible regex statement.
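
For anyone who wants to prototype the same idea outside Alteryx, here is a rough Python sketch of the regex approach. It is only an illustration: the function name `parse_address`, the output field names, and the pattern itself are my own assumptions based on the sample addresses in the question, and real data may contain formats it does not cover.

```python
import re

# Hypothetical pattern derived from the sample rows, e.g. "1500 Block of LOMBARD ST".
BLOCK_RE = re.compile(r"^(\d+)\s+block\s+of\s+(.+)$", re.IGNORECASE)


def parse_address(address):
    """Split one address string into a few simple features (illustrative only)."""
    text = address.strip()
    features = {
        "is_intersection": 0,
        "is_block": 0,
        "block_number": None,
        "street_1": None,
        "street_2": None,
    }

    # Intersections look like "OAK ST / LAGUNA ST": two streets around a slash.
    if "/" in text:
        first, second = (part.strip() for part in text.split("/", 1))
        features.update(is_intersection=1, street_1=first, street_2=second)
        return features

    # Block addresses carry a number followed by "Block of" and the street name.
    match = BLOCK_RE.match(text)
    if match:
        features.update(
            is_block=1,
            block_number=int(match.group(1)),
            street_1=match.group(2).strip(),
        )
    else:
        # Anything else is kept whole so no information is silently dropped.
        features["street_1"] = text
    return features


for sample in ["OAK ST / LAGUNA ST", "1500 Block of LOMBARD ST", "0 Block of TEDDY AV"]:
    print(parse_address(sample))
```

The two flags match the Intersection and Block formulas above, while the block number and street names are extra columns you could also feed to the decision tree if they turn out to be useful.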
