Hi all,
I need to classify categorical values into groups and then calculate the size of each group (the frequency or percentage of values in the group).
I especially need help with the classification. I have a column of about 100 values, each consisting of letters and numbers. The goal is to classify them by their first two letters. For example, if the cell value begins with "AA", it belongs to Group 1, and so on. I would like to create a column where each group appears once, with the number of values in that group next to it in another column.
Is this even possible? Any help with classifying categorical values would be much appreciated.
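To make the desired result concrete, here is a minimal sketch in Python of the grouping described above. The tool is an assumption (the post does not say which software the data is in), and the sample values and the name group_sizes are made up for illustration only.

```python
from collections import Counter


def group_sizes(values, prefix_len=2):
    """Group values by their leading characters and count each group.

    Returns a list of (prefix, count, percentage) tuples sorted by prefix.
    Assumption: the group is defined only by the first `prefix_len`
    characters, compared case-insensitively after trimming whitespace.
    """
    counts = Counter(str(v).strip()[:prefix_len].upper() for v in values)
    total = sum(counts.values())
    return [(prefix, n, 100.0 * n / total) for prefix, n in sorted(counts.items())]


# Hypothetical sample data standing in for the real column of about 100 values.
sample = ["AA101", "AA205", "AB007", "AC310", "AB112", "AA999"]

# One row per group: prefix, frequency, percentage of all values.
for prefix, n, pct in group_sizes(sample):
    print(f"{prefix}\t{n}\t{pct:.1f}%")
```

With this sample, each prefix is listed once with its frequency and share of the total, which is the two-column layout I am after.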