hstate Comhpsu hpsu hhno gg08 gg114
<dbl+lbl> <dbl> <dbl> <dbl> <dbl> <chr>
1 10 [Bihar] 151 1 70 4 "S"
2 10 [Bihar] 151 1 83 9 "S"
3 10 [Bihar] 151 1 221 3 "S"
4 10 [Bihar] 151 1 344 4 "FS"
5 10 [Bihar] 152 2 43 5 "S"
6 10 [Bihar] 152 2 53 3 "C"
7 10 [Bihar] 152 2 55 7 "Y"
8 10 [Bihar] 152 2 136 3 "HN"
9 10 [Bihar] 152 2 386 4 "S"
10 10 [Bihar] 152 2 404 3 "N"
11 10 [Bihar] 152 2 494 4 "N"
12 10 [Bihar] 153 3 8 4 "LS"
13 10 [Bihar] 153 3 9 3 "N"
14 10 [Bihar] 153 3 12 4 "T"
15 10 [Bihar] 153 3 41 3 "S"
16 10 [Bihar] 153 3 95 6 ""
17 10 [Bihar] 153 3 153 3 "S"
18 10 [Bihar] 153 3 202 2 "RS"
19 10 [Bihar] 153 3 219 3 "AB"
20 10 [Bihar] 153 3 402 3 "S"
gg114 contains keys for reasons of dropping out of school. Now I want to make a summary of the reasons of dropping out, specifically I want to show how much each reason contribute in dropping out; but there are cases where there are more than one reason of dropping out, so I also am not sure what would be correct mathematical approach of doing the summary(whether a sort of linear approach where you get a percentage of a specific reason, or any other like alloting weights where there is more than one reason. Note each letter is a distinct reason, a string of letters like 'LS' would mean reason L and reason S. Also if you can suggest ways to show this graphically, it would be much appreciated. Thank you.