I am currently working Logistic regression where the variable whether_worked (binary variable of 1/0) is explained by the categorical variable experience during the study period.
The explanatory variable is the categorical variable(work_experience) taking values:
mandatory internship
optional internship
volunteering
-work in accordance with the field of study
work incompatible with the field of study
lack of experience
The question was in multiple choice form and written in 3 columns q5_1 q5_2 q5_3 so you could write out up to 3 options. How should I include this in the regression? When the dependent variable is explained by gender or education you can't be female and male at the same time you can't have college and high school education at the same time. In contrast, here someone has both mandatory internship and volounteering .
Currently it looks like this
glm(does_work~ work_experience, family ='binomial)
Unfortunately yes I do not include the answers from q5_2 and q5_3 . I'll add that q5_2 and q5_3 have 70% empty columns rather each gave one main, but still have additionally interesting information
Main question is about type of work expierence during education where mainly anwsers are in p5_1 but some observation use also column p5_2 and p5_3 casue you could choose up to 3 how to use it ;D
If you want to include work experience include the first four possible work answers and drop lack of experience. Or maybe copy and paste here a short piece of your real data.
This is the 3 columns that are anwsers to question: What was your last work expierence during education period? You could just write an anwser from 6 posibilities in 5_1 and then 5_2 and 5_3 you could add also from the same posibilites more choices from the same 6 category. Most of people just anwser with 1 type of work but some of them add also info in 5_2 and 5_3 And how can I use it in regression casue normally you someone is women or men somene has 1 level of education. But here you have multiple choices