insure=read.csv(choose.files(),header = TRUE)
insure
View(insure)
dg=ggplot(data=insure,aes(x=age,y=charges,color=region))+geom_point()+geom_smooth()
dg
set.seed(12345)
insure2=subset(insure,select = -c(region))
View(insure2)
partss=createDataPartition(insure2$smoker,p=0.75,list=F)
partss
train7=insure2[partss,]
train7
test7=insure2[-partss,]
test7
na.omit(insure2)
is.na(insure2)
dim(train7)
table(insure2)
dim(test7)
modelfit7=train(as.factor(smoker)~.,data=train7,method="glm")
modelfit7
prediction7=predict(modelfit7,data=test7)
prediction7
prediction7b=confusionMatrix(prediction7,as.factor(test7$smoker))
prediction7b
Hi, and welcome!
I'm sure that you have heard of lazy evaluation
in R
. The same principle applies in the community here, which is why a reproducible example, called a reprex is important. The question will attract more and better answers.
Hi, Thanks for the reply
I am a beginner and dont know much about R. I am getting the mentioned error. Any visible mistake that i did in the coding??
There may be, but it's too difficult to peer at what we're working with, which is why a reprex
is so helpful.
The error message comes from this line of code, apparently prediction7
and test7
don't have the same number of rows, possibly because there are NA
s in test 7
but we can't be sure with out a proper REPRoducible EXample (reprex).
This topic was automatically closed 21 days after the last reply. New replies are no longer allowed.