embed package target encoding issue

The question can't be fairly evaluated without a reprex (see the FAQ). High-cardinality data such as yours is an active area of research. This recent paper provides guidance on using the lmer (regression) and glmer (classification) functions from the lme4 package in R as an efficient way to fit GLMMs. (R code is linked from the paper.)
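
In case it helps in the meantime, here is a minimal sketch of the general idea: fit a random-intercept model with lmer and use each level's partially pooled estimate as the numeric encoding. This is not the paper's code and not necessarily what embed does internally. The data are simulated, and the names `zip`, `y`, `zip_encoded`, and `encoding` are made up for illustration.

```r
# Sketch only: simulated data and hypothetical column names (zip, y).
library(lme4)

set.seed(1)
n_levels <- 200
level_names <- sprintf("z%03d", seq_len(n_levels))

# A high-cardinality factor with 200 possible levels
dat <- data.frame(
  zip = factor(sample(level_names, 2000, replace = TRUE))
)

# Simulated outcome: global mean + per-level effect + noise
level_effect <- setNames(rnorm(n_levels, sd = 1), level_names)
dat$y <- 5 + level_effect[as.character(dat$zip)] + rnorm(nrow(dat))

# Random-intercept model: each level gets a partially pooled estimate
fit <- lmer(y ~ 1 + (1 | zip), data = dat)

# coef() returns fixed intercept + random effect, one row per observed level
enc <- coef(fit)$zip
encoding <- data.frame(
  zip = rownames(enc),
  zip_encoded = enc[["(Intercept)"]]
)

# Replace the factor by its numeric encoding
dat$zip_encoded <- encoding$zip_encoded[match(dat$zip, encoding$zip)]

# Levels not seen during fitting would fall back to the global intercept
dat$zip_encoded[is.na(dat$zip_encoded)] <- fixef(fit)[["(Intercept)"]]
```

For a binary outcome the same pattern would use `glmer(y ~ 1 + (1 | zip), family = binomial, data = dat)`, in which case the encoded values are on the log-odds scale. To avoid leakage, the encoding should be estimated on the training set only and then applied to new data.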