Hi, I have a basic question about seeds when using parallelization. Suppose I would like to create a new column named SUM based on the numeric columns of the iris dataset:
I think you'll need to set the seed within your sum function.
Edit:
Actually, @DBScan, I apologize: that was kind of hacky advice for making parallel processing behave like purrr's sequential processing. I think furrr uses a different random number generation algorithm when working in parallel (parallel-safe L'Ecuyer-CMRG streams, as far as I know), so you won't ever see purrr and furrr produce the same random numbers. In your case, passing .options = furrr_options(seed = 42) ensures the random numbers generated inside your furrr calls are the same whether you use plan(sequential) or plan(multisession).
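
Here is a minimal sketch of what I mean. Since I can't see your actual function, noisy_sum() below is a hypothetical stand-in that adds random noise to the row sum, and limiting it to the first 10 rows of iris with 2 workers is just my choice for the example:

```r
library(future)
library(furrr)

# Hypothetical stand-in for the function in the question:
# row sum of the four numeric iris columns plus one random draw.
noisy_sum <- function(i) {
  sum(unlist(iris[i, 1:4])) + rnorm(1)
}

# Same seed option for both runs.
opts <- furrr_options(seed = 42)

# Run once sequentially ...
plan(sequential)
res_seq <- future_map_dbl(1:10, noisy_sum, .options = opts)

# ... and once on two background R sessions.
plan(multisession, workers = 2)
res_par <- future_map_dbl(1:10, noisy_sum, .options = opts)

# Switch back so the background workers are shut down.
plan(sequential)

# Compare the two runs.
identical(res_seq, res_par)
```

With the seed passed through furrr_options(), the two result vectors should match, because furrr assigns the random number streams per element rather than per worker. They still won't match what purrr::map_dbl() gives you after a plain set.seed(42).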