I am trying to do some modeling of crop yield data using weather data. I have created a list of data models in a tibble, similar to how Hadley does his in his "managing many models" tutorial. However, I am running into a severe Unknown or Uninitialized Column error, which now occurs with half or more of the lines of code I run.
Here is the beginning of my code, and then I show 8 lines that create the nested tibbles in each row/column of the giant tibble that contains the models.
corn <- read.csv('I:\\HATCH_Project\\Corn.csv')
corn <- corn[corn$Single == 'Yes',]
corn$County[which(corn$County == 'Mcpherson')] <- 'McPherson'
corn$County[which(corn$County == 'Mccook')] <- 'McCook'
corn_df_values <- data.frame(corn$Year,corn$County,corn$Value); colnames(corn_df_values) <- c('Year','County','Yield')
corn_final <- corn_df_values %>% pivot_wider(names_from = County, values_from = Yield)
corn_final <- corn_final[order(corn_final$Year),]
corn_final_wide <- corn_final %>% tibble %>% select("Year", sort(colnames(.)))
corn_final_long <- corn_final_wide %>% pivot_longer(!Year, names_to = 'County', values_to = 'Yield')
corn_final_long <- corn_final_long[-which(is.na(corn_final_long$Yield)),]
corn_by_county <- corn_final_long %>% group_by(County) %>% mutate(n = n())
corn_by_county <- corn_by_county %>% group_by(County,n) %>% nest()
gdd_runsum_df <- tibble(Date = seq(as.Date('2020-04-01'),as.Date('2020-11-01'),by='day'))
gdd_runsum_df$month <- month(gdd_runsum_df$Date); gdd_runsum_df$day <- day(gdd_runsum_df$Date)
gdd_runsum_df$runsum1 <- NA; gdd_runsum_df$runsum5 <- NA; gdd_runsum_df$runsum10 <- NA; gdd_runsum_df$runsum15 <- NA; gdd_runsum_df$runsum20 <- NA;
gdd_runsum_df$runsum25 <- NA; gdd_runsum_df$runsum30 <- NA; gdd_runsum_df$runsum35 <- NA; gdd_runsum_df$runsum40 <- NA; gdd_runsum_df$runsum45 <- NA;
gdd_runsum_df$runsum50 <- NA; gdd_runsum_df$runsum60 <- NA; gdd_runsum_df$runsum70 <- NA; gdd_runsum_df$runsum75 <- NA; gdd_runsum_df$runsum80 <- NA; gdd_runsum_df$runsum90 <- NA;
gdd_runmean_df <- tibble(Date = seq(as.Date('2020-04-01'),as.Date('2020-11-01'),by='day'))
gdd_runmean_df$month <- month(gdd_runmean_df$Date); gdd_runmean_df$day <- day(gdd_runmean_df$Date)
gdd_runmean_df$runmean1 <- NA; gdd_runmean_df$runmean5 <- NA; gdd_runmean_df$runmean10 <- NA; gdd_runmean_df$runmean15 <- NA; gdd_runmean_df$runmean20 <- NA;
gdd_runmean_df$runmean25 <- NA; gdd_runmean_df$runmean30 <- NA; gdd_runmean_df$runmean35 <- NA; gdd_runmean_df$runmean40 <- NA; gdd_runmean_df$runmean45 <- NA;
gdd_runmean_df$runmean50 <- NA; gdd_runmean_df$runmean60 <- NA; gdd_runmean_df$runmean70 <- NA; gdd_runmean_df$runmean75 <- NA; gdd_runmean_df$runmean80 <- NA; gdd_runmean_df$runmean90 <- NA;
Please let me know if I am doing something wrong here. I asked this on Stack Overflow (https://stackoverflow.com/questions/64902925/how-to-fix-unknown-or-uninitialised-column-error-in-r-tidyverse?noredirect=1#comment114764828_64902925), but no answers came. My nested tibble is 271 columns wide, and 66 rows long. Maybe it's just too big. I don't know. But I know this error is really starting to wear on me. Please help:) Thank you.