Hello! Thank you for your response and assistance.
So - in my original dataset which is a .csv file - I import it into R using 'read_csv'. It becomes a tibble.
My dataset is indexed by year and an id. Now - each of these observations have more columns(variables) associated with them (in ML-speak - more "features"?)
So, my data (tibble) is the following (example): (df)
Year ID ColA ColB ColC ................Col Z
2014 Adam 10 154 123
2015 Bob 20 30 51
.
.
.
I think I fixed this, and was able to create a tsibble that work. I used the following code:
dff <- df |>
as_tsibble (key = c(ID),
index = Year)
dfff <- dff |>
filter(ID = "Adam")
dfff |>
select(ColA)
With this code, only the Year, ID, and ColA remained.
However, what is the point of the 'key' function then? In my "solution", I just "keyed" in on the "id" variable that designates each observation over time. In this regard, I get why the index variable is what it is for a 'time series' object, i.e tsibble. On the other hand, what about all the other columns that I think I would include in the "key" designation? Shouldnt they be "keyed' in too? In the above example - I am referrring to 'Col B', 'Col C', etc. In your explanation - it would seem that if I had included 'Col B', 'Col C'., etc, then they cannot be "deselected". Is that correct?
I hope this makes sense and I apologize for my lack of the appropriate vernacular to describe my situation.
Thank you for your assistance. I appreciate it!