Hi, and welcome!
Please see the FAQ: What's a reproducible example (`reprex`) and how do I create one? Using a reprex, complete with representative data will attract quicker and more answers. We're not quite there yet, however. But keep in mind for coding related questions. Also, please review the homework policy for expectations on academic assignment related questions.
The first thing to do is to take a really deep diaphragm stretching breath. There's enough background stress in everyone's life that it's easy to get knocked off one's emotional balance with what seems like an impossible task. But you can do it.
The first thing is to describe the goal:
Using the XYZ dataset of [some set of variables], the goal is to select the [best][reasonable] variables and features to complete a [type of analysis] of the XYZ data set.
For example, strategies for variable selection are similar but quite distinct between ordinary least squares regression using lm
and logistic regression, using glm
.
So, the two questions are what data? and what model? And you can go from data to model or from model to data. Select the data and then the model that can take that type of data or select the model and find data that it will be able to handle.
I'm not entirely sure what filter
methods that you have in mind. Does it involving selecting observations based on some set of criteria?