Hello everyone,
I would like to find a way to eliminate outliers that are beyond the 25th and 75th percentile of my dataset. The issue is that each row of my dataset represents a trajectory, and I would like to remove not only singular values, but a whole trajectory that, at least at one point along its duration (colummns F11 to F110), is considered outlier.
Here is a sample of my data:
dput(donneesCVaf)
structure(list(C = c("w", "w", "w", "w", "w", "l", "l", "l",
"w", "w"), F10 = c(858L, 831L, 614L, 802L, 782L, 472L, 449L,
629L, 560L, 565L), F11 = c(864L, 825L, 615L, 750L, 738L, 446L,
454L, 510L, 565L, 567L), F12 = c(872L, 812L, 618L, 654L, 680L,
430L, 453L, 474L, 556L, 558L), F13 = c(898L, 772L, 621L, 563L,
642L, 428L, 457L, 472L, 561L, 544L), F14 = c(853L, 718L, 621L,
529L, 625L, 438L, 452L, 481L, 558L, 531L), F15 = c(691L, 677L,
617L, 515L, 626L, 482L, 465L, 491L, 543L, 519L), F16 = c(642L,
642L, 615L, 533L, 576L, 506L, 494L, 503L, 569L, 512L), F17 = c(639L,
619L, 615L, 566L, 611L, 511L, 515L, 512L, 549L, 512L), F18 = c(630L,
603L, 614L, 605L, 627L, 507L, 562L, 576L, 582L, 517L), F19 = c(630L,
590L, 617L, 640L, 630L, 514L, 622L, 610L, 580L, 527L), F110 = c(645L,
579L, 624L, 630L, 606L, 562L, 648L, 673L, 597L, 540L)), row.names = c(NA,
10L), class = "data.frame")
I would appreciate any help! If you have any questions or if I did not express myself clearly, please do not hesitate to ask.