In today's video, I'll show how to find and remove (potential) outliers by means of boxplots.
Some main points are
-creating decent boxplots in R Studio
-looking up the exact values for outliers
-basic strategies for when & how to exclude outliers
-removing outliers with a basic ifelse function and checking the results.
Hope it helps!
TIMESTAMPS:
00:54 brief description data file
01:28 minimal boxplot in R Studio
02:47 prettier boxplot in R Studio
05:08 when should you exclude (potential) outliers?
05:56 how should you exclude (potential) outliers?
07:00 count NA values per variable
08:32 find exact values for outliers
09:39 exclude outliers with elseif function
14:09 check if results are correct
RESOURCES:
Everything about boxplots: https://www.spss-tutorials.com/boxplot-what-is-it/
Download example data file from: https://www.statistics-made-simple.com/downloads/speedtask.csv
Download finished R code from: https://www.statistics-made-simple.com/downloads/find-outliers-from-boxplots-01.R
FOLLOW US:
Website: https://www.statistics-made-simple.com
Youtube: https://www.youtube.com/@statistics-made-simple/
LinkedIn: https://www.linkedin.com/company/statistics-made-simple/
Facebook: https://www.facebook.com/stats.made.simple
#r #statistics