Z-score
Possible outliers: |z| > 2
Highly suspicious outliers: |z| > 3
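For reference, the Z-score of a value measures how many standard deviations it lies from the mean. Written out, with x̄ and s as assumed symbol names for the mean and the standard deviation (they match the roles of cells A2 and A3 in the script):

```latex
z_i = \frac{x_i - \bar{x}}{s}
```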
Detect outliers in the "Fare" column using the Z-score:
     A
1    =file("D://titanic.csv").import@qtc()
2    =A1.avg(Fare)
3    =sqrt(var@s(A1.(Fare)))
4    =A1.derive((Fare-A2)/A3:Fare_z)
5    =A4.select(Fare_z>3 || Fare_z<-3)

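A1 Import the Titanic data from the CSV file
A2 Calculate the mean of Fare
A3 Calculate the standard deviation of Fare (the square root of the variance returned by var@s)
A4 Calculate the Z-score of Fare and add it as a new field, Fare_z
A5 Select the samples with |z| > 3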
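
The same steps can be cross-checked outside SPL. The following is a minimal Python sketch, not part of the original script. It assumes that var@s denotes the sample variance (n - 1 denominator), so it uses statistics.stdev. The function name zscore_outliers and the fare values are hypothetical and made up for illustration; they are not taken from titanic.csv.

```python
from statistics import mean, stdev


def zscore_outliers(values, threshold=3.0):
    """Return (index, value, z) for every value whose |z| exceeds threshold."""
    if len(values) < 2:
        return []  # sample standard deviation is undefined for fewer than 2 values
    mu = mean(values)        # corresponds to A2
    sigma = stdev(values)    # corresponds to A3, assuming var@s is the sample variance
    if sigma == 0:
        return []            # all values identical, so nothing can be an outlier
    flagged = []
    for i, v in enumerate(values):
        z = (v - mu) / sigma           # corresponds to A4
        if abs(z) > threshold:         # corresponds to A5 with threshold 3
            flagged.append((i, v, z))
    return flagged


# Hypothetical fares: 19 ordinary values and one extreme value at the end.
fares = [7.25, 8.05, 7.9, 13.0, 26.0, 10.5, 7.75, 8.66, 9.5, 14.45,
         7.23, 15.5, 21.0, 7.9, 11.5, 12.35, 9.0, 8.05, 16.1, 512.33]

for index, fare, z in zscore_outliers(fares):
    print(f"row {index}: Fare={fare}, z={z:.2f}")
```

Note that with a sample standard deviation, |z| can never exceed (n - 1)/sqrt(n), so the |z| > 3 rule cannot flag anything in a sample of 10 or fewer values.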