pyspark.sql.functions.var_samp

pyspark.sql.functions.var_samp(col)

Aggregate function: returns the unbiased sample variance of the values in a group.

New in version 1.6.0.

Changed in version 3.4.0: Supports Spark Connect.

Parameters
col : Column or column name

target column to compute on.

Returns
Column

unbiased sample variance of the given column.

See also

pyspark.sql.functions.variance()
pyspark.sql.functions.var_pop()
pyspark.sql.functions.stddev_samp()

Examples

>>> from pyspark.sql import functions as sf
>>> df = spark.range(6)
>>> df.select(sf.var_samp(df.id)).show()
+------------+
|var_samp(id)|
+------------+
|         3.5|
+------------+
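
The 3.5 above can be cross-checked without Spark. The following is a minimal sketch in plain Python, assuming the usual definition of the unbiased sample variance (sum of squared deviations from the mean, divided by n - 1). The variable names are illustrative only and are not part of the PySpark API.

```python
import statistics

# Same values that spark.range(6) produces in the id column: 0..5
values = list(range(6))

n = len(values)
mean = sum(values) / n

# Unbiased sample variance: divide by n - 1 (Bessel's correction)
sample_var = sum((x - mean) ** 2 for x in values) / (n - 1)

# Population variance for comparison: divide by n instead
pop_var = sum((x - mean) ** 2 for x in values) / n

print(sample_var)                   # hand-computed sample variance
print(statistics.variance(values))  # standard-library sample variance
print(pop_var)                      # smaller, because the divisor is n
```

The two sample-variance figures agree with the value shown for var_samp, while the population figure is smaller because it divides by n rather than n - 1.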