pyspark.pandas.Series.clip

Series.clip(lower=None, upper=None, inplace=False)

Trim values at input threshold(s).

Assigns values outside the boundary to the boundary values.

Parameters
lower : float or int, default None

Minimum threshold value. All values below this threshold will be set to it.

upper : float or int, default None

Maximum threshold value. All values above this threshold will be set to it.

inplace : bool, default False

If True, perform the operation in place.

Returns
Series

Series with the values outside the clip boundaries replaced by the boundary values.

Notes

One difference between this implementation and pandas is how incompatible types are handled: pd.Series(['a', 'b']).clip(0, 1) raises "TypeError: '<=' not supported between instances of 'str' and 'int'", whereas ps.Series(['a', 'b']).clip(0, 1) returns the original Series unchanged, silently ignoring the incompatible types.
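The clipping rule and the type handling described above can be modelled per element in plain Python. This is only a sketch under stated assumptions: `clip_value` is a hypothetical helper written for illustration, not part of pyspark or pandas, and it assumes that "incompatible" means "not an int or float".

```python
def clip_value(value, lower=None, upper=None):
    """Hypothetical per-element model of clipping (not the pyspark implementation)."""
    # Assumption: values that are not numbers are passed through unchanged,
    # mirroring the pandas-on-Spark behaviour described in the note.
    if not isinstance(value, (int, float)):
        return value
    # A bound of None means "no threshold on that side".
    if lower is not None and value < lower:
        return lower
    if upper is not None and value > upper:
        return upper
    return value


values = [0, 2, 4]
clipped = [clip_value(v, 1, 3) for v in values]
lower_only = [clip_value(v, lower=1) for v in values]
strings = [clip_value(v, 0, 1) for v in ["a", "b"]]

print(clipped)     # [1, 2, 3]
print(lower_only)  # [1, 2, 4]
print(strings)     # ['a', 'b']
```

In this model, passing only one bound leaves the other side of the range open, and string values come back untouched instead of raising a TypeError.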

Examples

>>> psser = ps.Series([0, 2, 4])
>>> psser
0    0
1    2
2    4
dtype: int64
>>> psser.clip(1, 3)
0    1
1    2
2    3
dtype: int64

Clip can be performed in-place.

>>> psser.clip(2, 3, inplace=True)
>>> psser
0    2
1    2
2    3
dtype: int64