PandasOnRayFrameAxisPartition

This class is the specific implementation of PandasFrameAxisPartition, providing the API to perform operations on an axis partition, using Ray as an execution engine. The axis partition is a wrapper over a list of block partitions that are stored in this class.

Public API

class modin.engines.ray.pandas_on_ray.frame.axis_partition.PandasOnRayFrameAxisPartition(list_of_blocks, get_ip=False)

The class implements the interface in PandasFrameAxisPartition.

Parameters
  • list_of_blocks (list) – List of PandasOnRayFramePartition objects.

  • get_ip (bool, default: False) – Whether to get node IP addresses to conforming partitions or not.

classmethod deploy_axis_func(axis, func, num_splits, kwargs, maintain_partitioning, *partitions)

Deploy a function along a full axis.

Parameters
  • axis ({0, 1}) – The axis to perform the function along.

  • func (callable) – The function to perform.

  • num_splits (int) – The number of splits to return (see split_result_of_axis_func_pandas).

  • kwargs (dict) – Additional keywords arguments to be passed in func.

  • maintain_partitioning (bool) – If True, keep the old partitioning if possible. If False, create a new partition layout.

  • *partitions (iterable) – All partitions that make up the full axis (row or column).

Returns

A list of pandas.DataFrame-s.

Return type

list

classmethod deploy_func_between_two_axis_partitions(axis, func, num_splits, len_of_left, other_shape, kwargs, *partitions)

Deploy a function along a full axis between two data sets.

Parameters
  • axis ({0, 1}) – The axis to perform the function along.

  • func (callable) – The function to perform.

  • num_splits (int) – The number of splits to return (see split_result_of_axis_func_pandas).

  • len_of_left (int) – The number of values in partitions that belong to the left data set.

  • other_shape (np.ndarray) – The shape of right frame in terms of partitions, i.e. (other_shape[i-1], other_shape[i]) will indicate slice to restore i-1 axis partition.

  • kwargs (dict) – Additional keywords arguments to be passed in func.

  • *partitions (iterable) – All partitions that make up the full axis (row or column) for both data sets.

Returns

A list of pandas.DataFrame-s.

Return type

list

instance_type

alias of ray._raylet.ObjectRef

partition_type

alias of modin.engines.ray.pandas_on_ray.frame.partition.PandasOnRayFramePartition

PandasOnRayFrameColumnPartition

Public API

class modin.engines.ray.pandas_on_ray.frame.axis_partition.PandasOnRayFrameColumnPartition(list_of_blocks, get_ip=False)

The column partition implementation.

All of the implementation for this class is in the parent class, and this class defines the axis to perform the computation over.

Parameters
  • list_of_blocks (list) – List of PandasOnRayFramePartition objects.

  • get_ip (bool, default: False) – Whether to get node IP addresses to conforming partitions or not.

PandasOnRayFrameRowPartition

Public API

class modin.engines.ray.pandas_on_ray.frame.axis_partition.PandasOnRayFrameRowPartition(list_of_blocks, get_ip=False)

The row partition implementation.

All of the implementation for this class is in the parent class, and this class defines the axis to perform the computation over.

Parameters
  • list_of_blocks (list) – List of PandasOnRayFramePartition objects.

  • get_ip (bool, default: False) – Whether to get node IP addresses to conforming partitions or not.