PandasOnDaskDataframePartitionManager
This class is the implementation of PandasDataframePartitionManager
that uses Dask as the execution engine. It is responsible for manipulating partitions and for applying a function to
block, row, or column partitions.
Public API
- class modin.core.execution.dask.implementations.pandas_on_dask.partitioning.partition_manager.PandasOnDaskDataframePartitionManager
The class implements the interface in PandasDataframePartitionManager.
- classmethod broadcast_apply(axis, apply_func, left, right, other_name='r')
Broadcast the right partitions to the left partitions and apply the apply_func function.
- Parameters
axis ({0, 1}) – Axis to apply and broadcast over.
apply_func (callable) – Function to apply.
left (np.ndarray) – NumPy array of left partitions.
right (np.ndarray) – NumPy array of right partitions.
other_name (str, default: "r") – Name of the keyword argument through which right is passed to apply_func.
- Returns
NumPy array of result partition objects.
- Return type
np.ndarray
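The broadcast-and-apply idea can be illustrated with a minimal conceptual sketch. It uses plain pandas DataFrames in an object-dtype NumPy array in place of Dask-backed partition objects, and it is not Modin's implementation. The helper names `broadcast_apply_sketch` and `add_right` are hypothetical, and the pairing rule (each left block receives the combined right blocks of its own row for axis 0, or of its own column for axis 1) is an assumption made for illustration.

```python
import numpy as np
import pandas as pd


def broadcast_apply_sketch(axis, apply_func, left, right, other_name="r"):
    """Hypothetical stand-in for broadcast_apply on grids of pandas DataFrames."""
    result = np.empty(left.shape, dtype=object)
    for i in range(left.shape[0]):
        for j in range(left.shape[1]):
            if axis == 0:
                # Assumed pairing: join the right blocks of row i side by side.
                other = pd.concat(list(right[i, :]), axis=1)
            else:
                # Assumed pairing: stack the right blocks of column j vertically.
                other = pd.concat(list(right[:, j]), axis=0)
            # The right-hand data is passed under the keyword named by other_name.
            result[i, j] = apply_func(left[i, j], **{other_name: other})
    return result


def add_right(df, r):
    # Element-wise sum by position, ignoring the labels of the right block.
    return df + r.to_numpy()


# Two row blocks, one column block each.
left = np.empty((2, 1), dtype=object)
left[0, 0] = pd.DataFrame({"a": [1, 2]})
left[1, 0] = pd.DataFrame({"a": [3, 4]}, index=[2, 3])

right = np.empty((2, 1), dtype=object)
right[0, 0] = pd.DataFrame({"b": [10, 20]})
right[1, 0] = pd.DataFrame({"b": [30, 40]}, index=[2, 3])

out = broadcast_apply_sketch(0, add_right, left, right)
```

Here `out` has the same grid shape as `left`, and each entry holds the result of calling `add_right` on the matching left block with the right data supplied as `r`.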
- classmethod get_indices(axis, partitions, index_func)
Get the internal indices stored in the partitions.
- Parameters
axis ({0, 1}) – Axis to extract the labels over.
partitions (np.ndarray) – The array of partitions from which to extract the labels.
index_func (callable) – The function to be used to extract the indices.
- Returns
A pandas Index object.
- Return type
pandas.Index
Notes
These are the global indices of the object. This is mostly useful when you have deleted rows/columns internally, but do not know which ones were deleted.
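The idea of collecting global labels from partitions can be sketched as follows. This is a conceptual illustration under stated assumptions, not Modin's code: plain pandas DataFrames stand in for partition objects, `get_indices_sketch` is a hypothetical helper, and it assumes that the first column of blocks carries all row labels and the first row of blocks carries all column labels.

```python
import numpy as np
import pandas as pd


def get_indices_sketch(axis, partitions, index_func):
    """Hypothetical stand-in: gather labels along one axis of a block grid."""
    # Assumption: one line of blocks is enough to see every label on that axis.
    blocks = partitions[:, 0] if axis == 0 else partitions[0, :]
    pieces = [index_func(block) for block in blocks]
    # Concatenate the per-block labels into one pandas Index.
    return pieces[0].append(pieces[1:])


# A 2x2 grid in which rows 2 to 4 have been dropped from the second row block.
grid = np.empty((2, 2), dtype=object)
grid[0, 0] = pd.DataFrame({"a": [1, 2], "b": [3, 4]}, index=[0, 1])
grid[0, 1] = pd.DataFrame({"c": [5, 6]}, index=[0, 1])
grid[1, 0] = pd.DataFrame({"a": [7, 8], "b": [9, 10]}, index=[5, 6])
grid[1, 1] = pd.DataFrame({"c": [11, 12]}, index=[5, 6])

row_labels = get_indices_sketch(0, grid, lambda df: df.index)
col_labels = get_indices_sketch(1, grid, lambda df: df.columns)
```

In this sketch `row_labels` reflects only the rows that survive in the blocks, which is the situation the note describes: the labels are recovered from the partitions themselves rather than from a separately tracked list of deletions.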