HdkOnNativeDataframePartition#

Public API#

class modin.experimental.core.execution.native.implementations.hdk_on_native.partitioning.partition.HdkOnNativeDataframePartition(data: Union[DbTable, Table, DataFrame])#

A partition of HdkOnNativeDataframe frame.

Class holds either a DbTable or pandas.DataFrame or pyarrow.Table.

Parameters:

data (DbTable or pandas.DataFrame or pyarrow.Table) – Partition data in either pandas or PyArrow format.

_data#

Partition data in either pandas or PyArrow format.

Type:

DbTable or pandas.DataFrame or pyarrow.Table

_length_cache#

Length of the partition.

Type:

int

_width_cache#

Width of the partition.

Type:

int

get(to_arrow: bool = False) Union[DbTable, DataFrame, Table]#

Get partition data.

Parameters:

to_arrow (bool, default: False) – Convert the data to pyarrow.Table.

Return type:

DbTable or pandas.DataFrame or pyarrow.Table

insert(idx: int, name: str, value: Union[ExtensionArray, ndarray, Index, Series])#

Insert column into this raw partition.

Parameters:
  • idx (int) –

  • name (str) –

  • value (AnyArrayLike) –

Return type:

tuple of HdkOnNativeDataframePartition, dtype

classmethod put(obj)#

Create partition from DbTable or pandas.DataFrame or pyarrow.Table.

Parameters:

obj (DbTable or pandas.DataFrame or pyarrow.Table) – Source frame.

Returns:

The new partition.

Return type:

HdkOnNativeDataframePartition

property raw#

True if the partition contains a raw data.

The raw data is either pandas.DataFrame or pyarrow.Table.

Return type:

bool

to_numpy(**kwargs)#

Transform to NumPy format.

Parameters:

**kwargs (dict) – Additional keyword arguments to be passed in to_numpy.

Return type:

np.ndarray

to_pandas()#

Transform to pandas format.

Return type:

pandas.DataFrame