PyarrowQueryCompiler#

PyarrowQueryCompiler is responsible for compiling efficient Dataframe algebra queries for the PyarrowOnRayDataframe, the frames which are backed by pyarrow.Table objects.

Each PyarrowQueryCompiler contains an instance of PyarrowOnRayDataframe which it queries to get the result.

Public API#

PyarrowQueryCompiler implements common query compilers API defined by the BaseQueryCompiler. Most functionalities are inherited from PandasQueryCompiler, in the following section only overridden methods are presented.

class modin.core.storage_formats.pyarrow.query_compiler.PyarrowQueryCompiler(modin_frame)#

Bases: modin.core.storage_formats.pandas.query_compiler.PandasQueryCompiler

Query compiler for the PyArrow storage format.

This class translates common query compiler API into the DataFrame Algebra queries, that is supposed to be executed by PyarrowOnRayDataframe.

Parameters

modin_frame (PyarrowOnRayDataframe) – Modin Frame to query with the compiled queries.

property dtypes#

Get columns dtypes.

Returns

Series with dtypes of each column.

Return type

pandas.Series

query(expr, **kwargs)#

Query columns of the QueryCompiler with a boolean expression.

Parameters
  • expr (str) –

  • **kwargs (dict) –

Returns

New QueryCompiler containing the rows where the boolean expression is satisfied.

Return type

BaseQueryCompiler

Notes

Please refer to modin.pandas.DataFrame.query for more information about parameters and output format.