pd.DataFrame supported APIs
The following table lists both implemented and not implemented methods. If you need an operation that is listed as not implemented, feel free to open an issue on the GitHub repository or give a thumbs up to an existing issue. Contributions are also welcome!
The table is structured as follows. The first column contains the method name. The second column contains a link to the description of the corresponding pandas method. The third column is a flag for whether or not Modin implements the method in the first column: Y stands for yes, N stands for no, P stands for partial (meaning some parameters may not be supported yet), and D stands for default to pandas.
Note
Currently, the third column reflects the implementation status for the Ray and Dask engines. By default, a method's support in the Omnisci engine should be treated as D unless the Notes column contains additional information. Similarly, the Notes column describes the Ray and Dask engines unless Omnisci is explicitly mentioned.
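The rule in the note above can be sketched in a few lines of Python. This is only an illustration of how to read one table row, not part of Modin: the names `Support` and `status_for_engine`, and the `omnisci_override` parameter (a stand-in for whatever an "Omnisci:" entry in the Notes column states), are assumptions made for this example.

```python
from enum import Enum
from typing import Optional


class Support(Enum):
    """Status flags used in the third column of the table."""
    YES = "Y"
    NO = "N"
    PARTIAL = "P"
    DEFAULT_TO_PANDAS = "D"


def status_for_engine(
    column_flag: str,
    engine: str,
    omnisci_override: Optional[str] = None,
) -> Support:
    """Resolve the status of one table row for a given engine.

    column_flag: flag from the third column (applies to Ray and Dask).
    omnisci_override: hypothetical flag taken from an "Omnisci:" note;
        None means the Notes column says nothing about Omnisci.
    """
    if engine in ("Ray", "Dask"):
        return Support(column_flag)
    if engine == "Omnisci":
        # Without an explicit note, treat the method as defaulting to pandas.
        if omnisci_override is None:
            return Support.DEFAULT_TO_PANDAS
        return Support(omnisci_override)
    raise ValueError(f"unknown engine: {engine!r}")


# A row flagged Y with an empty Notes column:
print(status_for_engine("Y", "Ray").name)      # YES
print(status_for_engine("Y", "Omnisci").name)  # DEFAULT_TO_PANDAS
```

Read this way, a Y in the third column only guarantees a native implementation on Ray and Dask; for Omnisci, check the Notes column first.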
DataFrame method | pandas doc link | Implemented? (Y/N/P/D) | Notes for current implementation |
|
Y |
||
|
Y |
||
|
Y |
Ray and Dask: Shuffles data in operations
between DataFrames.
Omnisci: |
|
|
Y |
||
|
Y |
||
|
P |
|
|
|
D |
||
|
Y |
||
|
Y |
||
|
Y |
Omnisci: |
|
|
Y |
See |
|
|
Y |
||
|
D |
||
|
Y |
||
|
Y |
||
|
Y |
Omnisci: |
|
|
Y |
||
|
Y |
||
|
Y |
||
|
Y |
||
|
Y |
||
|
Y |
||
|
D |
||
|
Y |
||
|
Y |
||
|
Y |
||
|
Y |
||
|
Y |
||
|
Y |
Correlation floating-point precision may differ slightly from pandas. Currently only the pearson method is implemented; other methods default to pandas |
|
|
D |
||
|
Y |
Omnisci: |
|
|
Y |
Covariance floating-point precision may differ slightly from pandas |
|
|
Y |
||
|
Y |
||
|
Y |
||
|
Y |
||
|
Y |
||
|
Y |
||
|
Y |
See |
|
|
Y |
See |
|
|
Y |
||
|
Y |
Omnisci: |
|
|
Y |
||
|
D |
||
|
Y |
Omnisci: |
|
|
Y |
Omnisci: |
|
|
Y |
||
|
Y |
||
|
Y |
See |
|
|
Y |
Requires a shuffle; can be further optimized |
|
|
Y |
||
|
D |
||
|
D |
||
|
Y |
||
|
Y |
||
|
P |
|
|
|
Y |
||
|
Y |
||
|
Y |
||
|
Y |
See |
|
|
D |
||
|
D |
||
|
Y |
See |
|
|
Y |
||
|
Y |
Not yet optimized for all operations.
Omnisci: |
|
|
Y |
See |
|
|
Y |
||
|
D |
||
|
Y |
||
|
Y |
||
|
Y |
||
|
Y |
Omnisci: |
|
|
D |
||
|
Y |
||
|
Y |
||
|
D |
||
|
Y |
||
|
Y |
||
|
Y |
||
|
Y |
||
|
P |
Modin does not parallelize iteration in Python |
|
|
P |
Modin does not parallelize iteration in Python |
|
|
P |
Modin does not parallelize iteration in Python |
|
|
P |
When |
|
|
Y |
||
|
Y |
||
|
Y |
||
|
Y |
||
|
Y |
||
|
Y |
See |
|
|
P |
Boolean arrays and callables are not supported.
Omnisci: |
|
|
D |
||
|
Y |
See |
|
|
Y |
||
|
D |
||
|
Y |
Omnisci: |
|
|
P |
Modin defaults to pandas if given the |
|
|
P |
Modin defaults to pandas if given the |
|
|
Y |
||
|
Y |
||
|
P |
Implemented the following cases:
|
|
|
Y |
Omnisci: |
|
|
Y |
See |
|
|
Y |
||
|
Y |
See |
|
|
Y |
See |
|
|
Y |
||
|
Y |
See |
|
|
Y |
||
|
Y |
||
|
Y |
||
|
Y |
||
|
Y |
Omnisci: |
|
|
D |
||
|
Y |
||
|
Y |
||
|
Y |
||
|
D |
||
|
Y |
||
|
Y |
See |
|
|
Y |
||
|
Y |
||
|
Y |
||
|
P |
Local variables not yet supported |
|
|
Y |
See |
|
|
Y |
||
|
Y |
See |
|
|
Y |
Shuffles data |
|
|
D |
||
|
Y |
||
|
Y |
||
|
Y |
||
|
Y |
||
|
Y |
||
|
Y |
Omnisci: |
|
|
Y |
See |
|
|
Y |
See |
|
|
Y |
See |
|
|
Y |
||
|
Y |
||
|
Y |
See |
|
|
Y |
See |
|
|
Y |
See |
|
|
Y |
||
|
Y |
||
|
P |
Modin defaults to pandas if given the |
|
|
Y |
||
|
Y |
||
|
Y |
Omnisci: |
|
|
Y |
||
|
Y |
||
|
P |
Modin defaults to pandas if given the |
|
|
Y |
||
|
Y |
||
|
Y |
Shuffles data; Omnisci: |
|
|
N |
||
|
Y |
||
|
Y |
||
|
P |
Modin defaults to pandas if given the |
|
|
D |
||
|
Y |
See |
|
|
Y |
See |
|
|
Y |
Omnisci: |
|
|
Y |
||
|
Y |
||
|
Y |
||
|
Y |
||
|
D |
||
|
Y |
||
|
D |
||
|
D |
||
|
D |
||
|
D |
||
|
D |
||
|
D |
||
|
D |
||
|
D |
||
|
D |
||
|
D |
||
|
D |
Experimental implementation: to_pickle_distributed |
|
|
D |
||
|
Y |
||
|
D |
||
|
D |
||
|
D |
||
|
D |
||
|
Y |
||
|
Y |
||
|
Y |
See |
|
|
Y |
||
|
Y |
||
|
Y |
||
|
Y |
||
|
Y |
||
|
Y |
||
|
Y |
||
|
D |
||
|
P |
Modin defaults to pandas if given the |
|
|
Y |