Modin Usage Examples#
This section collects Modin usage examples for different scenarios: running Modin on a local or remote cluster, running Modin in the cloud, and using the Modin spreadsheet API.
Tutorials#
The following tutorials cover the basic usage of Modin. A one-hour video tutorial also walks through these basic exercises.
Exercise 1: Introduction to Modin [Source PandasOnRay, Source PandasOnDask, Source HdkOnNative]
Exercise 2: Speed Improvements with Modin [Source PandasOnRay, Source PandasOnDask, Source HdkOnNative]
Exercise 3: Defaulting to pandas with Modin [Source PandasOnRay, Source PandasOnDask, Source HdkOnNative]
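To give a flavor of what the introductory exercises start from, here is a minimal sketch of the drop-in import that Modin is built around. The column names and values are made up for illustration and are not taken from the notebooks. The fallback to plain pandas is an assumption added so the snippet still runs where Modin is not installed.

```python
# Minimal sketch: Modin is used by swapping the pandas import.
# The fallback branch is an illustrative assumption, not part of the tutorials.
try:
    import modin.pandas as pd  # same API surface as pandas for common operations
except ImportError:
    import pandas as pd  # plain pandas if Modin is unavailable

# Hypothetical toy data; the tutorials use their own datasets.
df = pd.DataFrame({"trip": [1, 2, 3, 4], "fare": [5.0, 7.5, 6.0, 9.5]})

# Ordinary pandas-style operations work unchanged.
total_fare = float(df["fare"].sum())
long_trips = df[df["fare"] > 6.0]

print(total_fare)
print(len(long_trips))
```

The rest of the code stays as it would be written for pandas. The exercises above go on to cover where Modin speeds things up and where it defaults to pandas.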
The following tutorials cover more advanced features in Modin:
Exercise 4: Experimental Features in Modin (Spreadsheet, Progress Bar) [Source PandasOnRay, Source PandasOnDask]
Exercise 5: Setting up Modin in a Cluster Environment [Source PandasOnRay]
Exercise 6: Running Modin in a Cluster Environment [Source PandasOnRay]
For instructions on installing the required dependencies and running the tutorial notebooks, refer to the respective README.md file.
Data Science Benchmarks#
Using Modin with the NYC Taxi Dataset [Source]
Using Modin with the Census Dataset (coming soon…)
Using Modin with the Plasticc Dataset (coming soon…)
Modin in the Cloud#
Using Experimental Modin in the cloud with the NYC Taxi Dataset on an AWS cluster [Source]
Modin Spreadsheets#
Using Modin along with the Spreadsheets API [Source]
Modin with scikit-learn#
Modin for Machine Learning with scikit-learn [Source]