Examples and Resources
Here you can find additional resources for learning about Modin. For more advanced usage of Modin, please refer to this section.
The following notebooks demonstrate how Modin can be used for scalable data science:
The following tutorials cover the basic usage of Modin. Here is a one-hour video tutorial that walks through these basic exercises.
The following tutorials cover more advanced features in Modin:
Exercise 5: Setting up Modin in a Cluster Environment [Source PandasOnRay]
Exercise 6: Running Modin in a Cluster Environment [Source PandasOnRay]
For instructions on installing the dependencies required by the tutorial notebooks and on running them, please refer to the respective README.md file.
Talks & Podcasts
Scaling Interactive Data Science with Modin and Ray (20 minute, Ray Summit 2021)
Unleash The Power Of Dataframes At Any Scale With Modin (40 minute, Python Podcast 2021)
[Russian] Distributed Data Processing and XGBoost Training and Prediction with Modin (30 minute, PyCon Russia 2021)
[Russian] Efficient Data Science with Modin (30 minute, ISP RAS Open 2021)
Modin: Scaling the Capabilities of the Data Scientist, not the Machine (1 hour, RISE Camp 2020)
Modin: Pandas Scalability with Devin Petersohn (1 hour, Software Engineering Daily Podcast 2020)
Introduction to the DataFrame and Modin (20 minute, RISE Camp 2019)
Scaling Interactive Pandas Workflows with Modin (40 minute, PyData NYC 2018)
Here are some blog posts and articles about Modin:
Here are some articles contributed by the international community:
If you would like your articles to be featured here, please submit a pull request to let us know!