You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The purpose of this issue is to have a single place for all issues good for new users/projects we are working on/etc. It is also intended to give prospective contributors an idea of what would be highest impact to work on.
Overview
If you are interested in contributing to Modin, welcome! If you are interested in contributing something that isn't here, let us know and we'll add it! A good place to start looking for issues to contribute to is in the Good First Issue 🔰 or Help Wanted 🌐 tagged issues.
Table of Contents
1.) APIs 2.) Experimental Features 3.) Query Processing/Planning 4.) System Level Enhancements
1.) APIs
New APIs
The following APIs are highest priority in Modin (in no particular order). This is based on the number of people requesting it and the number of people using it from our study of usage. If you would like to contribute a particular API, please comment on the related issue:
Modin Potential Projects
The purpose of this issue is to have a single place for all issues good for new users/projects we are working on/etc. It is also intended to give prospective contributors an idea of what would be highest impact to work on.
Overview
If you are interested in contributing to Modin, welcome! If you are interested in contributing something that isn't here, let us know and we'll add it! A good place to start looking for issues to contribute to is in the Good First Issue 🔰 or Help Wanted 🌐 tagged issues.
Table of Contents
1.) APIs
2.) Experimental Features
3.) Query Processing/Planning
4.) System Level Enhancements
1.) APIs
New APIs
The following APIs are highest priority in Modin (in no particular order). This is based on the number of people requesting it and the number of people using it from our study of usage. If you would like to contribute a particular API, please comment on the related issue:
pivot_table
: pivot_table function implementation #1200Added in Add support for grouping by multiple columns when doing a reduction #987groupby
with list of columns: NotImplementedError: Groupby with lists of columns not yet supported. #75parallelAdded in Add parallel read_excel feature #1648read_excel
: Implementation for parallelread_excel
#467read_pickle
: Request to implement read_pickle function #529parallelAdded in Preliminary read_json implementation #715read_json
: Request to implement read_json function #554CompressedAdded in reading gzipped csv files #682read_csv
: Feature Request: read_csv on compressed files #630Added in support for duplicated() and drop_duplicates() #892drop_duplicates
: New Feature Request: Add support for drop_duplicates() #634Added in Addpd.value_counts
/Series.value_counts
: New method request:Series.value_counts
andpd.value_counts
#653value_counts
implementation forSeries
and as free function #1535Added in Add support for text manipulation operations #713Series.str
: New method Request:Series.str
#654Added in Add implementation of read_fwf #1448read_fwf
: Request to implement read_fwf #752Improve performance of existing methods
The following APIs need some improvement. These are typically issues requiring deeper knowledge of Modin internals.
Added in Improve performance of Groupby #659groupby
performance: Enhancement:groupby
performance #655sort_values
performance: Improve sort_values #158merge
/join
performance: FEAT: Support right and outer types of join formerge
/join
#656Added in Enhancement:insert
performance: Enhancement: Rewriting insert to use apply_func_to_select_indices instead of apply_func_to_select_indices_along_full_axis #670insert
and__setitem__
performance #6962.) Experimental Features
The following features will either augment the pandas API by providing hints to the system or add some new feature that doesn't exist in pandas.
read_sql
parallelism parameter: read_sql() parallelism parameter #4553.) Query Processing/Planning
More here later...
4.) System Level Enhancements
Add Windows support: Windows support #657Added in Add Dask futures implementation #732Ray autoscaler support: Support for Ray autoscaler #660Added in Initial ray autoscaler support #661The text was updated successfully, but these errors were encountered: