Skip to content

Latest commit

 

History

History
53 lines (35 loc) · 1.7 KB

CONTRIBUTING.md

File metadata and controls

53 lines (35 loc) · 1.7 KB

Contributing Code

Categorical encoding is a python library in the scikit-learn-contrib family for encoding categorical variables as numeric. Our goals are to:

  • Provide a variety of different techniques for encoding
  • Provide a common, familiar API to all of these encoders
  • Support both numpy arrays and pandas dataframes equally
  • Be an all around nice project to use and contribute to

With that in mind, we welcome and in fact would love some help.

How to Contribute

The preferred workflow to contribute to git-pandas is:

  1. Fork this repository into your own github account.

  2. Clone the fork on your account onto your local disk:

    $ git clone git@github.com:YourLogin/category_encoders.git $ cd category_encoders

  3. Create a branch for your new awesome feature, do not work in the master branch:

    $ git checkout -b new-awesome-feature

  4. Write some code, or docs, or tests.

  5. When you are done, submit a pull request.

Guidelines

This is still a very young project, but we do have a few guiding principles:

  1. Maintain semantics of the scikit-learn API
  2. Write detailed docstrings in numpy format
  3. Support pandas dataframes and numpy arrays as inputs
  4. Write tests

Running Tests

To run the tests, use:

$ pytest

Easy Issues / Getting Started

There are usually some issues in the project github page looking for contributors, if not you're welcome to propose some ideas there, or a great first step is often to just use the library, and add to the examples directory. This helps us with documentation, and often helps to find things that would make the library better to use.