PyPI #6

Open
wants to merge 8 commits into `master`
3 changes: 3 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
@@ -61,3 +61,6 @@ target/

# vim
*.sw*

# jetbrains
.idea
1 change: 1 addition & 0 deletions MANIFEST.in
@@ -0,0 +1 @@
include README.md
60 changes: 32 additions & 28 deletions README.md
@@ -1,16 +1,13 @@
# FairML: Auditing Black-Box Predictive Models

FairML is a Python toolbox for auditing machine learning models for bias.

[![Build Status](https://travis-ci.org/adebayoj/fairml.svg?branch=master)](https://travis-ci.org/adebayoj/fairml/)
[![Coverage Status](https://coveralls.io/repos/github/adebayoj/fairml/badge.svg?branch=master)](https://coveralls.io/github/adebayoj/fairml?branch=master)
[![GitHub license](https://img.shields.io/badge/license-MIT-blue.svg)](https://github.com/raw/adebayoj/fairml/master/LICENSE)
[![GitHub issues](https://img.shields.io/github/issues/adebayoj/fairml.svg)](https://github.com/adebayoj/fairml/issues)

![Logo](fairml/master/doc/images/logo2-small.png)

### Description

Expand All @@ -20,10 +17,11 @@ employment. Despite societal gains in efficiency and
productivity through deployment of these models, potential
systemic flaws have not been fully addressed, particularly the
potential for unintentional discrimination. This discrimination
could be on the basis of race, gender, religion, sexual orientation,
or other characteristics. This project addresses the question:
how can an analyst determine the relative significance of the inputs
to a black-box predictive model in order to assess the model’s
fairness (or discriminatory extent)?

We present FairML, an end-to-end toolbox for auditing predictive
models by quantifying the relative significance of the model’s
@@ -37,32 +35,37 @@ difficult to interpret.s of black-box algorithms and corresponding input data.

### Installation

#### Recommended

You can install the latest stable version via PyPI:

`pip install fairml`

#### Bleeding Edge

If you are interested in the potentially less stable bleeding-edge version, install directly from GitHub:

- `pip install https://github.com/adebayoj/fairml/archive/master.zip`

#### Development

If you are a developer and prefer to install from a clone:

1. `git clone https://github.com/adebayoj/fairml.git`
2. `sudo python setup.py install`

### Methodology

![Methodology](fairml/doc/images/fairml_methodology_picture.png)
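To make the methodology concrete: FairML quantifies how much a black-box model depends on each input by perturbing that input and observing the change in output (the full method described in the paper uses orthogonal transformations of each attribute). The sketch below is a simplified, hedged illustration of the same underlying idea using plain column permutation instead; the toy scorer, weights, and data are all made up and are not part of FairML itself.

```python
import numpy as np

# Toy "black-box": a fixed linear scorer whose internals we pretend not to see.
# Feature 2 has zero weight, so a good audit should report zero dependence on it.
rng = np.random.default_rng(0)
X = rng.normal(size=(500, 3))
true_w = np.array([2.0, 0.5, 0.0])


def predict(M):
    return M @ true_w


def dependence(predict, X, number_of_runs=10, seed=1):
    """Mean absolute change in model output when one column is permuted.

    A permutation-style stand-in for FairML's perturbation idea:
    scramble one input at a time and measure how much the output moves.
    """
    rng = np.random.default_rng(seed)
    base = predict(X)
    scores = {}
    for j in range(X.shape[1]):
        deltas = []
        for _ in range(number_of_runs):
            Xp = X.copy()
            Xp[:, j] = rng.permutation(Xp[:, j])
            deltas.append(float(np.mean(np.abs(predict(Xp) - base))))
        scores[j] = float(np.mean(deltas))
    return scores


scores = dependence(predict, X)
```

With this toy model, the scores rank the features in the order of their true weights, and the irrelevant feature gets a dependence of zero.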

### Code Demo

Now we show how to use the fairml python package to audit
a black-box model.

```python
"""
First we import modules for model building and data
processing.
"""
# First we import modules for model building and data processing.

import pandas as pd
import numpy as np
from sklearn.linear_model import LogisticRegression
Expand All @@ -78,14 +81,16 @@ audit_model takes:
- number_of_runs : number of iterations to perform
- interactions : flag to enable checking model dependence on interactions.

audit_model returns an overloaded dictionary where keys are the column names of the input pandas dataframe and values are
lists containing the model's dependence on that particular feature. These lists are of size number_of_runs.

"""
from fairml import audit_model
from fairml import plot_generic_dependence_dictionary
```
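To clarify the return structure described above (a dictionary mapping each input column to a list of per-run dependence values), here is a small self-contained sketch that simulates such an output and summarizes it; the feature names and numbers are invented for illustration and do not come from an actual fairml run.

```python
from statistics import median

number_of_runs = 5

# Simulated audit output: one list of dependence scores per input column,
# with one entry per run (values are made up for illustration).
audit_output = {
    "age":  [0.31, 0.29, 0.33, 0.30, 0.32],
    "race": [0.62, 0.60, 0.61, 0.63, 0.59],
}

# Every feature's list has exactly number_of_runs entries.
assert all(len(v) == number_of_runs for v in audit_output.values())

# A common way to summarize per-feature dependence: the median across runs.
summary = {k: median(v) for k, v in audit_output.items()}
```

The per-run lists let you inspect the variability of the dependence estimate, not just a point value.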

Above, we provide a quick explanation of the key fairml functionality. Now we move on to building an example model that
we'd like to audit.

```python
# read in the propublica data to be used for our analysis.
@@ -130,12 +135,11 @@ plt.savefig("fairml_ldp.eps", transparent=False, bbox_inches='tight')

The demo above produces the figure below.

![Example Output](fairml/master/doc/images/feature_dependence_plot_fairml_propublica_linear_direct_small.png)


Feel free to email the authors with any questions:
[Julius Adebayo](https://github.com/adebayoj) ([julius.adebayo@gmail.com](mailto:julius.adebayo@gmail.com))

### Data

Binary file added doc/images/logo2-small.png
37 changes: 35 additions & 2 deletions setup.py
@@ -1,10 +1,33 @@
from setuptools import setup


def convert_to_rst(filename):
    """
    Nice Markdown-to-.rst hack; PyPI needs .rst.

    Uses pandoc to convert the README.md.

    From https://coderwall.com/p/qawuyq/use-markdown-readme-s-in-python-modules
    """
    try:
        import pypandoc
        long_description = pypandoc.convert(filename, 'rst')
        long_description = long_description.replace("\r", "")
    except (ImportError, OSError):
        print("Pandoc not found. Long_description conversion failure.")
        import io
        # pandoc is not installed; fall back to using the raw contents.
        with io.open(filename, encoding="utf-8") as f:
            long_description = f.read()

    return long_description

setup(
    name='fairml',
    version='0.1.1.5.rc08',
    description=("Module for measuring feature dependence"
                 " for black-box models"),
    long_description=convert_to_rst('README.md'),
    url='https://github.com/adebayoj/fairml',
    author='Julius Adebayo',
    author_email='julius.adebayo@gmail.com',
@@ -18,5 +41,15 @@
        'seaborn>=0.7.1',
        'pandas>=0.19.0'],
    include_package_data=True,
    zip_safe=False,
    classifiers=[
        'Intended Audience :: Science/Research',
        'Intended Audience :: Developers',
        'License :: OSI Approved :: MIT License',
        'Natural Language :: English',
        'Operating System :: OS Independent',
        'Programming Language :: Python',
        'Topic :: Scientific/Engineering :: Information Analysis',
        'Topic :: Software Development :: Libraries :: Python Modules',
    ],
)
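The `convert_to_rst` fallback pattern above (use pandoc when available, otherwise ship the raw Markdown) can be exercised standalone. The sketch below mirrors that pattern but uses pypandoc's `convert_file` API rather than the older `convert`, and falls back on any failure, since pypandoc may be installed without the pandoc binary; the file name and contents here are made up for the demonstration.

```python
import io
import os
import tempfile


def long_description(path):
    """Return a README as rst when pandoc is available, raw Markdown otherwise."""
    try:
        import pypandoc  # optional; also requires the pandoc binary
        return pypandoc.convert_file(path, "rst").replace("\r", "")
    except Exception:
        # pypandoc or pandoc unavailable: fall back to the raw Markdown.
        with io.open(path, encoding="utf-8") as f:
            return f.read()


# Exercise the function on a throwaway README.
with tempfile.TemporaryDirectory() as d:
    readme = os.path.join(d, "README.md")
    with io.open(readme, "w", encoding="utf-8") as f:
        f.write("# demo\n\nhello world\n")
    text = long_description(readme)
```

Either path preserves the body text, so `setup()` always gets a usable `long_description`.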