Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update dataset preview page #309

Open
hjwilli opened this issue Jan 22, 2021 · 1 comment
Open

Update dataset preview page #309

hjwilli opened this issue Jan 22, 2021 · 1 comment
Assignees

Comments

@hjwilli
Copy link
Collaborator

hjwilli commented Jan 22, 2021

Update the dataset preview page with more data analysis plots.

(Notes from our initial pass at this are in issue #209, in particular some ui ideas are here #209 (comment) )

@nupurbaghel
Copy link
Collaborator

All changes can be seen on the data-preview branch.

A list of the new components / changes to existing components is as follows:

  • DatasetTabbed : Contains 3 tabs.

    1. The Preview tab contains the original data preview content (Dataset component)
    2. The Basic Stats tab contains newly added Plotly graphs (DataStats component)
    3. The Sample tab contains sample plots using custom test data (code can be seen in the parent component i.e. DatasetTabbed component)
  • DataStats component :

    1. Modifies data from row-wise format received from Papa parse to column-wise format (currently top 1000 rows are only selected for the graphs)
    2. Creates a list of categorical, ordinal and numerical features (slices to select first 20 from each category)
    3. Currently it only makes use of PlotlyBarPlot and PlotlyBoxPlot components.
    4. For future scope, we can separate data into multiple parts for each class and make use of PlotlyStackedBarPlot
  • PlotlyBarPlot : uses histogram count method to generate histograms, can be updated to any other custom function (see example usage in sample tab)

  • PlotlyBoxPlot : customized to stack one or more box plots (see example usage in sample tab)

  • PlotlyStackedBarPlot : stacked histogram plot (see example usage in sample tab)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants