fixing access to mml spark repo access from databricks_install in mml… #714
Conversation
…spark_lightgbm_criteo notebook
```diff
@@ -80,7 +80,7 @@
     " # get the maven coordinates for MML Spark from databricks_install script\n",
     " from scripts.databricks_install import MMLSPARK_INFO\n",
     " packages = [MMLSPARK_INFO[\"maven\"][\"coordinates\"]]\n",
-    " repo = MMLSPARK_INFO[\"maven\"][\"repo\"]\n",
+    " repo = MMLSPARK_INFO[\"maven\"].get(\"repositories\", None)\n",
```
Good catch. However, we have to use `.get("repo", None)`; if not, the Spark function will break if we use something similar to what we had originally, where there were only coordinates.
I wanted to force an error in that case; should we carry forward if they get out of sync?
Also, fwiw, if you use `dict.get()`, `None` is automatically provided as the default. reference
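A quick sketch of that point (not from the PR itself): omitting the second argument to `dict.get()` is equivalent to passing `None` explicitly.

```python
# A dictionary shaped like the old MMLSPARK_INFO["maven"] entry (only coordinates)
d = {"coordinates": "Azure:mmlspark:0.16"}

print(d.get("repositories"))        # None (implicit default for a missing key)
print(d.get("repositories", None))  # None (explicit default, equivalent)
```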
> if you use dict.get(), None is automatically provided as the default.

Yeah, so the idea is that if we are using the old dictionary, which doesn't have repo, the spark loader doesn't fail:

```python
MMLSPARK_INFO = {"maven": {"coordinates": "Azure:mmlspark:0.16"}}
```

I guess that the dictionary we have now is temporary:

```python
MMLSPARK_INFO = {"maven": {"coordinates": "com.microsoft.ml.spark:mmlspark_2.11:0.16.dev8+2.g6a5318b",
                           "repositories": "https://mmlspark.azureedge.net/maven"}
                 }
```
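To illustrate the difference between the two dictionary shapes (a minimal sketch; `get_repo` is a hypothetical helper name, and the dictionary values are quoted from this thread):

```python
# Old shape: only maven coordinates, no repository entry
old_info = {"maven": {"coordinates": "Azure:mmlspark:0.16"}}

# New (temporary) shape: coordinates plus an explicit repository
new_info = {
    "maven": {
        "coordinates": "com.microsoft.ml.spark:mmlspark_2.11:0.16.dev8+2.g6a5318b",
        "repositories": "https://mmlspark.azureedge.net/maven",
    }
}

def get_repo(info):
    # .get() returns None when "repositories" is absent, instead of raising KeyError
    return info["maven"].get("repositories", None)

print(get_repo(old_info))  # None
print(get_repo(new_info))  # https://mmlspark.azureedge.net/maven

# Direct indexing, by contrast, fails on the old shape:
try:
    old_info["maven"]["repositories"]
except KeyError:
    print("KeyError on the old dictionary shape")
```

This is why the `.get()` form keeps the notebook working even if the repository entry is dropped from the dictionary later.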
> i wanted to force an error in that case

I don't quite understand what you are thinking; can you elaborate?
I see, so you're just saying we shouldn't bother updating this notebook if we drop the repo piece in the future (which we're planning to do). I'm fine with that.
It happened again that the tests haven't triggered; this is weird.
I pushed several PRs in close succession. I wonder if there's a limit on the queue of pending tests, or some kind of time limit before the test status is no longer accessible?
change to get repo to limit changes when maven coordinates change
Description
Fixing an issue pulling in the maven repository link in the mmlspark lightgbm notebook.
I'm still seeing issues locally on Windows, but we should try this fix on a Windows DSVM, because my environment is probably not very clean.
Related Issues
#709
Checklist: