Update source data storage #3260

Open
wants to merge 1 commit into
base: main
Original file line number Diff line number Diff line change
@@ -0,0 +1,90 @@
+# This code is autogenerated.
+# Code is generated by running custom script: python3 readme.py
+# Any manual changes to this file may cause incorrect behavior.
+# Any manual changes will be overwritten if the code is regenerated.
+
+name: sdk-featurestore_sample-automation-test-test_featurestore_cli_samples
+# This file is created by sdk/python/readme.py.
+# Please do not edit directly.
+on:
+  workflow_dispatch:
+  schedule:
+    - cron: "56 10/12 * * *"
+  pull_request:
+    branches:
+      - main
+    paths:
+      - sdk/python/featurestore_sample/automation-test/**
+      - .github/workflows/sdk-featurestore_sample-automation-test-test_featurestore_cli_samples.yml
+      - sdk/python/dev-requirements.txt
+      - infra/bootstrapping/**
+      - sdk/python/setup.sh
+      - sdk/python/featurestore_sample/**
+permissions:
+  id-token: write
+concurrency:
+  group: ${{ github.workflow }}-${{ github.event.pull_request.number || github.ref }}
+  cancel-in-progress: true
+jobs:
+  build:
+    runs-on: ubuntu-latest
+    steps:
+    - name: check out repo
+      uses: actions/checkout@v2
+    - name: setup python
+      uses: actions/setup-python@v2
+      with:
+        python-version: "3.10"
+    - name: pip install notebook reqs
+      run: pip install -r sdk/python/dev-requirements.txt
+    - name: azure login
+      uses: azure/login@v1
+      with:
+        client-id: ${{ secrets.OIDC_AZURE_CLIENT_ID }}
+        tenant-id: ${{ secrets.OIDC_AZURE_TENANT_ID }}
+        subscription-id: ${{ secrets.OIDC_AZURE_SUBSCRIPTION_ID }}
+    - name: bootstrap resources
+      run: |
+          echo '${{ github.workflow }}-${{ github.event.pull_request.number || github.ref }}';
+          bash bootstrap.sh
+      working-directory: infra/bootstrapping
+      continue-on-error: false
+    - name: setup SDK
+      run: |
+          source "${{ github.workspace }}/infra/bootstrapping/sdk_helpers.sh";
+          source "${{ github.workspace }}/infra/bootstrapping/init_environment.sh";
+          bash setup.sh
+      working-directory: sdk/python
+      continue-on-error: true
+    - name: validate readme
+      run: |
+          python check-readme.py "${{ github.workspace }}/sdk/python/featurestore_sample"
+      working-directory: infra/bootstrapping
+      continue-on-error: false
+    - name: setup-cli
+      run: |
+          source "${{ github.workspace }}/infra/bootstrapping/sdk_helpers.sh";
+          source "${{ github.workspace }}/infra/bootstrapping/init_environment.sh";
+          bash setup.sh
+      working-directory: cli
+      continue-on-error: true
+    - name: setup feature-store resources
+      run: |
+          bash -x automation-test/setup-resources-cli.sh automation-test/test_featurestore_cli_samples.ipynb
+      working-directory: sdk/python/featurestore_sample
+      continue-on-error: true
+    - name: run featurestore_sample/automation-test/test_featurestore_cli_samples.ipynb
+      run: |
+          source "${{ github.workspace }}/infra/bootstrapping/sdk_helpers.sh";
+          source "${{ github.workspace }}/infra/bootstrapping/init_environment.sh";
+          bash "${{ github.workspace }}/infra/bootstrapping/sdk_helpers.sh" generate_workspace_config "../../.azureml/config.json";
+          bash "${{ github.workspace }}/infra/bootstrapping/sdk_helpers.sh" replace_template_values "test_featurestore_cli_samples.ipynb";
+          [ -f "../../.azureml/config" ] && cat "../../.azureml/config";
+          papermill -k python test_featurestore_cli_samples.ipynb test_featurestore_cli_samples.output.ipynb
+      working-directory: sdk/python/featurestore_sample/automation-test
+    - name: upload notebook's working folder as an artifact
+      if: ${{ always() }}
+      uses: actions/upload-artifact@v2
+      with:
+        name: test_featurestore_cli_samples
+        path: sdk/python/featurestore_sample/automation-test
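The `schedule` trigger in the workflow above uses the cron expression `56 10/12 * * *`. As a rough sketch (not part of this PR), the Python below expands such an expression into the times of day at which it fires. It assumes GitHub's documented step syntax, where `start/step` means "from `start`, every `step` units, to the end of the range", and that schedules are evaluated in UTC. The helper names `expand_cron_field` and `daily_run_times` are hypothetical, and the day, month, and weekday fields are ignored, so the result is only meaningful when they are all `*`.

```python
def expand_cron_field(field: str, low: int, high: int) -> list[int]:
    """Expand one cron field ("*", "5", "1-5", "*/15", "10/12") into sorted values."""
    values: set[int] = set()
    for part in field.split(","):
        base, slash, step_text = part.partition("/")
        step = int(step_text) if slash else 1
        if base == "*":
            start, end = low, high
        elif "-" in base:
            start_text, end_text = base.split("-", 1)
            start, end = int(start_text), int(end_text)
        else:
            start = int(base)
            # Assumption: "start/step" runs from start to the end of the range,
            # while a bare number without a step is a single value.
            end = high if slash else start
        if step < 1 or not low <= start <= end <= high:
            raise ValueError(f"invalid cron field: {field!r}")
        values.update(range(start, end + 1, step))
    return sorted(values)


def daily_run_times(cron: str) -> list[str]:
    """Return the HH:MM (UTC) times at which a five-field cron expression fires.

    Only the minute and hour fields are expanded; the other three are ignored.
    """
    minute_field, hour_field, _day, _month, _weekday = cron.split()
    minutes = expand_cron_field(minute_field, 0, 59)
    hours = expand_cron_field(hour_field, 0, 23)
    return [f"{hour:02d}:{minute:02d}" for hour in hours for minute in minutes]


print(daily_run_times("56 10/12 * * *"))  # ['10:56', '22:56']
```

Under these assumptions the workflow above runs twice a day, at 10:56 and 22:56 UTC.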
Original file line number Diff line number Diff line change
@@ -0,0 +1,90 @@
+# This code is autogenerated.
+# Code is generated by running custom script: python3 readme.py
+# Any manual changes to this file may cause incorrect behavior.
+# Any manual changes will be overwritten if the code is regenerated.
+
+name: sdk-featurestore_sample-automation-test-test_featurestore_sdk_samples
+# This file is created by sdk/python/readme.py.
+# Please do not edit directly.
+on:
+  workflow_dispatch:
+  schedule:
+    - cron: "22 2/12 * * *"
+  pull_request:
+    branches:
+      - main
+    paths:
+      - sdk/python/featurestore_sample/automation-test/**
+      - .github/workflows/sdk-featurestore_sample-automation-test-test_featurestore_sdk_samples.yml
+      - sdk/python/dev-requirements.txt
+      - infra/bootstrapping/**
+      - sdk/python/setup.sh
+      - sdk/python/featurestore_sample/**
+permissions:
+  id-token: write
+concurrency:
+  group: ${{ github.workflow }}-${{ github.event.pull_request.number || github.ref }}
+  cancel-in-progress: true
+jobs:
+  build:
+    runs-on: ubuntu-latest
+    steps:
+    - name: check out repo
+      uses: actions/checkout@v2
+    - name: setup python
+      uses: actions/setup-python@v2
+      with:
+        python-version: "3.10"
+    - name: pip install notebook reqs
+      run: pip install -r sdk/python/dev-requirements.txt
+    - name: azure login
+      uses: azure/login@v1
+      with:
+        client-id: ${{ secrets.OIDC_AZURE_CLIENT_ID }}
+        tenant-id: ${{ secrets.OIDC_AZURE_TENANT_ID }}
+        subscription-id: ${{ secrets.OIDC_AZURE_SUBSCRIPTION_ID }}
+    - name: bootstrap resources
+      run: |
+          echo '${{ github.workflow }}-${{ github.event.pull_request.number || github.ref }}';
+          bash bootstrap.sh
+      working-directory: infra/bootstrapping
+      continue-on-error: false
+    - name: setup SDK
+      run: |
+          source "${{ github.workspace }}/infra/bootstrapping/sdk_helpers.sh";
+          source "${{ github.workspace }}/infra/bootstrapping/init_environment.sh";
+          bash setup.sh
+      working-directory: sdk/python
+      continue-on-error: true
+    - name: validate readme
+      run: |
+          python check-readme.py "${{ github.workspace }}/sdk/python/featurestore_sample"
+      working-directory: infra/bootstrapping
+      continue-on-error: false
+    - name: setup-cli
+      run: |
+          source "${{ github.workspace }}/infra/bootstrapping/sdk_helpers.sh";
+          source "${{ github.workspace }}/infra/bootstrapping/init_environment.sh";
+          bash setup.sh
+      working-directory: cli
+      continue-on-error: true
+    - name: setup feature-store resources
+      run: |
+          bash -x automation-test/setup-resources.sh automation-test/test_featurestore_sdk_samples.ipynb
+      working-directory: sdk/python/featurestore_sample
+      continue-on-error: true
+    - name: run featurestore_sample/automation-test/test_featurestore_sdk_samples.ipynb
+      run: |
+          source "${{ github.workspace }}/infra/bootstrapping/sdk_helpers.sh";
+          source "${{ github.workspace }}/infra/bootstrapping/init_environment.sh";
+          bash "${{ github.workspace }}/infra/bootstrapping/sdk_helpers.sh" generate_workspace_config "../../.azureml/config.json";
+          bash "${{ github.workspace }}/infra/bootstrapping/sdk_helpers.sh" replace_template_values "test_featurestore_sdk_samples.ipynb";
+          [ -f "../../.azureml/config" ] && cat "../../.azureml/config";
+          papermill -k python test_featurestore_sdk_samples.ipynb test_featurestore_sdk_samples.output.ipynb
+      working-directory: sdk/python/featurestore_sample/automation-test
+    - name: upload notebook's working folder as an artifact
+      if: ${{ always() }}
+      uses: actions/upload-artifact@v2
+      with:
+        name: test_featurestore_sdk_samples
+        path: sdk/python/featurestore_sample/automation-test
Original file line number Diff line number Diff line change
@@ -0,0 +1,85 @@
+# This code is autogenerated.
+# Code is generated by running custom script: python3 readme.py
+# Any manual changes to this file may cause incorrect behavior.
+# Any manual changes will be overwritten if the code is regenerated.
+
+name: sdk-featurestore_sample-automation-test-test_featurestore_vnet_samples
+# This file is created by sdk/python/readme.py.
+# Please do not edit directly.
+on:
+  workflow_dispatch:
+  schedule:
+    - cron: "3 8/12 * * *"
+  pull_request:
+    branches:
+      - main
+    paths:
+      - sdk/python/featurestore_sample/automation-test/**
+      - .github/workflows/sdk-featurestore_sample-automation-test-test_featurestore_vnet_samples.yml
+      - sdk/python/dev-requirements.txt
+      - infra/bootstrapping/**
+      - sdk/python/setup.sh
+      - sdk/python/featurestore_sample/**
+permissions:
+  id-token: write
+concurrency:
+  group: ${{ github.workflow }}-${{ github.event.pull_request.number || github.ref }}
+  cancel-in-progress: true
+jobs:
+  build:
+    runs-on: ubuntu-latest
+    steps:
+    - name: check out repo
+      uses: actions/checkout@v2
+    - name: setup python
+      uses: actions/setup-python@v2
+      with:
+        python-version: "3.10"
+    - name: pip install notebook reqs
+      run: pip install -r sdk/python/dev-requirements.txt
+    - name: azure login
+      uses: azure/login@v1
+      with:
+        client-id: ${{ secrets.OIDC_AZURE_CLIENT_ID }}
+        tenant-id: ${{ secrets.OIDC_AZURE_TENANT_ID }}
+        subscription-id: ${{ secrets.OIDC_AZURE_SUBSCRIPTION_ID }}
+    - name: bootstrap resources
+      run: |
+          echo '${{ github.workflow }}-${{ github.event.pull_request.number || github.ref }}';
+          bash bootstrap.sh
+      working-directory: infra/bootstrapping
+      continue-on-error: false
+    - name: setup SDK
+      run: |
+          source "${{ github.workspace }}/infra/bootstrapping/sdk_helpers.sh";
+          source "${{ github.workspace }}/infra/bootstrapping/init_environment.sh";
+          bash setup.sh
+      working-directory: sdk/python
+      continue-on-error: true
+    - name: setup-cli
+      run: |
+          source "${{ github.workspace }}/infra/bootstrapping/sdk_helpers.sh";
+          source "${{ github.workspace }}/infra/bootstrapping/init_environment.sh";
+          bash setup.sh
+      working-directory: cli
+      continue-on-error: true
+    - name: setup feature-store resources
+      run: |
+          bash -x automation-test/setup-resources-vnet.sh automation-test/test_featurestore_vnet_samples.ipynb
+      working-directory: sdk/python/featurestore_sample
+      continue-on-error: true
+    - name: run featurestore_sample/automation-test/test_featurestore_vnet_samples.ipynb
+      run: |
+          source "${{ github.workspace }}/infra/bootstrapping/sdk_helpers.sh";
+          source "${{ github.workspace }}/infra/bootstrapping/init_environment.sh";
+          bash "${{ github.workspace }}/infra/bootstrapping/sdk_helpers.sh" generate_workspace_config "../../.azureml/config.json";
+          bash "${{ github.workspace }}/infra/bootstrapping/sdk_helpers.sh" replace_template_values "test_featurestore_vnet_samples.ipynb";
+          [ -f "../../.azureml/config" ] && cat "../../.azureml/config";
+          papermill -k python test_featurestore_vnet_samples.ipynb test_featurestore_vnet_samples.output.ipynb
+      working-directory: sdk/python/featurestore_sample/automation-test
+    - name: upload notebook's working folder as an artifact
+      if: ${{ always() }}
+      uses: actions/upload-artifact@v2
+      with:
+        name: test_featurestore_vnet_samples
+        path: sdk/python/featurestore_sample/automation-test
Original file line number Diff line number Diff line change
@@ -11,7 +11,7 @@ index_columns:
 - name: accountID
   type: string
 source:
-  path: wasbs://data@azuremlexampledata.blob.core.windows.net/feature-store-prp/datasources/accounts-precalculated/*.parquet
+  path: wasbs://data@azuremlfsexampledata.blob.core.windows.net/feature-store-prp/datasources/accounts-precalculated/*.parquet
   timestamp_column:
     name: timestamp
   type: parquet
Original file line number Diff line number Diff line change
@@ -19,7 +19,7 @@ index_columns:
 - name: accountID
   type: string
 source:
-  path: wasbs://data@azuremlexampledata.blob.core.windows.net/feature-store-prp/datasources/transactions-source/*.parquet
+  path: wasbs://data@azuremlfsexampledata.blob.core.windows.net/feature-store-prp/datasources/transactions-source/*.parquet
   source_delay:
     days: 0
     hours: 0
Original file line number Diff line number Diff line change
@@ -565,7 +565,7 @@
 "outputs": [],
 "source": [
 "# Remove the \".\" in the root directory path as we need to generate absolute path to read from Spark.\n",
-"transactions_source_data_path = \"wasbs://data@azuremlexampledata.blob.core.windows.net/feature-store-prp/datasources/transactions-source/*.parquet\"\n",
+"transactions_source_data_path = \"wasbs://data@azuremlfsexampledata.blob.core.windows.net/feature-store-prp/datasources/transactions-source/*.parquet\"\n",
 "transactions_src_df = spark.read.parquet(transactions_source_data_path)\n",
 "\n",
 "display(transactions_src_df.head(5))\n",
@@ -640,7 +640,7 @@
 "transactions_featureset_spec = create_feature_set_spec(\n",
 "    source=FeatureSource(\n",
 "        type=SourceType.parquet,\n",
-"        path=\"wasbs://data@azuremlexampledata.blob.core.windows.net/feature-store-prp/datasources/transactions-source/*.parquet\",\n",
+"        path=\"wasbs://data@azuremlfsexampledata.blob.core.windows.net/feature-store-prp/datasources/transactions-source/*.parquet\",\n",
 "        timestamp_column=TimestampColumn(name=\"timestamp\"),\n",
 "        source_delay=DateTimeOffset(days=0, hours=0, minutes=20),\n",
 "    ),\n",
@@ -930,7 +930,7 @@
 },
 "outputs": [],
 "source": [
-"observation_data_path = \"wasbs://data@azuremlexampledata.blob.core.windows.net/feature-store-prp/observation_data/train/*.parquet\"\n",
+"observation_data_path = \"wasbs://data@azuremlfsexampledata.blob.core.windows.net/feature-store-prp/observation_data/train/*.parquet\"\n",
 "observation_data_df = spark.read.parquet(observation_data_path)\n",
 "obs_data_timestamp_column = \"timestamp\"\n",
 "\n",
Original file line number Diff line number Diff line change
@@ -471,7 +471,7 @@
 },
 "outputs": [],
 "source": [
-"accounts_data_path = \"wasbs://data@azuremlexampledata.blob.core.windows.net/feature-store-prp/datasources/accounts-precalculated/*.parquet\"\n",
+"accounts_data_path = \"wasbs://data@azuremlfsexampledata.blob.core.windows.net/feature-store-prp/datasources/accounts-precalculated/*.parquet\"\n",
 "accounts_df = spark.read.parquet(accounts_data_path)\n",
 "\n",
 "display(accounts_df.head(5))"
@@ -526,7 +526,7 @@
 "accounts_featureset_spec = create_feature_set_spec(\n",
 "    source=FeatureSource(\n",
 "        type=SourceType.parquet,\n",
-"        path=\"wasbs://data@azuremlexampledata.blob.core.windows.net/feature-store-prp/datasources/accounts-precalculated/*.parquet\",\n",
+"        path=\"wasbs://data@azuremlfsexampledata.blob.core.windows.net/feature-store-prp/datasources/accounts-precalculated/*.parquet\",\n",
 "        timestamp_column=TimestampColumn(name=\"timestamp\"),\n",
 "    ),\n",
 "    index_columns=[Column(name=\"accountID\", type=ColumnType.string)],\n",
@@ -703,7 +703,7 @@
 "from azureml.featurestore import get_offline_features\n",
 "\n",
 "# Load the observation data. To understand observation data, refer to part 1 of this tutorial\n",
-"observation_data_path = \"wasbs://data@azuremlexampledata.blob.core.windows.net/feature-store-prp/observation_data/train/*.parquet\"\n",
+"observation_data_path = \"wasbs://data@azuremlfsexampledata.blob.core.windows.net/feature-store-prp/observation_data/train/*.parquet\"\n",
 "observation_data_df = spark.read.parquet(observation_data_path)\n",
 "obs_data_timestamp_column = \"timestamp\""
 ]
Original file line number Diff line number Diff line change
@@ -603,7 +603,7 @@
 "outputs": [],
 "source": [
 "# Copy sample source data for transactions feature set used in this tutorial series from the public storage account to the newly created storage account\n",
-"transactions_source_data_path = \"wasbs://data@azuremlexampledata.blob.core.windows.net/feature-store-prp/datasources/transactions-source/*.parquet\"\n",
+"transactions_source_data_path = \"wasbs://data@azuremlfsexampledata.blob.core.windows.net/feature-store-prp/datasources/transactions-source/*.parquet\"\n",
 "transactions_src_df = spark.read.parquet(transactions_source_data_path)\n",
 "\n",
 "transactions_src_df.write.parquet(\n",
@@ -643,7 +643,7 @@
 "outputs": [],
 "source": [
 "# Copy sample source data for account feature set used in this tutorial series from the public storage account to the newly created storage account\n",
-"accounts_data_path = \"wasbs://data@azuremlexampledata.blob.core.windows.net/feature-store-prp/datasources/accounts-precalculated/*.parquet\"\n",
+"accounts_data_path = \"wasbs://data@azuremlfsexampledata.blob.core.windows.net/feature-store-prp/datasources/accounts-precalculated/*.parquet\"\n",
 "accounts_data_df = spark.read.parquet(accounts_data_path)\n",
 "\n",
 "accounts_data_df.write.parquet(\n",
@@ -683,7 +683,7 @@
 "outputs": [],
 "source": [
 "# Copy sample observation data used for training from the public storage account to the newly created storage account\n",
-"observation_data_train_path = \"wasbs://data@azuremlexampledata.blob.core.windows.net/feature-store-prp/observation_data/train/*.parquet\"\n",
+"observation_data_train_path = \"wasbs://data@azuremlfsexampledata.blob.core.windows.net/feature-store-prp/observation_data/train/*.parquet\"\n",
 "observation_data_train_df = spark.read.parquet(observation_data_train_path)\n",
 "\n",
 "observation_data_train_df.write.parquet(\n",
@@ -723,7 +723,7 @@
 "outputs": [],
 "source": [
 "# Copy sample observation data used for batch inference from a public storage account to the newly created storage account\n",
-"observation_data_inference_path = \"wasbs://data@azuremlexampledata.blob.core.windows.net/feature-store-prp/observation_data/batch_inference/*.parquet\"\n",
+"observation_data_inference_path = \"wasbs://data@azuremlfsexampledata.blob.core.windows.net/feature-store-prp/observation_data/batch_inference/*.parquet\"\n",
 "observation_data_inference_df = spark.read.parquet(observation_data_inference_path)\n",
 "\n",
 "observation_data_inference_df.write.parquet(\n",
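Every hunk in the spec files and notebooks above makes the same substitution: the storage account in the `wasbs://` URL changes from `azuremlexampledata` to `azuremlfsexampledata`, while the container (`data`) and the blob path stay the same. As an illustrative sketch only (not part of this PR), the same rewrite can be expressed in Python. The function name `swap_storage_account` and the two constants are hypothetical, and the code assumes URLs of the form `wasbs://<container>@<account>.blob.core.windows.net/<path>`.

```python
from urllib.parse import urlsplit, urlunsplit

# Account names taken from the hunks above.
OLD_ACCOUNT = "azuremlexampledata"
NEW_ACCOUNT = "azuremlfsexampledata"


def swap_storage_account(url: str, old: str = OLD_ACCOUNT, new: str = NEW_ACCOUNT) -> str:
    """Replace the storage account in a wasbs:// URL, leaving everything else intact.

    URLs that use another scheme or another account are returned unchanged.
    """
    parts = urlsplit(url)
    # The netloc looks like "<container>@<account>.blob.core.windows.net".
    container, _, host = parts.netloc.partition("@")
    account, dot, suffix = host.partition(".")
    if parts.scheme != "wasbs" or account != old:
        return url
    return urlunsplit(parts._replace(netloc=f"{container}@{new}{dot}{suffix}"))


print(
    swap_storage_account(
        "wasbs://data@azuremlexampledata.blob.core.windows.net"
        "/feature-store-prp/observation_data/train/*.parquet"
    )
)
```

Parsing the URL rather than calling `str.replace` keeps the change limited to the account portion of the host, so a path that happened to contain the old account name would not be touched.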