

Get sdk integration tests working with Spark #1190

Merged: 11 commits into main from eng-2614-get-sdk-integration-tests-working-with on Apr 11, 2023

Conversation

hsubbaraj-spiral (Contributor)

Describe your changes and why you are making these changes

Modifies the SDK integration tests to work with Spark. Most of the changes add a Spark case to test functions that need PySpark-DataFrame-specific code. I also add the pytest.mark.skip_for_spark_engines() marker to skip tests that don't work with Spark.
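
A minimal sketch of what the marker and the engine-specific branching could look like (the skip_for_spark_engines name comes from the description above; the --spark option, the hook bodies, and the check_row_count helper are illustrative assumptions, not this PR's actual code):

# conftest.py (sketch): register the custom marker and skip marked tests
# whenever the suite runs against a Spark-based engine.
import pandas as pd
import pytest

def pytest_configure(config):
    config.addinivalue_line(
        "markers",
        "skip_for_spark_engines: skip this test when running on Spark engines",
    )

def pytest_collection_modifyitems(config, items):
    # Hypothetical flag; the real suite presumably derives this from its
    # engine configuration rather than a command-line option.
    if not config.getoption("--spark", default=False):
        return
    skip_spark = pytest.mark.skip(reason="test does not work with Spark engines")
    for item in items:
        if "skip_for_spark_engines" in item.keywords:
            item.add_marker(skip_spark)

# Illustrative engine-specific branch inside a test helper: PySpark
# DataFrames expose .count(), while pandas DataFrames use len().
def check_row_count(df, expected):
    if isinstance(df, pd.DataFrame):
        assert len(df) == expected
    else:
        assert df.count() == expected

A test that cannot run on Spark would then simply be decorated with @pytest.mark.skip_for_spark_engines().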

Related issue number (if any)

Loom demo (if any)

Checklist before requesting a review

  • I have created a descriptive PR title. The PR title should complete the sentence "This PR...".
  • I have performed a self-review of my code.
  • I have included a small demo of the changes. For the UI, this would be a screenshot or a Loom video.
  • If this is a new feature, I have added unit tests and integration tests.
  • I have run the integration tests locally and they are passing.
  • I have run the linter script locally (See python3 scripts/run_linters.py -h for usage).
  • All features on the UI continue to work correctly.
  • Added one of the following CI labels:
    • run_integration_test: Runs integration tests
    • skip_integration_test: Skips integration tests (Should be used when changes are ONLY documentation/UI)

@hsubbaraj-spiral added the run_integration_test label (Triggers integration tests) on Apr 10, 2023.
@kenxu95 (Contributor) left a comment:

Just a couple things I want to check on!

integration_tests/sdk/aqueduct_tests/checks_test.py (review comment resolved; outdated)
integration_tests/sdk/aqueduct_tests/flow_test.py (review comment resolved; outdated)
integration_tests/sdk/conftest.py (review comment resolved)
@@ -27,7 +27,7 @@ func NewSparkJobManager(conf *SparkJobManagerConfig) (*SparkJobManager, error) {

 	session, err := livyClient.CreateSession(&spark.CreateSessionRequest{
 		Kind:                     "pyspark",
-		HeartbeatTimeoutInSecond: 600,
+		HeartbeatTimeoutInSecond: 10,
@kenxu95 (Contributor) commented on this diff:

I assume this was a necessary change to get it working with the test suite? This seems like quite a large numeric change - what are the ramifications of this?

@hsubbaraj-spiral (Contributor, Author) replied:

I originally set the heartbeat timeout to 10 minutes (600 seconds) for debugging purposes: it kept the Livy-created Spark session alive so I could look at its logs. I then realized we can check the logs of completed Spark sessions/applications via the Spark UI, so there's no need to keep these sessions alive. Lowering the timeout also frees resources, since each SparkSession is allocated a certain amount of disk space on the driver/worker nodes it uses.
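
For context, heartbeatTimeoutInSecond is a field on Livy's session-creation request: a session that goes without a client heartbeat for that many seconds is reclaimed by Livy, along with the resources it holds. A rough sketch of the equivalent REST call (host and port are illustrative, not taken from this repo):

import requests

# POST /sessions creates a Livy session. With heartbeatTimeoutInSecond: 10,
# an idle session is garbage-collected after ~10 seconds instead of lingering
# for 10 minutes, freeing driver/worker resources sooner.
resp = requests.post(
    "http://livy-server:8998/sessions",  # illustrative host/port
    json={"kind": "pyspark", "heartbeatTimeoutInSecond": 10},
)
resp.raise_for_status()
print(resp.json()["id"], resp.json()["state"])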

@hsubbaraj-spiral merged commit a3e2206 into main on Apr 11, 2023.
@vsreekanti deleted the eng-2614-get-sdk-integration-tests-working-with branch on April 18, 2023.