make train test traceable when error occurs #243

witwolf · 2019-10-23T04:13:06Z

logging training stdout & stderr output to stderr (stdout is suppressed at ci) to make it traceable when error occurs.

and this pr can solve the issue: build-times-out-because-no-output-was-received (travis_wait not support for a docker run command )

emailweixu · 2019-10-25T00:21:08Z

alf/bin/train_test.py

@@ -34,17 +36,22 @@ def run_and_stream(cmd, cwd):
        cwd (str): working directory for the process
    """
    logging.info("Running %s", " ".join(cmd))
+
+    logger = logging.ABSLLogger('')


instead of changing the code here, can we simply redirect stdout and stderr to stderr at build.sh?

python3 -m unittest discover -p "*_test.py" -v 1>&2

If not, we need some comment in the code to explain the reason of doing all these stuff so that future readers can understand it.

yes, it can be done withpython3 -m unittest discover -p "*_test.py" -v 1>&2 ,
but there is a potential problem Log length exceeded 4 MB if we log for all stdout (now the log file is about 3.2MB)

I see. Then please add some comments here.

witwolf · 2019-10-25T06:43:42Z

how about other tests, don’t they have the same issue?

other tests are not in sub process like train test (python -m unittest ... -> python .._test_py-> python -m alf.bin.train ... )

emailweixu · 2019-10-25T16:05:08Z

alf/bin/train_test.py

    process = subprocess.Popen(
        cmd, stdout=subprocess.PIPE, stderr=subprocess.STDOUT, cwd=cwd)

    while process.poll() is None:
        with io.TextIOWrapper(process.stdout, encoding="utf-8") as text_io:
            for line in text_io:
-                logging.info(line.strip())
+                logger.info(line.strip())


Now the original stdout goes to stderr, and the original stderr goes to stdout (because stderr=subprocess.STDOUT at line 50). So I am still confused. And why not simply use stdout=subprocess.STDERR at line 50?

emailweixu · 2019-10-25T16:05:30Z

other tests are not in sub process like train test (python -m unittest ... -> python .._test_py-> python -m alf.bin.train ... )

I am still confused. If the stdout of train_test is suppressed, aren't the stdout of other tests also suppressed?

witwolf · 2019-10-26T02:48:00Z

other tests are not in sub process like train test (python -m unittest ... -> python .._test_py-> python -m alf.bin.train ... )

I am still confused. If the stdout of train_test is suppressed, aren't the stdout of other tests also suppressed?

train test cost much time, may encounter such a problem build-times-out-because-no-output-was-received when no stdout output .

solve this with travis_wait in the new commit (travis-ci/travis-ci#6934 )

emailweixu · 2019-10-26T05:41:22Z

alf/bin/train_test.py


+import logging as sys_logging


make train test traceable when error occurs

95f4ee3

witwolf requested review from emailweixu, runjerry and hnyu October 23, 2019 04:13

emailweixu reviewed Oct 25, 2019

View reviewed changes

add comment

edb4f11

witwolf requested a review from emailweixu October 25, 2019 02:25

emailweixu reviewed Oct 25, 2019

View reviewed changes

travis_wait for docker command

df4e001

emailweixu reviewed Oct 26, 2019

View reviewed changes

remove unused import

20a8fa4

emailweixu approved these changes Oct 28, 2019

View reviewed changes

emailweixu merged commit 80a6386 into master Oct 28, 2019

witwolf deleted the PR_train_test1 branch October 29, 2019 03:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

make train test traceable when error occurs #243

make train test traceable when error occurs #243

witwolf commented Oct 23, 2019

emailweixu Oct 25, 2019

witwolf Oct 25, 2019 •

edited

Loading

emailweixu Oct 25, 2019

witwolf Oct 25, 2019

witwolf commented Oct 25, 2019

emailweixu Oct 25, 2019

emailweixu commented Oct 25, 2019

witwolf commented Oct 26, 2019

emailweixu Oct 26, 2019

witwolf Oct 26, 2019

make train test traceable when error occurs #243

make train test traceable when error occurs #243

Conversation

witwolf commented Oct 23, 2019

emailweixu Oct 25, 2019

Choose a reason for hiding this comment

witwolf Oct 25, 2019 • edited Loading

Choose a reason for hiding this comment

emailweixu Oct 25, 2019

Choose a reason for hiding this comment

witwolf Oct 25, 2019

Choose a reason for hiding this comment

witwolf commented Oct 25, 2019

emailweixu Oct 25, 2019

Choose a reason for hiding this comment

emailweixu commented Oct 25, 2019

witwolf commented Oct 26, 2019

emailweixu Oct 26, 2019

Choose a reason for hiding this comment

witwolf Oct 26, 2019

Choose a reason for hiding this comment

witwolf Oct 25, 2019 •

edited

Loading