
String Tensor SplitToSequence fix #19942

Merged: 1 commit into microsoft:main from split-to-sequence-fix on Mar 20, 2024

Conversation

Craigacp
Contributor

Description

The SplitToSequence refactor added a special case for element_size when the input is a string tensor, but this stops the offset computation on line 510 from advancing, since the increment is zero. As a result, the same string is copied into each sequence element. This PR removes that check and adds a test for the string split behaviour.

I'm not sure why the special case on line 456 was added in the original PR (#18594), but all the tests pass with it removed, so presumably it isn't necessary.
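For readers trying to follow the failure mode, here is a minimal, self-contained C++ sketch. It is not the actual onnxruntime code: the offset is treated as an element index rather than a byte offset, and the names (element_size, offset, split_size) are hypothetical. It only illustrates why special-casing element_size to zero for strings makes every sequence element receive the same string.

```cpp
// Hypothetical sketch (not the onnxruntime sources) of splitting a
// 1-D string tensor into single-element chunks along axis 0.
#include <cstddef>
#include <iostream>
#include <string>
#include <vector>

int main() {
  std::vector<std::string> input = {"a", "b", "c", "d"};
  const size_t split_size = 1;  // one element per output chunk

  // Buggy variant: element_size is special-cased to 0 for strings,
  // so the offset never advances and every chunk starts at element 0.
  size_t element_size = 0;  // the special case this PR removes
  size_t offset = 0;
  for (size_t i = 0; i < input.size() / split_size; ++i) {
    std::cout << "buggy chunk " << i << ": " << input[offset] << "\n";
    offset += element_size * split_size;  // 0, so "a" is copied every time
  }

  // Fixed variant: advance by the number of elements copied,
  // independent of any per-element byte size.
  offset = 0;
  for (size_t i = 0; i < input.size() / split_size; ++i) {
    std::cout << "fixed chunk " << i << ": " << input[offset] << "\n";
    offset += split_size;  // yields a, b, c, d as expected
  }
  return 0;
}
```

The buggy loop prints "a" for every chunk, which matches the reported behaviour in #19726; the fixed loop walks through the input as intended.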

Motivation and Context

SplitToSequence didn't work on string tensors; fixes #19726.

cc @pranavsharma

@justinchuby added the core runtime label on Mar 19, 2024
@justinchuby
Contributor

/azp run Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,ONNX Runtime Web CI Pipeline,Windows ARM64 QNN CI Pipeline

@justinchuby
Contributor

/azp run Windows CPU CI Pipeline,Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,Windows x64 QNN CI Pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,onnxruntime-binary-size-checks-ci-pipeline


Azure Pipelines successfully started running 8 pipeline(s).


Azure Pipelines successfully started running 9 pipeline(s).

@Craigacp
Contributor Author

The Windows GPU TensorRT failure says it couldn't find a CUDA device in tests other than the one I added, and at first glance those tests don't appear to use SplitToSequence either. Maybe it hit some dodgy hardware?

@tianleiwu
Contributor

/azp run ONNX Runtime Web CI Pipeline,orttraining-amd-gpu-ci-pipeline,Big Models,Android CI Pipeline,iOS CI Pipeline,ONNX Runtime React Native CI Pipeline


Azure Pipelines successfully started running 6 pipeline(s).

@baijumeswani merged commit 19ff4a6 into microsoft:main on Mar 20, 2024
81 of 82 checks passed
@Craigacp deleted the split-to-sequence-fix branch on March 20, 2024 17:56
YUNQIUGUO pushed a commit that referenced this pull request Mar 21, 2024
TedThemistokleous pushed a commit to TedThemistokleous/onnxruntime that referenced this pull request May 7, 2024
Labels
core runtime: issues related to core runtime
Development

Successfully merging this pull request may close these issues:

SplitToSequence op with string tensor inputs behaves incorrectly in ONNX Runtime 1.17.1
5 participants