Make Xpath accept slightly malformed xml (like html) #1200
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Currently if the xml that a XPath expression is ran against is not perfectly formatted without any errors or warnings you will get
Invalid input XML.
. Thexmldom
package is still able to run XPath expressions even if the XML is not perfectly formatted (like most html documents).Example:
A simple html document that does not close all tags https://gchq.github.io/CyberChef/#recipe=XPath_expression('//meta','%5C%5Cn')&input=CjwhZG9jdHlwZSBodG1sPgo8aHRtbCBsYW5nPSJlbiI%2BCiAgPGhlYWQ%2BCiAgICA8bWV0YSBjaGFyc2V0PSJ1dGYtOCI%2BCiAgPC9oZWFkPgogIDxib2R5PgogIDwvYm9keT4KPC9odG1sPgo
This pull request makes it so the XPath expression will work in the example.