Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ground Truth Layouts for VQA/VQA 2.0 - Not reproducible #19

Open
goonmeet opened this issue Feb 2, 2023 · 0 comments
Open

Ground Truth Layouts for VQA/VQA 2.0 - Not reproducible #19

goonmeet opened this issue Feb 2, 2023 · 0 comments

Comments

@goonmeet
Copy link

goonmeet commented Feb 2, 2023

Hello!

I am trying to generate ground truth layouts for the TDIUC dataset (which is very similar to the VQA and VQA 2.0 datasets) and does not contain functional programs.

Based on the ReadMe, I used these scripts as a starting point: https://gist.github.com/ronghanghu/67aeb391f4839611d119c73eba53bc5f

As a sanity test, I attempted to generate the layouts for the VQA and VQA 2.0 datasets. However, the generated layouts did not match the GT layouts provided in the repo.

The mismatch between the generated layouts might be because of the output of the Lexical Parser.

The following command is provided in the parse.py script:
java -mx150m -cp "$scriptdir/*:" edu.stanford.nlp.parser.lexparser.LexicalizedParser
-outputFormat "words,typedDependencies" -outputFormatOptions "stem,collapsedDependencies,includeTags"
-sentences newline
edu/stanford/nlp/models/lexparser/englishPCFG.ser.gz \

However, this command does not output a parse tree in the following format: (ROOT (SQ (VBP are) (NP (EX there)) (ADJP (VBG patients)) (. ?)))

Instead, the following command can output something similar to the parse tree format mentioned above:
java -mx150m -cp "*" edu.stanford.nlp.parser.lexparser.LexicalizedParser
-outputFormat "oneline" -outputFormatOptions "stem,collapsedDependencies,includeTags"
-sentences newline edu/stanford/nlp/models/lexparser/englishPCFG.ser.gz questions.txt

Based on this analysis, a few questions:

  1. Is there a specific version of Lexical Parser that was used?
  2. What specific command generates the output needed to run the parser?
  3. Is the intermediate output from the parser available anywhere?

Thanks in advance for your time and help! Greatly appreciated!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant