-
Clone this repo and install all the requirements
-
Download papers (pdf format) from ArXiv and (if possible, it also works fine otherwise but higher running time due pdf parsing) save them as REF.XXXX.YYYY, where REF is your custom reference name, and XXXX.YYYYY is the ArXiv code.
-
Run
>> python run.py -p 'your/folder/path'
where 'your/folder/path'
is the path containing the pdfs.
- Find the results in ./results folder. Also, if for any reason (e.g. the pdf wasn't downloaded from ArXiv) the pdf cannot be added to .bib file, you'll see this in the log file and also in the "failed" files folder.
Next versions are planned to include:
- setup
- docker
- other sources than ArXiv, e.g. NIPS, ICML, etc
You are more than welcome to colaborate! Please feel free to reach me out at pytrainteam@gmail.com. If you modify this code or use it don't forget to cite ;)