Skip to content

Simple script to check the latest on the Arxiv. Flask to query the Arxiv API and show results in a clean HTML format. Collapsible abstracts, links to articles, and scirates.

License

Notifications You must be signed in to change notification settings

mekise/speed-the-arxiv

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

86 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

speed-the-arxiv

It is common practice to read the Arxiv periodically, checking the same sections, the same keywords, and the same authors. Speed-the-arxiv tries to speed these searches up. Together with a little HTML, it checks the latest on the Arxiv based on sections, keywords, and authors of choice. These keys are stored in .yaml files and are reused every time you run the script. Additional parameters in the file let you personalize the search criteria. In the folder search/, you can have as many .yaml files as you want. You can choose what to search for on the landing page of speedthearxiv.py. The script uses Flask to query the Arxiv API and it shows the results in a clean HTML page. It includes collapsible abstracts and links to the articles. If needed, it checks and associates the Scirates to each article. Javascript is behind for some dynamicity. Ajax takes care of making the search buttons and Flask talk. As the latest addition, you can search bibtex entries using Crossref.

⚠️ Fetching the Scirates slows down the app quite a bit. Unfortunately, no API is available for this and I had to fetch it "manually".

Try speed-the-arxiv

At this link, you can try speed-the-arxiv with (very) limited functionality -> https://mekise.pythonanywhere.com/

A note on "learned" data

I thought about using ML (I even started the relative branch) to predict what you might be interested in, but I chose to drop the ball. The reason is simple, I want speed-the-arxiv to return a broader spectrum of results, including some exotic catches. Reading exclusively ad-hoc papers based on your past preferences does not incentivize creativity or change, keys in research. On the other hand, I want you to find something relevant to your work, without losing focus. That is why the search criteria can distill the vastest of the fields. There you have it, choose your set of parameters for your search, click the button, and start exploring.

To-do

  • Collapsible overview of .yaml parameters to the landing page.
  • Keywords/keyauthors highlighting in HTML, for literal search.
  • Last-modified search file on top of the index page.
  • Add folder link to quickly access search files.
  • Add Crossref search for bibtex entries
  • Add MathJax support
  • Collapsible stats to the search page (keys count etc.).
  • Fix occasional events of primary-category/category mishandling (see primary-category and category of example paper https://arxiv.org/abs/2307.06627).

How to use it

  • Clone the repo or download
  • Change the keys/parameters in the search/config.yaml file. You can create multiple .yaml files to have different searches ready.
  • Run python speedthearxiv.py in the terminal.
  • Select the search you want from the list.

Dependencies

You will need some packages. To have everything you need, run in the terminal:

    pip install flask waitress datetime feedparser requests pyyaml habanero

What it looks like

About

Simple script to check the latest on the Arxiv. Flask to query the Arxiv API and show results in a clean HTML format. Collapsible abstracts, links to articles, and scirates.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published