NLPExplorer: Exploring the Universe of NLP Papers

Authors - Monarch Parmar, Naman Jain, Pranjali Jain, P. Jayakrishna Sahit, Soham Pachpande, Shruti Singh and Mayank Singh

This repository hosts the dataset used in "NLPExplorer: Exploring the Universe of NLP Papers" accepted at 42nd European Conference on Information Retrieval 2020 (ECIR 2020) in demonstration track.

The dataset is build using the data provided at ACL Anthology website. It is updated till 2019. The details of the dataset can be viewed at link

The data has been split into smaller json files. To combine the data please use the cat and piping commands on Linux systems as follows -

cat filepPrefix* > finalFilename.json

# For example to combine the data of the papers from papers directory, use the following command:
# cat papers* > fullPapers.json

Abstract

Understanding the current research trends, problems, and their innovative solutions remains a bottleneck due to the ever-increasing volume of scientific articles. In this paper, we propose NLPExplorer, a completely automatic portal for indexing, searching, and visualizing Natural Language Processing (NLP) research volume. NLPExplorer presents interesting insights from papers, authors, venues, and topics. In contrast to previous topic modelling based approaches, we manually curate five course-grained non-exclusive topical categories namely Linguistic Target (Syntax, Discourse, etc.), Tasks (Tagging, Summarization, etc.), Approaches (unsupervised, supervised, etc.), Languages (English, Chinese, etc.) and Dataset types (news, clinical notes, etc.). Some of the novel features include a list of young popular authors, popular URLs and datasets, list of topically diverse papers and recent popular papers. Also, it provides temporal statistics such as yearwise popularity of topics, datasets, and seminal papers. To facilitate future research and system development, we make all the processed dataset accessible through API calls.

Paper - Link

Presentation - Link

Cite

  title={NLPExplorer: exploring the universe of NLP papers},
  author={Parmar, Monarch and Jain, Naman and Jain, Pranjali and Sahit, P Jayakrishna and Pachpande, Soham and Singh, Shruti and Singh, Mayank},
  booktitle={European Conference on Information Retrieval},
  pages={476--480},
  year={2020},
  organization={Springer}
}

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
author		author
conference		conference
fullText		fullText
papers		papers
.DS_Store		.DS_Store
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NLPExplorer: Exploring the Universe of NLP Papers

Authors - Monarch Parmar, Naman Jain, Pranjali Jain, P. Jayakrishna Sahit, Soham Pachpande, Shruti Singh and Mayank Singh

Abstract

Paper - Link

Presentation - Link

Cite

About

Releases

Packages

License

lingo-iitgn/NLPExplorer

Folders and files

Latest commit

History

Repository files navigation

NLPExplorer: Exploring the Universe of NLP Papers

Authors - Monarch Parmar, Naman Jain, Pranjali Jain, P. Jayakrishna Sahit, Soham Pachpande, Shruti Singh and Mayank Singh

Abstract

Paper - Link

Presentation - Link

Cite

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages