In this project we will be using NLP to preprocess the text inside resume and segregate resumes based on domain. As this is a classification task, we will be using word embedding technique to transform raw text into numerical features. Based on these features final predictions will be performed. For detailed walk-through refer Resume_Segmentation.ipynb.
- Reading data from source and preprocessing it to more desirable format.
- Performing EDA and generating Wordcoud.
- Used word embedding technique for transforming text into numerical vectors.
- Trained a KNN model to classifiy unseen resume to its most nearest match.
For proper execution of application firstly create an environment, then to install prerequisite libraries execute below command in terminal.
pip install sklearn
pip install nltk
pip install wordcloud
pip install pandas
pip install numpy
pip install matplotlib
Raj Praveen Pradhan - LinkedIn