This project uses the "Wuzzuf jobs in Egypt" data set from Kaggle: https://www.kaggle.com/omarhanyy/wuzzuf-jobs
The purpose of the project was to build all the required Java classes (POJO, DAO, web service, and a tester client for the web service) and to expose a web service that does the following:
1. Read the data set, convert it to a DataFrame or Spark RDD, and display a sample of it.
2. Display the structure and summary of the data.
3. Clean the data (null values and duplicates).
4. Count the jobs for each company and display the result sorted by count (which companies post the most jobs?).
5. Show the result of step 4 in a pie chart.
6. Find the most popular job titles.
7. Show the result of step 6 in a bar chart.
8. Find the most popular areas.
9. Show the result of step 8 in a bar chart.
10. List each skill with the number of times it appears, sorted by count, to find the most important required skills.
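
The counting in steps 4 and 10 can be illustrated with a minimal sketch. It uses plain Java streams on an in-memory list instead of a Spark DataFrame or RDD, so it is not the project's actual implementation. The `JobStats` class, the `Job` POJO, its field names, and the assumption that skills are stored as one comma-separated string are all hypothetical and chosen only for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Function;
import java.util.stream.Collectors;

public class JobStats {

    // Hypothetical POJO: field names are assumed, not taken from the project.
    public static final class Job {
        final String title;
        final String company;
        final String skills; // assumed to be comma-separated, e.g. "Java, SQL"

        public Job(String title, String company, String skills) {
            this.title = title;
            this.company = company;
            this.skills = skills;
        }
    }

    // Counts each distinct value and returns the counts sorted by
    // count (descending), then by name (ascending) to break ties.
    public static Map<String, Long> countDescending(List<String> values) {
        Map<String, Long> counts = values.stream()
                .collect(Collectors.groupingBy(Function.identity(), Collectors.counting()));

        List<Map.Entry<String, Long>> entries = new ArrayList<>(counts.entrySet());
        entries.sort((a, b) -> {
            int byCount = Long.compare(b.getValue(), a.getValue());
            return byCount != 0 ? byCount : a.getKey().compareTo(b.getKey());
        });

        // LinkedHashMap keeps the sorted insertion order.
        Map<String, Long> ordered = new LinkedHashMap<>();
        for (Map.Entry<String, Long> e : entries) {
            ordered.put(e.getKey(), e.getValue());
        }
        return ordered;
    }

    // Step 4: number of jobs per company.
    public static Map<String, Long> countByCompany(List<Job> jobs) {
        return countDescending(jobs.stream()
                .map(job -> job.company)
                .collect(Collectors.toList()));
    }

    // Step 10: split each skills string and count every individual skill.
    public static Map<String, Long> countSkills(List<Job> jobs) {
        return countDescending(jobs.stream()
                .flatMap(job -> Arrays.stream(job.skills.split(",")))
                .map(String::trim)
                .filter(skill -> !skill.isEmpty())
                .collect(Collectors.toList()));
    }

    public static void main(String[] args) {
        // Made-up sample rows, not rows from the Kaggle data set.
        List<Job> jobs = Arrays.asList(
                new Job("Java Developer", "Acme", "Java, SQL"),
                new Job("Data Analyst", "Acme", "SQL, Excel"),
                new Job("Java Developer", "Globex", "Java"));

        System.out.println(countByCompany(jobs)); // {Acme=2, Globex=1}
        System.out.println(countSkills(jobs));    // {Java=2, SQL=2, Excel=1}
    }
}
```

The same `countDescending` helper would also cover steps 6 and 8 if it is given the job titles or areas instead of the company names. In a Spark-based version, the equivalent grouping and sorting would be done on the DataFrame or RDD rather than on a `List`.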