This repo contains code providing solution for below queries using the World Bank projects dataset ('dataset/world_bank_projects.json')
- Find the 10 countries with most projects
- Find the top 10 major project themes (using column 'mjtheme_namecode')
- In point 2 above, there are some entries that have only the code and the name is missing. Create a dataframe with the missing names filled in.
This code is written in Python & Jupyter and covers below key concepts
• Reading and manipulating JSON data
• Data wrangling & cleaning
• Flattening JSON structure
• Usage of json_normalize
• Use of Pandas Libraries and Function