Skip to content

click-here/Pandas-vs-Power-Query

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

68 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Pandas-vs-Power-Query

It has been my experience that people often work with data inefficiently in Excel. The purpose of this repo is to introduce users to some of the basic tools at their disposal using the Excel Add-in PowerQuery as well as to introduce them to similar methods with the Python data science library Pandas.

If you are new to Power Query I strongly recommend viewing Part 1 as subsequent Parts will assume basic knowledge explained in Part 1.

Part 1 - Reading and Grouping

Read multiple csv files and an Excel file. Group, filter, and aggregate data. Perform column level math and basic grouping/aggregation math.

Links

Full Data set https://collegescorecard.ed.gov/data/

More info regarding this data https://collegescorecard.ed.gov/data/documentation/

Part 2 - Finding the highest ranked x over n years.

Read a csv file from the web. Unpivot/data normalization, table buffering, and duplicate removal.

Part 3 - Time Card Nightmare! - Coming Soon

About

Compare similar tasks in both Power Query and Python

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published