Skip to content

A case study that analyzes and filters data using pandas.

Notifications You must be signed in to change notification settings

ksommerdorf/Pandas

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PyCity Schools Analysis

Overview of School District Analysis

The chief data scientist for a city school district, Maria, has asked for help in preparing all standardized test data for analysis, reporting, and presentation. The goal is to provide insights about performance trends and patterns to help make strategic decisions at the school and district level. The school board was recently notified that Thomas High School ninth graders' reading and math scores show evidence of academic dishonesty. This project is tasked to meet the goal by removing Thomas High School ninth graders' score data and analyzing the results.

Development Environment

  • Jupyter Notebook
  • Python v3.10.4
    • Python Pandas library
    • Python Numpy library

Resources

Analysis Results

District Summary

  • Original data:

original_district_summary

  • Cleaned data:

district_summary

  • The district summary data is mainly unaffected by the inaccurate data from Thomas High School. The only affect is the slight decrease (~0.1%) in the average math score, however this slight decrease is not significant enough to decrease the percentage of passing math.

School Summary

  • Original data:

original_school_summary

  • Cleaned data:

school_summary

  • Thomas High School's overall percent passing rate decreased by ~0.3%. The percent passing reading when down ~0.3% and the percent passing math went down ~0.1%. The percent passing rate remains well over the average of the district of 65% at 90.6%.

Thomas High School's Performance Compared to the Other Schools:

  • Original ranking:

original_ranking

  • After data cleaning ranking:

new_ranking

  • After replacing the ninth graders' math and reading scores, Thomas High School remains 2nd out of 14 schools in the district.

Math and Reading Scores by Grade:

  • Original data (math scores):

original_math_scores

  • Cleaned data (math scores):

math_scores

  • The new average math score for ninth graders in Thomas High School is now not available. The other scores remain the same.

  • Original data (reading scores):

original_reading_scores

  • Cleaned data (reading scores):

reading_scores

  • The new average reading score for ninth graders in Thomas High Schools is also not available, all other scores remain the same.

Scores by School Spending:

  • Original data:

original_school_spending

  • Cleaned data:

school_spending

  • There is no effect on the original data of average scores by the amount spent per student. Both sets of data show a negative trend where as spending increases per student the overall passing percentage decreases.

Scores by School Size:

  • Original data:

original_school_size

  • Cleaned data:

school_size

  • There is no effect on the original data of score averages by school size. Both sets of data show a negative trend where as the size of the school increases the overall percent passing decreases.

Scores by School Type:

  • Original data:

original_school_type

  • Cleaned data:

school_type

  • There is no effect on the original data of score averages by school type. Both sets of data show a significant difference between charter school performance versus district school performance. Charter schools perform at a much higher rate with an overall passing percentage of 90% compared to district schools with an overall passing percent of 54%.

Analysis Summary

After cleaning the data by removing the ninth graders' reading and math scores from Thomas High School, only slight differences can be observed in the cleaned data. The differences are more noticeable at the school level because at the district level with a total of 39,170 students, 461 students is not as significant as compared to 1,635 students at Thomas High School. The noticeable changes include a ~0.1% decrease (83.42 to 83.35) in the average math scores, ~0.1% decrease (93.27 to 93.19%) in the percent passing math, ~0.15% decrease (83.85 to 83.90) in average reading score, ~0.3% decrease (97.31 to 97.02%) in percent passing reading, and ~0.3% decrease (90.95 to 90.63%) in percent overall passing. Therefore, the dishonest test scores did not significantly affect other schools in the district in comparison to Thomas High School but significantly affected the individual performance of Thomas High School.

About

A case study that analyzes and filters data using pandas.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published