Skip to content

Coding challenge for a BMW job interview involving: 1) an ensemble of supervised binary classifiers; 2) unsupervised outlier detection of text data, dimensionality reduction, and nearest neighbor search

Notifications You must be signed in to change notification settings

kingjosephm/BMW-Tasks

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Coding Challenge for a BMW Job Interview

While it might be unorthodox to post code from a job interview coding challenge, I not only found this challenge enjoyable, but also a good opportunity to showcase various modeling techniques.

Project Structure

  • data: this folder contains the CSV files used in the coding challenged. I believe this was a combination of real and synthesized/anonymized data

  • Tasks-BMW.pdf: actual writeup describing each section of the coding challenge. There were four tasks, so I created separate notebooks for tasks 1, 2, and combined 3-4.

  • Task1.ipynb: Ensemble of binary classifiers for a tabular dataset of mixed data types. Classifiers included SVM, naive Bayes, and a gradient boosted classifier. All models were hyperoptimized using sklearn's grid search method. The ensembler was based on hard voting of the predictors.

  • Task2.ipynb: Dimensionality reduction using latent semantic analysis, unsupervised outlier detection using isolation forests, and nearest neighbor search using k-d trees

  • Task3-4.ipynb: Creates some functions to calculate elapsed time between timestamps

About

Coding challenge for a BMW job interview involving: 1) an ensemble of supervised binary classifiers; 2) unsupervised outlier detection of text data, dimensionality reduction, and nearest neighbor search

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published