Hospital Readmission Prediction using Markov Random Fields

This project aims to predict hospital readmissions using Markov Random Fields (MRFs) and compare its performance with traditional machine learning models such as Gradient Boosting.

Dataset

The dataset used in this project contains patient information, including demographic data, medical history, and hospital admission details. The data was collected from a national data warehouse that collects comprehensive clinical records across hospitals throughout the United States.

Methodology

The project follows these main steps:

Data Preprocessing:
- Handling missing values and data inconsistencies.
Feature Engineering:
- Encoding categorical variables.
- Selecting relevant features for readmission prediction.
Model Development:
- Constructing an MRF model using the selected features.
- Defining potential functions and edges based on domain knowledge and correlation analysis.
- Creating a factor for the target variable 'readmitted'.
Model Training and Evaluation:
- Training the MRF model using the preprocessed data.
- Evaluating the model's performance using metrics such as accuracy, precision, recall, F1 score, and AUC-ROC.
- Comparing the performance of the MRF model with traditional models like Gradient Boosting.
Model Refinement:
- Adjusting the potential functions and edges to improve the model's performance.
- Applying techniques like thresholding to balance the trade-off between true positives and false positives.

Results

The MRF model achieved the following performance metrics:

Accuracy: 0.6000
Precision: 0.6000
Recall: 1.0000
F1 Score: 0.7500
AUC-ROC: 0.2928

Compared to the Gradient Boosting model, the MRF model had a lower accuracy and AUC-ROC score but a perfect recall. The low AUC-ROC score suggests that the MRF model had a high number of false positive predictions.

To address this issue, several refinements were made to the MRF model, including:

Adjusting the potential function for the target variable 'readmitted' to assign higher probabilities to the negative class.
Incorporating additional features and domain knowledge to improve the model's discriminative power.
Applying thresholding to the predicted probabilities to control the balance between true positives and false positives.

Usage

To run the code and reproduce the results, follow these steps:

Install the required dependencies:
- Python 3.11
- pgmpy
- numpy
- pandas
- scikit-learn
Prepare the dataset:
- Place the dataset file in the designated directory.
- Ensure that the dataset follows the expected format and structure.
Run the code:
- Open the Jupyter Notebook Notebook-MRF.ipynb.
- Execute the notebook cells in sequential order.
- Modify the notebook as needed to adjust parameters, feature selection, and model configuration.
Evaluate the results:
- Review the performance metrics displayed in the notebook.
- Compare the results of the MRF model with other models if available.

Future Work

Experiment with different inference methods and elimination orders for the MRF model.
Explore the inclusion of temporal and spatial information in the model.
Investigate the impact of different feature selection techniques on model performance.
Validate the model's generalizability using external datasets or cross-validation techniques.

References

pgmpy documentation: https://pgmpy.org/
scikit-learn documentation: https://scikit-learn.org/

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Notebook-MRF.ipynb		Notebook-MRF.ipynb
Paper.pdf		Paper.pdf
README.md		README.md
diabetic_data.csv		diabetic_data.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hospital Readmission Prediction using Markov Random Fields

Dataset

Methodology

Results

Usage

Future Work

References

About

Releases

Packages

Languages

tramngo1603/MRF-Readmit

Folders and files

Latest commit

History

Repository files navigation

Hospital Readmission Prediction using Markov Random Fields

Dataset

Methodology

Results

Usage

Future Work

References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages