-
Notifications
You must be signed in to change notification settings - Fork 0
/
HOB(2021)_Group project.Rmd
98 lines (47 loc) · 3.04 KB
/
HOB(2021)_Group project.Rmd
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
---
title: "Hands-on Biostatistics 2021"
author: "Group Hands-on Project"
date: "15/7/2021"
output:
pdf_document: default
word_document: default
html_document: default
---
## Data description
For the hands-on project we will be using a simulated dataset related to Multiple Sclerosis (MS) patients (ms_data.csv). The dataset contains data for 25 patients with MS. The outcome studied here is the the Expanded Disability Status Scale (EDSS). The EDSS scale ranges from 0 to 10 and is based on an examination by a neurologist. A scale 0 indicates normal condition. A high value indicates high dissability progression, with 10 indicating death due to MS. A detailed description of EDSS can be found here: http://www.nationalmssociety.org/nationalmssociety/media/msnationalfiles/brochures/10-2-3-29-edss_form.pdf.
We will consider data for the EDSS measured at first visit and last visit (interval of 6 years). Covariates that were believed to influence EDSS scale are age, MS type and gender.
### Data file
File in drive: ms_data.csv
- Variables:
1. ID: patient id
2. Gender: (male; female)
3. Age_first: age of patient at baseline, numeric
4. MS_type: Type of MS (RRMS; SPMS; PPMS) See more here https://www.mssociety.org.uk/about-ms/types-of-ms
5. edss_first: EDSS measured at first visit
6. edss_last: EDSS measured at the last visit
## Assignment
```{r, echo=FALSE}
#Set working directory
setwd("C:/Users/user/Desktop") # Use the session menu better
#Save the data you have in drive in your working directory and read it with this code:
library(readr)
ms_data <- read_csv("ms_data.csv",
col_types = cols(age_first = col_number(),
edss_first = col_number(), edss_last = col_number(),
gender = col_factor(levels = c("Female",
"Male")), ms_type = col_factor(levels = c("RRMS",
"PPMS", "SPMS")), patient_id = col_number()))
```
Of key interest is to study:
1) Whether the average EDSS has changed from the first visit to the last visit
2) Whether the average EDSS at the last visit is different for male and female patients
3) Whether the average EDSS at the last visit is different for patients with different MS type.
4) Whether age is related to EDSS at the first and the last visit.
Note that you should support each research question with a statistical analysis performed in R (use this notebook) which is appropriate for your data.
## Remarks
- For each question, motivate your choice of techniques, estimation methods, assumptions
you make, and describe possible, problems.
- For each of the above questions, summarize your conclusions and document them as a report to a
clinician.
- Work and document your project in an R Notebook (you will find this in classroom) and submit it via group Classroom before July 17, 2021.
Good luck!