Ask Statistics and Probability Expert

Data:

This assignment involves the analysis of a data set we have seen previously, which is various biomarkers in TB meningitis patients. The goals of the assignment are to develop some capability in carrying out and comparing different linear mixed effects models. The file is called Classfile_TBM_assignmetn2.csv and is available in the datasets folder on Vula. The data we are using today is biomarker data from a cohort of individuals co-infected with HIV and Tuberculous Meningitis (TBM). Although rare, acquiring TBM usually results in very poor outcomes. The paper from which is data is drawn is Marais et al.

Neutrophil-associated central nervous system inflammation in Tuberculous Meningitis Immune reconstitution inflammatory syndrome. Clin Infect Dis. 59 (2014).

A data frame of 34 individuals, each with observations at three time points. There is no missing data.

Variables:

  • ID - individual ID
  • Group - some individuals developed a condition called IRIS, some did not, this distinguishes the two, it remains in the data set, but we will not use it for this assignment.
  • Time - 0, 2, 4, corresponding to baseline, ART initiation, 2 weeks post ART. Note that the data is in wide format, so you will need to create this variable. It is part of the column names.
  • Potential clinical variables/confounders: BICd4 - CD4 count; BIHIVVL - HIV viral load; BMI - body mass index; BINa - blood sodium
  • Outcome variable: CSFNeutrophils OR CSFLymphocytes

A number of analytes. The analytes were measured either in plasma (blood) or in CSF fluid, which is why the prefix of "CSF" or "BI" occurs. We will ONLY use the CSF measured values, and the BI measured analytes have already been removed (except where relevant). The analytes available are grouped at the end of the assignment. You will not use all of the analytes, but a selection of 4 (see details at end of assignment for how to do this).

This will result in an analysis data set of 1 outcome, 4 potential covariates/confounders (CD4, HIVVL, BMI, BINa) and 4 analytes (in CSF, varies by person).

Instructions: Answer the following questions in a TYPED report and submit a pdf file only to Vula. When answering the questions you should be concise, complete and correct. With every table or figure you must provide an appropriate caption, and a brief [no more than 2 or 3 sentences] explanation of what results the table or figure presents.

Q1. To aid the grader, provide a table that indicates the analytes, and the outcome, that were used in YOUR analysis, and their summary statistics.

Q2. From your preliminary investigations, select one figure that effectively describes the data and provide it, with an appropriate caption.

Q3. Fit the following models, using time as a continuous, linear covariate, and ensuring that covariates are scaled:

(a) linear regression model

(b) random intercepts model (on ID)

(c) random slopes model (on ID)

(d) random slopes and random intercepts model (on ID)

Provide a table that presents the coefficient estimates, 95% confidence intervals for estimates of the fixed effects and all estimates of the variance of the random effects for all of the models to enable a direct comparison. Do not provide p-values.

Note: You will have to decide which of the clinical parameters should be included in this model. Be sure to indicate what is included in your explanation of these results, and fit the same covariate model for all components. Although clinical covariates are available at all time points, you may choose to fit them as time in variant by using just the baseline observations. All of your (4) analytes should be included.

Q4. Fit model (3a) and (3c) with and without scaled covariates/analytes. Provide coefficient estimates and CI intervals together in a single table, as well as variance estimates.

Q5. Plot the random effects estimates (forest plot) for models (3c) and (3d) and provide with an appropriate caption.

Q6. Fit model (3c) with a different function of time. Report the estimates here in a single table. Make sure the caption indicates how time went into the model.

Q7. For each of the random effects models you have fit (in Q3 and Q6, 4 in total), report the proportion of variance that is attributable to the random effect.

Q8. Using a maximum of 300 words, compare and contrast the models you have fit, with attention to the estimated effects, the variation between the random effects estimates, the impact of changing how time is modeled and the impact of scaling the covariates.

How to select YOUR analytes:

Outcome variable: Toss a coin, heads you use CSFNeutrophils, tails you use CSFLymphocytes

Analytes:

Use: TNFalpha

Select 1 from Grp A: IL22, IL18, INFgamma, IL1beta, IL2, MIP1beta, MIP1alpha , LL37,

IL1beta, C5a,

HNP13, TIMP2

Select 2 from Grp B: IP10, MMPs (all of them), MIP2, IL8, IL6, MCP1, IFNalpha2, IL10,

IL12p40, TIMP1, IL17, MMP9:TIMP1

Select these independently without discussion with your colleagues. You do not need to justify your selection.

Attachment:- Assignment Files.rar

Statistics and Probability, Statistics

  • Category:- Statistics and Probability
  • Reference No.:- M92307308

Have any Question?


Related Questions in Statistics and Probability

Introduction to epidemiology assignment -assignment should

Introduction to Epidemiology Assignment - Assignment should be typed, with adequate space left between questions. Read the following paper, and answer the questions below: Sundquist K., Qvist J. Johansson SE., Sundquist ...

Question 1 many high school students take the ap tests in

Question 1. Many high school students take the AP tests in different subject areas. In 2007, of the 144,796 students who took the biology exam 84,199 of them were female. In that same year,of the 211,693 students who too ...

Basic statisticsactivity 1define the following terms1

BASIC STATISTICS Activity 1 Define the following terms: 1. Statistics 2. Descriptive Statistics 3. Inferential Statistics 4. Population 5. Sample 6. Quantitative Data 7. Discrete Variable 8. Continuous Variable 9. Qualit ...

Question 1below you are given the examination scores of 20

Question 1 Below you are given the examination scores of 20 students (data set also provided in accompanying MS Excel file). 52 99 92 86 84 63 72 76 95 88 92 58 65 79 80 90 75 74 56 99 a. Construct a frequency distributi ...

Question 1 assume you have noted the following prices for

Question: 1. Assume you have noted the following prices for paperback books and the number of pages that each book contains. Develop a least-squares estimated regression line. i. Compute the coefficient of determination ...

Question 1 a sample of 81 account balances of a credit

Question 1: A sample of 81 account balances of a credit company showed an average balance of $1,200 with a standard deviation of $126. 1. Formulate the hypotheses that can be used to determine whether the mean of all acc ...

5 of females smoke cigarettes what is the probability that

5% of females smoke cigarettes. What is the probability that the proportion of smokers in a sample of 865 females would be greater than 3%

Armstrong faber produces a standard number-two pencil

Armstrong Faber produces a standard number-two pencil called Ultra-Lite. The demand for Ultra-Lite has been fairly stable over the past ten years. On average, Armstrong Faber has sold 457,000 pencils each year. Furthermo ...

Sppose a and b are collectively exhaustive in addition pa

Suppose A and B are collectively exhaustive. In addition, P(A) = 0.2 and P(B) = 0.8. Suppose C and D are both mutually exclusive and collectively exhaustive. Further, P(C|A) = 0.7 and P(D|B) = 0.5. What are P(C) and P(D) ...

The time to complete 1 construction project for company a

The time to complete 1 construction project for company A is exponentially distributed with a mean of 1 year. Therefore: (a) What is the probability that a project will be finished in one and half years? (b) What is the ...

  • 4,153,160 Questions Asked
  • 13,132 Experts
  • 2,558,936 Questions Answered

Ask Experts for help!!

Looking for Assignment Help?

Start excelling in your Courses, Get help with Assignment

Write us your full requirement for evaluation and you will receive response within 20 minutes turnaround time.

Ask Now Help with Problems, Get a Best Answer

Why might a bank avoid the use of interest rate swaps even

Why might a bank avoid the use of interest rate swaps, even when the institution is exposed to significant interest rate

Describe the difference between zero coupon bonds and

Describe the difference between zero coupon bonds and coupon bonds. Under what conditions will a coupon bond sell at a p

Compute the present value of an annuity of 880 per year

Compute the present value of an annuity of $ 880 per year for 16 years, given a discount rate of 6 percent per annum. As

Compute the present value of an 1150 payment made in ten

Compute the present value of an $1,150 payment made in ten years when the discount rate is 12 percent. (Do not round int

Compute the present value of an annuity of 699 per year

Compute the present value of an annuity of $ 699 per year for 19 years, given a discount rate of 6 percent per annum. As