Introduction to Statistics: A Week-long intensive workshop approach

Day four data work

I have provided a Stata data file with detail on students from the High School and Beyond dataset.

The variables in this data file are listed in the table below.

Name      Contents
id        ID number
female    Dummy variable for female (1 = female, 0 = not female)
race      Race category
ses       Socioeconomic status
schtyp    Type of school
prog      Type of program
read      Reading score
write     Writing score
math      Math score
science   Science score
socst     Social studies score

Below is a list of variables that I would like you to examine. Present descriptive detail on each variable, then determine whether there are significant relationships between the variables requested.

Write a sentence or two about each result, reporting the appropriate statistic (correlation, t-test, ANOVA, or chi-square).
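
As a rough guide to matching the statistic to the variable types: a numeric score against a two-category variable calls for a t-test, a score against a variable with three or more categories calls for ANOVA, two categorical variables call for chi-square, and two numeric variables call for a correlation. The sketch below, written in Python rather than Stata, shows what a one-way ANOVA computes. The function name one_way_anova and the scores are made up for illustration and are not taken from the High School and Beyond data. It reports only the F statistic and its degrees of freedom, so the p-value would still come from your statistics package.

```python
from statistics import mean


def one_way_anova(groups):
    """Return (F, df_between, df_within) for a list of groups of scores."""
    all_scores = [x for g in groups for x in g]
    grand_mean = mean(all_scores)
    k = len(groups)        # number of groups
    n = len(all_scores)    # total number of observations

    # Between-group sum of squares: how far each group mean sits from the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Within-group sum of squares: spread of scores around their own group mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)

    df_between = k - 1
    df_within = n - k
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


# Made-up writing scores for three hypothetical groups (NOT the workshop data).
scores_by_group = [
    [45, 50, 55],
    [50, 55, 60],
    [60, 65, 70],
]

f_stat, df_b, df_w = one_way_anova(scores_by_group)
print(f"F({df_b}, {df_w}) = {f_stat:.2f}")
```

With only two groups, the F statistic equals the square of the equal-variance t statistic, so the two tests lead to the same conclusion.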

1. Use the appropriate tests to determine whether there is a significant relationship between writing score and each of the following variables:

a. race

b. socioeconomic status

c. school type

d. program type

2. Are socioeconomic status and school type independent?

3. Are school type and program type independent?
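
Questions 2 and 3 ask whether two categorical variables are independent, which is what a chi-square test on their cross-tabulation addresses. The sketch below, in Python rather than Stata, shows the calculation from a table of counts. The function name chi_square_independence and the counts are hypothetical placeholders, not the High School and Beyond cross-tabulation. The p-value line uses a shortcut that holds only when the table has 2 degrees of freedom (for example, a 3 x 2 table); for any other table, take the p-value from your statistics package.

```python
import math


def chi_square_independence(table):
    """Return (chi2, df) for a contingency table given as a list of rows of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)

    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Count expected in this cell if the two variables were independent.
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    df = (len(table) - 1) * (len(table[0]) - 1)
    return chi2, df


# Made-up counts (NOT the workshop data): three row categories by two column categories.
counts = [
    [30, 10],
    [40, 20],
    [20, 30],
]

chi2, df = chi_square_independence(counts)

# Shortcut valid ONLY for df == 2, where the chi-square upper tail is exp(-chi2 / 2).
p_value = math.exp(-chi2 / 2) if df == 2 else None

print(f"chi-square({df}) = {chi2:.2f}, p = {p_value}")
```

A small p-value is evidence against independence; a large one means the data are consistent with the two variables being unrelated.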

Finally, run a regression of writing score as a function of gender, race (use dummy variable "white"), socioeconomic status (use the two dummy variables), type of school (use dummy variable "privateschool"), and program type (use the dummy variables). Discuss the R-squared value and say something about each coefficient.
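
To illustrate how dummy-variable coefficients and R-squared are read, here is a minimal sketch in Python rather than Stata. It uses a single three-category predictor, because in that special case the regression intercept equals the mean of the omitted (reference) category and each dummy coefficient equals the difference between that category's mean and the reference mean, so no regression routine is needed. The category labels, the scores, and the helper names make_dummies, group_mean, and r_squared are all made up for illustration. In the full model requested above, each coefficient is instead a difference that holds the other predictors constant, and it must be estimated with your statistics package.

```python
def make_dummies(values, reference):
    """Return {category: [0/1, ...]} for every category except the reference."""
    categories = sorted(set(values) - {reference})
    return {c: [1 if v == c else 0 for v in values] for c in categories}


def group_mean(scores, labels, category):
    """Mean score among observations whose label equals the given category."""
    selected = [s for s, lab in zip(scores, labels) if lab == category]
    return sum(selected) / len(selected)


def r_squared(observed, predicted):
    """Share of the variation in the outcome that the predictions account for."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((y - p) ** 2 for y, p in zip(observed, predicted))
    ss_tot = sum((y - mean_obs) ** 2 for y in observed)
    return 1 - ss_res / ss_tot


# Made-up data (NOT the workshop data): a three-category variable and a score.
ses = ["low", "low", "middle", "middle", "high", "high"]
write = [40, 50, 50, 60, 60, 70]

# Three categories need only two dummies; "low" is the omitted reference category.
dummies = make_dummies(ses, reference="low")

# Single categorical predictor: intercept = reference mean, coefficients = mean differences.
intercept = group_mean(write, ses, "low")
coefs = {c: group_mean(write, ses, c) - intercept for c in dummies}

predicted = [
    intercept + sum(coefs[c] * dummies[c][i] for c in dummies)
    for i in range(len(write))
]
r2 = r_squared(write, predicted)

print("intercept:", intercept)
print("coefficients:", coefs)
print(f"R-squared: {r2:.3f}")
```

Each coefficient is read relative to the omitted category, which is why a variable with three categories enters the regression as two dummy variables.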