Introduction to Statistics (MAT/SST 115.03 2008S)
This is one of those fun times in which our data set combines a number
of essentially independent columns into a single data frame. Since R
pads the empty cells in the data frame with
our analyses may be slightly more complicated.
Let's start by loading the data. There's little enough data that we can look at all of it.
CommuteTimes = read.csv("/home/rebelsky/Stats115/Data/HypoCommute.csv") CommuteTimes
The columns are named
A1 (for Alex's Route 1),
A2 (for Alex's Route 2), B1 (for Barb's Route 1), and
so on and so forth.
You should be able to read the sample size from the table. To get
the sample mean and standard deviation, we can use
sd, but need to tell the functions
to ignore the
NA values. (Having to tell the functions
to deal with the NA values differently is one of the disadvantages of
combining the columns.
mean(CommuteTimes$A1, na.rm=T) sd(CommuteTimes$A1, na.rm=T)
R makes two-sample t-tests very easy to compute.
t.test with the two samples.
We repeat the t-test, telling it to use a different confidence level.
You should be able to figure out how to do these computations by revisiting the Alex examples from above.
Copyright (c) 2007-8 Samuel A. Rebelsky.
This work is licensed under a Creative Commons
Attribution-NonCommercial 2.5 License. To view a copy of this
or send a letter to Creative Commons, 543 Howard Street, 5th Floor,
San Francisco, California, 94105, USA.