Data sets that have been discussed in class or are used in homework
are:
- Week 1:
- Week 2:
- Data for HW 1, problem 26/25. Zinc in
rats. Variables are: Group (A or B) and zinc level (mg/ml).
- Data for HW 1, problem ./26
Votes in Congress. PctPro is the Percent Pro votes for that member.
That's the response variable for the HW problem.
There are lots of variables in the file. You need
to read them all (at least up to PctPro).
You probably won't use most of the variables in the file.
- Lab 1 values to use in lab 2
-
Schizophrenia data (Case Study 2.2) Each line of data
corresponds to a pair of maternal twins. The first number is the
hippocampus volume for the unaffected individual. The second number
is the hippocampus volume for the individuals affected by
schizophrenia.
- Week 3:
- Transgenic mice weight data Data
for HW 2, problem 1.
Weights of transgenic and non-transgenic mice. Each row is a single
mouse. Variables are tg (0: non-transgenic, 1: transgenic) and
weight (in grams)
- Data for HW 2, radon problem. Concentration
of radon in a simple random sample of owner-occupied homes in Ramsey Co., MN.
- Rainfall
data from Case study 3.1 Used in transf.sas (lab on 9 Sept).
The first column is the rainfall, the second is a treatment code: 1
= unseeded day, 2 = seeded day.
- Week 4:
-
Schizophrenia data (Case Study 2.2) Each line of data
corresponds to a pair of maternal twins. The first number is the
hippocampus volume for the unaffected individual. The second number
is the hippocampus volume for the individuals affected by
schizophrenia.
- Darwin
cross/self fertilization data In the Darwin data, each row of the data set represents a pair of plants.
The second is the height (in inches) of the cross-fertilized plant. The
third is the height of the self-fertilized plant.
-
Hamburger bacteria data.
Quantification of bacterial load on hamburger patties that received
a treatment to reduce bacteria (active) or nothing (control). cfu is the
measure of bacteria on the patty.
- Microsatellite counts for mutagen
study. The two columns are the dose of mutagen (0 or 80) and the
count of microsatellite nuclei in 100 cells.
- Exercise and walking data First column is the age when the infant first walked. The second is
the treatment (ACTIVE or NONE) they received.
- Galapagos finch data set The two columns are the year of measurement and the beak depth.
- Traffic fatalities and speed limit data The three columns are the state name, whether they
increased their speed limit in 1997 (No or Yes), and the % change in traffic fatalities between 1996 and 1998.
- hamburger.txt The two columns are the treatment name and the CFU/gm (colony forming units per
gm of meat)
- Patty.txt Bacterial count data with three plate
counts per patty (eu)
- Week 5:
- Bee data for chapter 3, problem
28. First column is proportion of pollen removed, 2nd column is
duration of visit (in seconds) and 3nd column is type of bee:
1 = bumblebee, 2 =
honeybee workers.
- Insulating fluid data Columns are the voltage
(in kV) and the time until breakdown (in minutes).
- Glass roughness data. Data on surface roughness
of polished glass for HW 5. Columns are the treatment, the blank number,
and the roughness in micro-inches.
- Diet
and longevity data set (Case study 5.1), for diet.sas in lab on 19 Sept 2012.
- tobeave.csv for tobeave.sas, parts
of which were discussed in lab on 19 Sept 2012
- Week 6:
- Cavity size data Bird nest cavity size, for
HW 5. Columns are the species name and the size (square mm) of the opening to the
nest cavity.
- Tyrannosaurus data Oxygen
isotope data on bones from a single T. rex for problem 5:23. Column 1 is the oxygen
isotope value; column 2 is the bone number.
- Diet study dataWeight loss after 24 months
on three diets. Column 1 is the subject number, 2 is the treatment, 3 is the weight loss
in kg.
- Week 7:
- logging.csv: logging data for midterm 1. Columns are:
Plot, transect, trt (L=logged, U=unlogged), sdl09 = number of seedlings on the transect in 2009,
sdl10 = number of seedlings on the transect in 2010, change = percent change from 2009 to 2010.
- concrete.txt: concrete data for midterm 1. Columns are:
mile number, type of concrete used for that mile, and the surface smoothness.
- prrs1.txt: PRRS diagnostic test data, version 1, for midterm 1
Columns are the pig number, the log count of PRRS from the ELISA test, and log count of PRRS
from the FA test.
- prrs2.txt: PRRS diagnostic test data, version 2, for midterm 1
Columns are the pig number, the test name, and the log count of PRRS
- Week 8:
- meat.txt Meat pH data (case study 6.2),
Used as lecture example and for meat.sas in lab on 3 Oct 2012.
- Peanut aflatoxin data Peanut and aflatoxin concentration data. The first column is the
percent clean peanuts. The second is the aflatoxin concentration
(ppb).
- lettuce.csv Lettuce yield
response to fertilizer. Data are the fertilize applied to the plot
and the number of heads of lettuce harvested from the plot.
- music.txt Music / brain activity data
First column is the number of years the subject has played a
string instrument. The second is the neuronal activity index, a
measure of brain activity.
- Week 9:
- Wine consumption and heart attack mortality data for HW 7
Columns are the name of the country, the wine consumption (liters per
person per year), and the heart disease mortality.
- Planet data) for HW 7.
Problem 7.14, data from display 1.15. Three columns: planet name,
order from sun, and distance from sun.
- Eruption.txt Wait time
between eruptions of the Old Faithful geyser in Yellowstone.
Columns are date (ignore for hw 7 problem), interval between
eruptions (in minutes), and duration of the interval (in minutes).
- anscombe.txt Anscombe data sets.
- brain.txt Data for brain.sas
- Week 10:
- Week 10:
- Week 11:
- Week 13:
Week 14:
- Week 15:
- Datasets for final exam