Data sets that have
been discussed in class or are used in homework are:
- Fishoil.txt Data file for HW 1
- Tomato.txt Tomato data file for
SAS intro workshop.
- Insect.txt
Insect data file for SAS intro workshop.
- Insect.xls Excel worksheet version of
insect data set.
- Salinity.txt Data file for barley
salinity study. Used to illustrate variance component calculations
- cotton.txt Data on contamination
in seed lots for HW 5. Columns are lot sample contamination.
- hydroxy.txt Updated data on hydroxy value of
two alcohols, nonylphenol and dodecanol, from an interlaboratory
study. Columns are the compound, lab, the
day and the measurement. There are two measurements per day, two
days per lab, and 10 labs per compound. Labs are now numbered 1 to 20.
- porosity.txt Porosity data from HW 5.
Columns are field id, section id, porosity value.
- poro2.txt Corrected porosity data for HW 5.
Columns are field id, section id, porosity value.
- blockex.txt Data for blockex.sas.
Example from Mead, section 5.3, arranged for a paired t-test,
i.e. one row per block (= per pair of e.u.'s)
- brome.txt Data for example of analysis of
a Latin Square. The row blocking variable is proximity to freeway;
the column blocking variability is proximity to a stream. Columns
in the data file are the row, column, treatment, and a measure of
the abundance of the rare species.
- brome2.txt Data for illustration of
multiple latin squares. Made up data based on brome grass study, 3
squares, each 4x4.
- range.txt
Response of blue grama grass to rangeland fertilization for HW 6. Variables
are the treatment, the block number, the response: plant
phosphate concentration, and a treatment code (a - e) that you can
ignore or use instead of the treatment name.
- millet.txt Millet row
spacing study using a latin square design for HW 6. Columns are the row,
column, treatment (spacing), and yield (response).
- Food.txt Data
file for food palatibility study. First column is name of the
protein supplement (c, l, s). Second column is a preference score
(-3 to 3) where 3 = most palatible.
- yogurt.txt Yogurt data for
Midterm I. Columns are the treatment (C or L), the batch number,
the container number, and the K concentration.
- yogurt2.txt Second yogurt data
file for the midterm. Columns are the container number, extract
number, measurement number, and the K concentration.
- grass.txt Grass study for midterm
I. Columns are the county number, the treatment name, and the %
success of bird nests on that field.
- Food2.txt Data file for entire food
palatability study. First column is the treatment code (1 through
6), second column is the sex of the participant, third colume is the
type of the protein supplement (c, l, s). Last column is a
preference score (-3 to 3) where 3 = most palatable.
- multblock.txt Data for example of
multiplicative block effects.
- millet2.txt Millet row spacing study
using two 5 x 5 latin square design for HW 7. Columns are the square, row,
column, treatment (spacing), and yield (response).
- plankton.txt Data for plankton
abundance problem on HW 7. Columns are tow (i.e. the block), species, and
abundance.
- carrot.txt Carrot experiment to look at
effects of seed stock and sowing rate on yield using two way
factorial treatment structure in an RCBD. Columns are treatment
number, stock, rate, block, and yield.
- ratwt.txt Weight gain in rats fed diets
with different protein levels and types. Design is a CRD. Columns
are treatment #, level, type, and weight gain (in gm).
- range2.txt Same data as
range.txt, but labeled for use in a 2 way ANOVA. Variables are
Nrate (3 levels: 0, 50 and 100) and P (- or +), block # and PO4
measurement.
- hyp.txt "Botched" study for HW 9. Study was
designed as an RCBD with 10 blocks and 2 treatments, but many values
are missing. Three columns: block, treatment (a or p) and response.
- alpine1.txt Seedling growth in
alpine tundra. Columns are the experiment (only Penn in this data
set), the species names (nive or grac), the site (Dry, Med, Wet), and
the log transformed weight.
-
Alpine2.txt Potentilla growth at two sites. Columns are place,
species, site and log transformed weight.
- alfalfacut.txt Alfalfa cutting
experiment. 3 varieties, 4 dates of last cutting in fall. Used as
example for a split plot study.
- ryegrass.txt Effect of
manuring and ryegrass strain on pasture productivity.
- splitcornu.txt Effect of
water, P, and N on corn water use efficiency.
-
asparagus.txt Asparagus cutting study. Four harvesting
intensities for asparagus. Plots repeatedly measured in 3 years.
Used as example of repeated measures analysis.
- scab.txt Effect of fungicide on
potato scab for midterm II. Variables are block, 2 treatment codes
(explained in the exam text), amount, season, and disease measure.
- nyssa.txt Height
growth of Nyssa species for Midterm II. Variables are block number,
pool number (from 1 to 15), light level, species code, water depth,
and height growth.
- serum.txt Data
on serum glucose concentration over time for HW 11.
- stream.txt Data on NH4 transport
distance for streams in the US for HW 12.
- camp.txt Data on cAMP, used as class
example of ANCOVA for baseline data.
- heart.txt Data on effects of 2 drugs and a
placebo on heart function.
- viral.txt PRRS data set for final
exam. Columns are the treatment (i.e. the drug), the pig id, the
time, and the reponse: lTCID50, a measure of the amount of virus in a pig.
-
lentil.txt Lentil data set for final exam. Columns are the
location, the weevil treatment, the fertilizer treatment, the weed
treatment, and the yield of lentils.
- spi.txt Soy Protein Isolate data for
final exam. Columns are the subject id, her diet, her baseline bone
mineral content (BMC0) and her BMC after 24 weeks on diet (BMC24).