Most of the datasets I used come from the Sleuth3
package, which you can install and load into R using:
install.packages("Sleuth3")
library(Sleuth3)
Many other datasets come from the wonderfully curated list made by Larry Winner at the University of Florida: http://users.stat.ufl.edu/~winner/datasets.html. I’ll mark these with a “(W)” by the name of the file.
I will limit this list to those datasets that aren’t in the Sleuth3
package.
index
: The order number of the infant.weight
: The weight of the infant (in ounces).foamIndx
: The ratio between the foam and liquid volume (expressed as a percent) measured 30s after extraction.trt_id
: A label for each treatment group.tempC
: The temperature of the water in Celsius (75, 85, and 90).prssBar
: The pressure in bars (15 and 20).conc
: Tissue concentration of LSD in nanograms of LSD per mL.score
: Performance score on the arithmetic questions as a percent of a control score.Skill
: Either Expert
or Average
.Hand
: Either Bad
, Neutral
, or Good
.Limit
: Either Fixed
or None
.Cash
: The final cash balance.sardine
data frame from Clarke (1936) contains the number of vertebrae and the location of 12,858 sardines. Variables include:
location
: The location of the sardine. 1
= Alaska, 2
= British Columbia, 3
= San Francisco, 4
= Monterey, 5
= San Pedro, and 6
= San Diego.number
: The number of vertebrae of the sardine.trek
data frame consists of word counts and proportion of words spoken by the main characters from the excellent television series Star Trek: The Next Generation. I obtained the original dataset here. The variables in this dataset are
season
The season of the show.character
The character of the show.count
The raw word-counts for a given character in a given season.prop
The proportion of words spoken by a given character in a given season.logprop
The log of prop
.Clarke, F.N. 1936. “Variations in the Number of Vertebrae in Sardine, Sardinops Caerulea (Girard).” Copeia 1936 (3): 147–50.
Masella, Piernicola, Lorenzo Guerrini, Silvia Spinelli, Luca Calamai, Paolo Spugnoli, Francesco Illy, and Alessandro Parenti. 2015. “A New Espresso Brewing Method.” Journal of Food Engineering 146. Elsevier: 204–8.
Meyer, Gerhard, Marc von Meduna, Tim Brosowski, and Tobias Hayer. 2013. “Is Poker a Game of Skill or Chance? A Quasi-Experimental Study.” Journal of Gambling Studies 29 (3). Springer: 535–50.
Rosner, Bernard. 2015. Fundamentals of Biostatistics. Nelson Education.
Wagner, John G, George K Aghajanian, and Oscar HL Bing. 1968. “Correlation of Performance Test Scores with "Tissue Concentration" of Lysergic Acid Diethylamide in Human Subiects.” Clinical Pharmacology & Therapeutics 9 (5). Wiley Online Library: 635–38.