| Type: | Package |
| Title: | Data Sets for "Mathematical Statistics with Resampling and R" (3rd Ed) |
| Version: | 1.0 |
| Author: | Laura Chihara [aut], Tim Hesterberg [aut, cre] |
| Date: | 2022-09-01 |
| Maintainer: | Tim Hesterberg <timhesterberg@gmail.com> |
| Description: | Data sets for Chihara and Hesterberg (2022, ISBN: 978-1-119-87404-1) "Mathematical Statistics with Resampling in R" (3rd Ed). |
| Depends: | R (≥ 4.2.0) |
| License: | CC0 |
| URL: | https://github.com/lchihara/MathStatsResamplingR |
| LazyData: | true |
| NeedsCompilation: | no |
| Packaged: | 2022-09-01 15:58:40 UTC; timhesterberg |
| Repository: | CRAN |
| Date/Publication: | 2022-09-02 08:10:02 UTC |
Data Sets for "Mathematical Statistics with Resampling and R" (3rd Ed)
Description
Data sets for Chihara and Hesterberg (2022, ISBN: 978-1-119-87404-1) "Mathematical Statistics with Resampling in R" (3rd Ed). https://github.com/lchihara/MathStatsResamplingR
Examples
# For a list of datasets do:
library(help = resampledata3)
Alcohol content and calories of beers
Description
Alcohol content and calories for a sample of ale and lager beers.
Usage
Alelager
Format
A data frame with 31 observations on the following 4 variables.
IDSubject ID
TypeBeer: ale or lager
AlcoholPercentage alcohol content
CaloriesNumber of calories
Arsenic levels of wells in Bangladesh
Description
Levels of arsenic, chlorine and cobalt in a sample of 271 wells in Bangladesh.
Usage
Bangladesh
Format
A data frame with 271 observations on the following 3 variables.
ArsenicArsenic level, ppb
ChlorineChlorine level, ppb
CobaltCobalt level, ppb
Source
https://www2.bgs.ac.uk/groundwater/health/arsenic/Bangladesh/data.html
References
Reproduced with the permission of the British Geological Survey, copyright UKRI. All Rights Reserved.
Beer and hotwings consumption
Description
Beer and hotwings consumption by a sample of patrons at a Minneapolis bar.
Usage
Beerwings
Format
A data frame with 30 observations on the following 4 variables.
IDSubject ID
HotwingsNumber of hotwings consumed
BeerOunces of beers consumed
GenderGender of patron (M/F)
Source
Data collected by Nicole Catchpole in 2004 (private communication).
Price of textooks at a college bookstore
Description
Price of textbooks at a college bookstore.
Usage
BookPrices
Format
A data frame with 44 observations on the following 3 variables.
SubjectBiologyChemistryComputer ScienceEconomicsEducational StudiesGeologyMathematicsPhysicsPolitical SciencePsychologySOANAreaClassification of subject as either
Math & ScienceorSocial SciencesPricePrice in U.S.~dollars
Source
Data collected by R.~Hien and S.~Becker in 2010 (private communication).
Fish supply and demand for bushmeat in Ghana
Description
Fish supply (kg) and demand for bushmeat in Ghana.
Usage
Bushmeat
Format
A data frame with 30 observations on the following 4 variables.
FishFish supply (in kg.) per capita
BiomassBiomass
YearYear
ChangePercent change in biomass
Details
Biomass of large mammals was calculated for each year by multiplying the number of animals observed in 700 walking counts of 10 to 15 km each by species-specific body weights. The products of these calculations were then summed across all species.
Source
Brashares, Arces, Sam, Coppolillo, Sinclaire, Balmford, Bushmeat hunting, wildlife declines, and fish supply in West Africa, Science. 2004 Nov 12.
Cafeteria
Description
Nutritional data on meals served in a college cafeteria.
Usage
Cafeteria
Format
A data frame with 41 observations on the following 9 variables.
IDa numeric vector
Typetype of meal,
MeatorVegetarianCaloriesnumber of calories
Carbohydratesnumber of carbohydrates
Fiberfiber content
Fatfat content
Cholesterolcholesterol
Proteinprotein
Sodiumsodium
Source
Stephenson (private communication).
Cereals
Description
Nutritional data on a sample of cereals.
Usage
Cereals
Format
A data frame with 43 observations on the following 5 variables.
IDa numeric vector
Agetarget consumer,
adultorchildrenShelflocation of cereal,
bottom,middle, ortopshelfSodiumgramsodium content in grams
Proteingramprotein content in grams
Challenger
Description
Data on O-rings in 23 space shuttle flights prior to the Challenger shuttle disaster of January 1986.
Usage
Challenger
Format
A data frame with 23 observations on the following 3 variables.
DateData of launch
TemperatureAir temperature at launch (F)
IncidentBinary variable, 1 if one of the 0-rings on one of the booster rockets was damaged, 0 otherwise
Source
https://archive.ics.uci.edu/ml/datasets/Challenger+USA+Space+Shuttle+O-Ring
References
Dala, S.~R., Fowlkes, E.~B., Hoadley, B (1989). Risk analysis of the space shuttle: pre-Challenger prediction of failure. J.~American Statistical Association, 84, 945-957.
ChiMarathonMen
Description
Times from a sample of men who completed the Chicago marathon in 2015.
Usage
data("ChiMarathonMen")
Format
A data frame with 80 observations on the following 4 variables.
nameName of competitor
DivisionAge group
FinishFinish time
FinishMinTime in minutes
Source
https://chicago-history.r.mikatiming.com/
Cuckoos
Description
Female cuckoos lay their eggs on the ground and then move them to the nests of other birds. Latter gathered data on the lengths of the cuckoo eggs found in these foster nests.
Usage
data("Cuckoos")
Format
A data frame with 120 observations on the following 2 variables.
EggsLengths of eggs (mm) of cuckoos
BirdSpecies of birds:
HedgeSparrow,MeadowPipit,PiedWagtail,Robin,TreePipit,Wren
Source
Tippett, L. H. C. (1952). The Methods of Statistics, 4th Edition. Wiley.
References
Latter, O. (1902). An enquiry into the dimensions of the Cuckoo's egg and the relation of the variations to the size of eggs of the foster-parent, with notes on coloration. Biometrika 1 (2): 164-176.
Diving 2017
Description
Scores of 12 female divers (10 m platform) in the 2017 FINA World Championships.
Usage
data("Diving2017")
Format
A data frame with 12 observations on the following 4 variables.
NameName of competitor
CountryCountry
SemifinalScore in the semi-finals
FinalScore in the finals.
Details
Competitors perform 5 dives in each round and the sum of these 5 dives determines who moves on to the next round.
Source
https://www.fina.org/competitions/213/17th-fina-world-championships-2017/results?disciplines=DV
Eyes
Description
Measurements of eyes of 40 people.
Usage
data("Eyes")
Format
A data frame with 40 observations on the following 6 variables.
IDSubject ID
ageAge of subject
handDominant hand of subject,
leftorrighteyeDominant eye of subject,
leftorrightleftPDLeft pupillary distance (mm)
rightPDRight pupillary distance (mm)
Source
Westfield (private communication).
Fatalities
Description
A random sample of driver fatalities in 2009 in Pennsylvania.
Usage
Fatalities
Format
A data frame with 100 observations on the following 3 variables.
IDSubject ID
AlcoholAlcohol involved? 1 = yes, 0 = no
AgeAge
Details
The drivers were driving a car, SUV, or light pickup truck (vehicles such as motor homes, convertibles, or commercial vehicles are excluded).
Source
http://www.nhtsa.gov/FARS
Mercury content in a sample of fish in Minnesota
Description
Mercury levels (ppm) in a sample of fish caught in Minnesota
Usage
FishMercury
Format
A data frame with 30 observations on the following variable.
MercuryMercury level in ppm
Source
Minnesota pollution control agency.
Length of delays of airline flights
Description
Length of delays for flights on American Airlines and United Airlines in 2009
Usage
data("FlightDelays")
Format
A data frame with 4029 observations on the following 10 variables.
IDSubject ID
CarrierAirline: American Airlines
AAor United AirlinesUAFlightNoFlight number
DestinationDestination:
BNA,DEN,DFW,IAD,MIA,ORD,STLDepartTimeDeparture time:
4-8am4-8pm8-Mid8-NoonNoon-4pmDayDay of week
MonthMonth:
MayorJuneFlightLengthLength of flight
DelayDelay time (in minutes)
Delayed30Delayed more than 30 minutes?
NoorYes
Details
All departures of AA or UA flights from LaGuardia Airport in May or June of 2009.
Source
https://www.bts.gov/topics/airlines-and-airports/quick-links-popular-air-carrier-statistics
General Social Survey 2018
Description
General Social Survey data from 2018
Usage
GSS2018
Format
A data frame with 2348 observations on the following 17 variables.
IDSubject ID
RegionMidwest,Northeast,South,WestGenderNowGender of subject:
A gender not listed here,Man,Not applicable,Transgender,WomanAgeAge
MaritalMarital status:
Divorced,Married,Never married,Separated,WidowedDegreeEducation:
BachelorGraduate,High schoolJunior college,Less than high schoolEmployedEmployed?
NoorYesIncomeIncome level
PolviewsPolitical views:
Conservative,Extremely liberal,Extremely conservative,Liberal,Moderate,Slightly conservative,Slightly liberalPres16Voted for whom in presidential election of 2016?
Clinton,Other,TrumpDeathPenaltyOpinion on death penalty:
Favor,OpposeCourtsHow courts deal with criminals:
About right,Dont know,Not harsh enough,Too harshAttendAttendance at religious services:
Monthly,Never,Occasionally,WeeklyPostlifeBelieve in life after death?
Dont know,No,YesHappyGeneral happiness level:
Not too happy,Pretty happy,Very happySatfinSatisfaction with financial situation:
More or less,Not at all,SatisfiedEnergyGovernment spending on developing alternative energy sources:
About right,Dont know,Too little,Too much
Source
https://gss.norc.org
Births of girls in Alaska or Wyoming
Description
Data on births of a random sample of girls in Alaska or Wyoming in 2004.
Usage
data("Girls2004")
Format
A data frame with 80 observations on the following 6 variables.
IDSubject ID
StateState:
AKorWYMothersAgeAge of mother:
15-19,20-24,25-29,30-34,35-39,40-44SmokerMother a smoker?
NoorYesWeightWeight of baby (grams)
GestationGestation time (weeks)
Source
http://wonder.cdc.gov/natality-current.html
Groceries
Description
Prices of a sample of grocery items at Target or Walmart.
Usage
Groceries
Format
A data frame with 30 observations on the following 4 variables.
ProductGrocery item
SizePackage size
TargetPrice at Target
WalmartPrice at Walmart
Birth weight of boys born in Illinois
Description
Birth weight of boys born in Illinois.
Usage
ILBoys
Format
A data frame with 241 observations on the following 2 variables.
MothersAgeAge range of mother:
15-19,20-24,25-29WeightWeight of baby (gm)
Details
Random sample of boys born to mothers in Illinois in 2004. Births are restricted to single births only and gestation lengths of at least 37 weeks.
Ice Cream
Description
Nutritional information on a sample of ice cream.
Usage
data("IceCream")
Format
A data frame with 39 observations on the following 7 variables.
BrandBrand of ice cream
VanillaCaloriesCalories in vanilla
VanillaFatFat (gm) in vanilla ice cream
VanillaSugarSugar (gm) in vanilla ice cream
ChocolateCaloriesCalories in chocolate ice cream
ChocolateFatFat (gm) in chocolate ice cream
ChocolateSugarSugar (gm) in chocolate ice cream
Illiteracy
Description
Data on female illiteracy in a sample of countries where illiteracy is more than 5%.
Usage
Illiteracy
Format
A data frame with 94 observations on the following 4 variables.
IDCountry ID
CountryName of country
IllitPercentage of women over 15 years old who are illiterate (2003)
BirthsNumber of births per woman in that country (2005)
Source
www.unesco.org, www.data.worldbank.org
Lottery
Description
Winning lottery numbers for Fantasy 5 in California.
Usage
Lottery
Format
A data frame with 500 observations on the following variable.
WinNumber
Details
In Fantasy 5, a lottery game in California, a player tries to match 5 numbers chosen from 1 through 39. This data are the winning numbers for the daily games from 5 May 2010 through 15 August 2010.
Source
http://www.calottery.com/play/draw-games/fantasy-5
Math Anxiety
Description
Data from a study on math anxiety in a sample of primary and secondary school students in Italy
Usage
MathAnxiety
Format
A data frame with 599 observations on the following 6 variables.
AgeAge
GenderGender:
Boy,GirlGradeGrade:
Secondary,PrimaryAMASScore on Abbreviated Math Anxiety Scale
RCMASScore on Revised Abbreviated Math Anxiety Scale
ArithScore on arithmetic test
Source
Hill, Mammarella, Devine, et al (2016). Maths anxiety in primary and secondary school students: gender differences, developmental changes and anxiety specificity. Learning and Individual Differences 48: 45-53
Carbon dioxide levels collected by Mauna Loa Observatory
Description
Average CO2 levels (ppm) for the month of May from 1990 to 2010.
Usage
Maunaloa
Format
A data frame with 21 observations on the following 3 variables.
IDSubject ID
YearYear
LevelCarbon dioxide level (ppm)
Source
https://www.esrl.noaa.gov/gmd/ccgg/trends
Minnesota groundwater
Description
Measurements on water quality in wells in Minnesota.
Usage
MnGroundwater
Format
A data frame with 895 observations on the following 10 variables.
CountyMinnesota county
Aquifer.GroupType of aquifer:
buried Quaternary,Cambrian,Cretaceous,Devonian,Ordovician,Precambrian,surficial QuaternaryWater.LevelWater level
AlkalinityAlkalinity
AluminumAluminum
ArsenicArsenic
ChlorideChloride
Leadlead
pHpH level
Basin.NameBasin name
Source
Minnesota Pollution Control Agency
Mobile Ads
Description
Google experiment on effectiveness of certain recommendations for bidding on ads.
Usage
MobileAds
Format
A data frame with 655 observations on the following 40 variables.
Campaigna numeric vector
m.impr_posta numeric vector
m.impr_prea numeric vector
m.click_posta numeric vector
m.click_prea numeric vector
m.cost_posta numeric vector
m.cost_prea numeric vector
m.conv_posta numeric vector
m.conv_prea numeric vector
m.value_posta numeric vector
m.value_prea numeric vector
m.cpm_prea numeric vector
m.cpm_posta numeric vector
m.cpc_prea numeric vector
m.cpc_posta numeric vector
m.cpa_prea numeric vector
m.cpa_posta numeric vector
m.cpr_prea numeric vector
m.cpr_posta numeric vector
mult.changea numeric vector
d.impr_posta numeric vector
d.impr_prea numeric vector
d.click_posta numeric vector
d.click_prea numeric vector
d.cost_posta numeric vector
d.cost_prea numeric vector
d.conv_posta numeric vector
d.conv_prea numeric vector
d.value_posta numeric vector
d.value_prea numeric vector
d.cpm_prea numeric vector
d.cpm_posta numeric vector
d.cpc_prea numeric vector
d.cpc_posta numeric vector
d.cpa_prea numeric vector
d.cpa_posta numeric vector
d.cpr_prea numeric vector
d.cpr_posta numeric vector
error.cpr_prea numeric vector
error.cpr_posta numeric vector
Details
Subset of experimental data for one advertiser. See Chihara and Hesterberg textbook for more information.
Source
Ed Lee (Google)
References
Chihara and Hesterberg, Mathematical Statistics with Resampling and R (2022). Wiley.
NBA 2016-2017 season
Description
Basketball statistics for a sample of NBA players from 4 teams for the 2016-2017 season.
Usage
data("NBA1617")
Format
A data frame with 68 observations on the following 13 variables.
NamePlayer name
PositionPosition:
C(center),PF(power forward),PG(point guard),SF(small forward),SG(shooting guard)TeamTeam:
Brooklyn,Charlotte,Cleveland,San AntonioGamesNumber of games played
MinutesNumber of minutes plyaed
PercFGField goal percentage
Perc3P3-point field goal percentage
Perc2P2-point field goal percentage
PercFTFree throw percentage
OffRebOffensive rebounds
DefRebDefensive rebounds
AssistsAssists
BlocksBlocks
Details
Players in this data set played a minimum of 100 minutes during the 2016-2017 season.
Source
https://www.basketball-reference.com/
Birth weights of babies born
Description
Birth weights of babies born in North Carolina in 2004
Usage
NCBirths2004
Format
A data frame with 1009 observations on the following 7 variables.
IDSubject ID
MothersAgeMother's age level
SmokerMother a smoker? codeNo,
YesAlcoholMother consumed alcohol during pregnancy?
No,YesGenderBaby's gender
WeightBaby's weight (gm)
GestationGestation length (weeks)
Details
Babies in this random sample had a gestation period of at least 37 weeks and were single births (that is, not one of a twin or triplet).
Source
http://wonder.cdc.gov/natality-current.html
References
Chihara and Hesterberg, Mathematical Statistics with Resampling and R, 2022 (Wiley).
Nasdaq stock data
Description
Opening and closing stock prices for a random sample of 50 stock funds on NASDAQ on 1 December 2017.
Usage
Nasdaq
Format
A data frame with 50 observations on the following 4 variables.
SymbolStock symbol
OpenOpening price
CloseClosing price
VolumeNumber of shares traded
Source
https://finance.yahoo.com
Olympics 2012
Description
Data on a sample of athletes competing in the 2012 London Olympics.
Usage
Olympics2012
Format
A data frame with 42 observations on the following 7 variables.
NameName of athlete
CountryCountry
AgeAge
SexSex:
F,MHeightHeight (inches)
Weightweight (lb)
SportSport
Oscars
Description
Age and gender of Academy Award winners
Usage
Oscars
Format
A data frame with 188 observations on the following 6 variables.
YearYear of award
ActorName of actor
MovieMovie
GenderGender:
Man,WomanBirthyearBirth year of actor
AgeAge at time of award
Source
https://www.oscars.org/
Philadelphia Phillies data 2009
Description
Baseball data for Philadelphia Phillies during the 2009 season.
Usage
Phillies2009
Format
A data frame with 162 observations on the following 8 variables.
DateDate of game
LocationGame played where:
Away,HomeOutcomeOutcome of game:
Lose,WinOutcome2Outcome recoded: 1=win, 0 = lose
HitsNumber of hits
DoublesNumber of doubles
HomerunsNumber of homeruns
StrikeOutsNumber of strikeouts
Source
https://www.baseball-reference.com/
Quakes
Description
Time between earthquakes for all earthquakes of magnitude 6 or greater (1970-2009).
Usage
data("Quakes")
Format
A data frame with 805 observations on the following 2 variables.
IDSubject ID
TimeDiffTime (days)
Source
http://earthquakes.usgs.gov/earthquakes/eqarchives
Quetzal
Description
Heights of nests and snags for the quetzal (bird).
Usage
Quetzal
Format
A data frame with 21 observations on the following 3 variables.
CountryCountry:
Costa Rica,GuatemalaNestHeight of nest (meters)
SnagHeight of snag (meters)
Details
The quetzal typically nests in abandoned woodpecker nests in dead tree trunks (snags).
Source
Siegfried, D., Linville, D., Hille, D. (2010). Analysis of nest sites and the resplendent quetzal (pharomachrus mocinno): relationship between nest and snag heights. Wilson Journal of Ornithology 122: 608-11.
Rangers and Twins baseball players (2016 season)
Description
Data on baseball players (excluding pitchers) who played for the Texas Rangers or Minnesota Twins.
Usage
data("RangersTwins2016")
Format
A data frame with 27 observations on the following 17 variables.
NameName of player
TeamTeam:
Rangers,TwinsPosPlayer's position
AgeAge in years
GamesNumber of games played
AtBatsNumber of at bats
RunsRuns
HitsHits
DoublesDoubles
TriplesTriples
HRHomeruns
RBIRuns batted in
SBStolen bases
CSCaught stealing
BBBase on balls
SOStrike outs
BABatting average
Details
Data on baseball players (excluding pitchers) who played for the Texas Rangers or Minnesota Twins. These players played at least 50 games. During the 2016 season, the Rangers had the best winning percentage (0.586) in the American League while the Twins had the worst (0.364)
Source
www.baseball-reference.com
Recidivism
Description
Recidivism data from Iowa.
Usage
Recidivism
Format
A data frame with 17022 observations on the following 7 variables.
GenderGender:
F,MAgeAge group:
25-34,35-44,45-54,55 and Older,Under 25Age25Over or Under 25 years of age?
Over 25,Under 25OffenseType of offense:
FelonyMisdemeanorRecidRecidivated?
No,YesTypeReason:
New(new crime),No Recidivism(did not recidivate),Tech(technical violation, such as a parole violation)DaysNumber of days to recidivism; NA if no recidivism
Details
All offenders convicted of either a misdemeanor or felony who were released from an Iowa prison during the 2010 fiscal year ending in June.
Source
https://data.iowa.gov/Public-Safety/3-Year-Recidivism-for-Offenders-Released-from-Pris/mw8r-vqy4
Salaries of baseball players
Description
Salaries of a random sample of baseball players from 1985 and 2015.
Usage
Salaries
Format
A data frame with 70 observations on the following 3 variables.
LeagueLeague:
AmericanNationalSalarySalary (in millions) in 2015 dollars
YearYear: 1985 or 2015
Service times at a college snack bar.
Description
Time to be served at a college snack bar.
Usage
Service
Format
A data frame with 174 observations on the following 2 variables.
IDSubject ID
TimesTime in minutes
Source
Haynor, Lojovich, Syed (private communication, 2010).
Skateboard experiment
Description
Measurement of testosterone levels in males in a skateboard experiment.
Usage
Skateboard
Format
A data frame with 71 observations on the following 3 variables.
AgeAge in years
ExperimenterTreatment (gender of experimenter):
Female,MaleTestosteroneTestosterone level
Details
Results from an experiment where male skateboarders performed tricks in front of either a female or male.
Source
Ronay and Hippel (2010). The presence of an attractive woman elevates testosterone and physical risk taking in young men. Social Psychological and Personality Science 1:57-64.
Figure skating scores for men from the 2010 Winter Olympics.
Description
Short and free skate scores for male figure skaters in the 2010 Winter Olympics (Vancouver).
Usage
Skating2010
Format
A data frame with 24 observations on the following 5 variables.
CountryCountry of skater
NameName
ShortShort program score
FreeFree skate score
TotalTotal
Source
https://skatingscores.com/0910/oly/
Spruce data
Description
Measurements from an experiment on the growth of black spruce seedlings.
Usage
Spruce
Format
A data frame with 72 observations on the following 9 variables.
TreeSubject ID
CompetitionTreatment:
C(competition),NC(no competition)FertilizerTreatment:
F(fertilizer),NF(no fertilizer)Height0Height of seedling at start
Height5Height of seedling after 5 years
Diameter0Diameter of seedling at start
Diameter5Diameter of seedling after 5 years
Ht.changeChange in height
Di.changeChange in diameter
Details
Experiment on growth of black spruce seedlings under treatments of fertilizer-no fertilizer, competition- no competition (weeding).
Source
Camill, Chihara, Adams, et al (2010). Early life history transitions and recruitment of Picea mariana in thawed boreal permafrost peatlands. Ecology 2:448-459.
Starcraft
Description
Number of wins by a sample of Korean players in Starcraft, a strategy video game.
Usage
Starcraft
Format
A data frame with 45 observations on the following 4 variables.
IDSubject ID
RaceChosen race of player:
Protoss,Terran,ZergAgeAge of player
WinsNumber of wins
Source
Evans, private communication. http://www.teamliquid.net/tipd/players
TV commercial lengths
Description
Lengths of television commercials on basic and extended cable TV channel.s
Usage
data("TV")
Format
A data frame with 20 observations on the following 3 variables.
IDSubject ID
TimesTime (min)
CableCable:
Basic,Extended
Details
Lengths of TV commercials during any given half-hour time period.
Source
Rodgers, Robinson (private communication).
Texas birth weights
Description
Weights of babies born in Texas in 2004.
Usage
TXBirths2004
Format
A data frame with 1587 observations on the following 8 variables.
IDSubject ID
MothersAgeMother's age:
15-19,20-24,25-29,30-34,35-39,40-44,under 15SmokerMother smokes?
No,YesGenderGender of baby:
Female,MaleWeightWeight of baby (g)
GestationGestation length (weeks)
NumberBaby a single birth (1), twin (2), etc.
MultiplePart of multiple birth (eg twin, triple)?:
No,Yes
Details
Random sample of babies born in Texas in 2004.
Source
http://wonder.cdc.gov/natality-current.html
Titanic
Description
Subset of Titanic data
Usage
Titanic
Format
A data frame with 658 observations on the following 3 variables.
IDSubject ID
SurvivedSurvival status: 1 = survived, 0 = died
AgeAge of passenger
Details
Subset of passenger data on the Titanic.
Source
https://data.world/nrippner/titanic-disaster-dataset
Turbine
Description
Average daily wind speeds (2010) from Carleton College turbine.
Usage
data("Turbine")
Format
A data frame with 168 observations on the following 4 variables.
Date2010Date
AveKWAverage kilowatts
AveSpeedAverage speed (m/s)
ProductionEnergy output (kilowatt hours)
Source
Carleton College, Northfield MN.
References
Chihara and Hesterberg (2022). Mathematical Statistics with Resampling and R. (Wiley)
Verizon repair times
Description
Repair times by Verizon for its customers or customers of other telephone companies.
Usage
Verizon
Format
A data frame with 1687 observations on the following 2 variables.
TimeRepair time (h)
GroupCustomer:
CLEC(competing local exchange carrier),ILEC(incumbent local exchange carrier)
Details
Verizon is responsible for providing repair service to both its customers (ILEC) and its competitors (ILEC).
References
Chihara and Hesterberg (2022). Mathematical Statistics with Resamplng and R (Wiley).
Volleyball data
Description
Data on a sample of Division I women volleyball teams.
Usage
Volleyball2009
Format
A data frame with 30 observations on the following 4 variables.
TeamTeam
HitPercentHitting percentage
AsstsAssists
KillsKills
Source
http://www.ncaa.org/championships/statistics/womens-volleyball-statistics
Walleye
Description
Lengths and weights of a sample of walleye caught in Minnesota lakes (1990's).
Usage
Walleye
Format
A data frame with 60 observations on the following 2 variables.
LengthLength (inches)
WeightWeight (pounds)
Source
Monson, Minnesota Pollution Control Agency (private communication)
Watertable
Description
Relationship between the depth of the watertable and survival status of black spruce seedings.
Usage
Watertable
Format
A data frame with 360 observations on the following 2 variables.
DepthDepth of watertable (cm)
AliveStatus of seedling: 1 = alive, 0 = dead
Details
Part of the data from an experiment to study factors associated with the growth of black spruce seedlings under various treatments. Status of seedling at the end of the second year of the experiment is noted here.
Source
Camill, Chihara, Adams, et al (2010). Early life history transitions and recruitment of Picae mariana in thawed boreal permafrost peatlands. Ecology 2:448-459.
References
Chihara and Hesterberg (2022). Mathematical Statistics with Resampling and R (Wiley).