A data set from De Cock (2011) in which 82 fields were recorded for 2,930 properties in Ames, IA. This version is copied from the AmesHousing package but does not include a few quality columns that appear to be outcomes rather than predictors.

## Source

De Cock, D. (2011). "Ames, Iowa: Alternative to the Boston Housing Data as an End of Semester Regression Project," Journal of Statistics Education, Volume 19, Number 3.

http://jse.amstat.org/v19n3/decock.pdf

## Details

See the Source section for more information, as well as ?AmesHousing::make_ames.
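To see which columns this version leaves out relative to the AmesHousing version, the two sets of column names can be compared. This is a minimal sketch, assuming both the modeldata and AmesHousing packages are installed; the object names `ames_full` and `dropped` are illustrative only.

```r
# Load this version of the data (assumed here to ship with modeldata)
data(ames, package = "modeldata")

# Build the AmesHousing version for comparison
ames_full <- AmesHousing::make_ames()

# Columns present in the AmesHousing version but not in this one
dropped <- setdiff(names(ames_full), names(ames))
dropped
```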

For these data, the training materials typically use:

library(tidymodels)

set.seed(4595)
data_split <- initial_split(ames, strata = "Sale_Price")
ames_train <- training(data_split)
ames_test  <- testing(data_split)

set.seed(2453)
ames_folds <- vfold_cv(ames_train)
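As a hedged sketch of how these objects might be used downstream, the folds can be passed to `fit_resamples()`. The formula, the log10 transformation of the outcome, and the choice of a plain linear model are illustrative assumptions, not something taken from the training materials.

```r
library(tidymodels)

data(ames, package = "modeldata")

# Recreate the split and folds with the same seeds as above
set.seed(4595)
data_split <- initial_split(ames, strata = "Sale_Price")
ames_train <- training(data_split)
ames_test  <- testing(data_split)

set.seed(2453)
ames_folds <- vfold_cv(ames_train)

# Illustrative workflow: predictors and log10 outcome are assumptions
lm_wflow <- workflow() %>%
  add_formula(log10(Sale_Price) ~ Gr_Liv_Area + Year_Built) %>%
  add_model(linear_reg() %>% set_engine("lm"))

# Fit the model on each of the folds and summarize the metrics
lm_res <- fit_resamples(lm_wflow, resamples = ames_folds)
collect_metrics(lm_res)
```

The test set `ames_test` is left untouched here; it would normally be used only once, after a model has been chosen from the resampling results.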

## Examples

data(ames)
str(ames)
#> tibble [2,930 × 74] (S3: tbl_df/tbl/data.frame)
#>  $ MS_SubClass       : Factor w/ 16 levels "One_Story_1946_and_Newer_All_Styles",..: 1 1 1 1 6 6 12 12 12 6 ...
#>  $ MS_Zoning         : Factor w/ 7 levels "Floating_Village_Residential",..: 3 2 3 3 3 3 3 3 3 3 ...
#>  $ Lot_Frontage      : num [1:2930] 141 80 81 93 74 78 41 43 39 60 ...
#>  $ Lot_Area          : int [1:2930] 31770 11622 14267 11160 13830 9978 4920 5005 5389 7500 ...
#>  $ Street            : Factor w/ 2 levels "Grvl","Pave": 2 2 2 2 2 2 2 2 2 2 ...
#>  $ Alley             : Factor w/ 3 levels "Gravel","No_Alley_Access",..: 2 2 2 2 2 2 2 2 2 2 ...
#>  $ Lot_Shape         : Factor w/ 4 levels "Regular","Slightly_Irregular",..: 2 1 2 1 2 2 1 2 2 1 ...
#>  $ Land_Contour      : Factor w/ 4 levels "Bnk","HLS","Low",..: 4 4 4 4 4 4 4 2 4 4 ...
#>  $ Utilities         : Factor w/ 3 levels "AllPub","NoSeWa",..: 1 1 1 1 1 1 1 1 1 1 ...
#>  $ Lot_Config        : Factor w/ 5 levels "Corner","CulDSac",..: 1 5 1 1 5 5 5 5 5 5 ...
#>  $ Land_Slope        : Factor w/ 3 levels "Gtl","Mod","Sev": 1 1 1 1 1 1 1 1 1 1 ...
#>  $ Neighborhood      : Factor w/ 29 levels "North_Ames","College_Creek",..: 1 1 1 1 7 7 17 17 17 7 ...
#>  $ Condition_1       : Factor w/ 9 levels "Artery","Feedr",..: 3 2 3 3 3 3 3 3 3 3 ...
#>  $ Condition_2       : Factor w/ 8 levels "Artery","Feedr",..: 3 3 3 3 3 3 3 3 3 3 ...
#>  $ Bldg_Type         : Factor w/ 5 levels "OneFam","TwoFmCon",..: 1 1 1 1 1 1 5 5 5 1 ...
#>  $ House_Style       : Factor w/ 8 levels "One_and_Half_Fin",..: 3 3 3 3 8 8 3 3 3 8 ...
#>  $ Overall_Cond      : Factor w/ 10 levels "Very_Poor","Poor",..: 5 6 6 5 5 6 5 5 5 5 ...
#>  $ Year_Built        : int [1:2930] 1960 1961 1958 1968 1997 1998 2001 1992 1995 1999 ...
#>  $ Year_Remod_Add    : int [1:2930] 1960 1961 1958 1968 1998 1998 2001 1992 1996 1999 ...
#>  $ Roof_Style        : Factor w/ 6 levels "Flat","Gable",..: 4 2 4 4 2 2 2 2 2 2 ...
#>  $ Roof_Matl         : Factor w/ 8 levels "ClyTile","CompShg",..: 2 2 2 2 2 2 2 2 2 2 ...
#>  $ Exterior_1st      : Factor w/ 16 levels "AsbShng","AsphShn",..: 4 14 15 4 14 14 6 7 6 14 ...
#>  $ Exterior_2nd      : Factor w/ 17 levels "AsbShng","AsphShn",..: 11 15 16 4 15 15 6 7 6 15 ...
#>  $ Mas_Vnr_Type      : Factor w/ 5 levels "BrkCmn","BrkFace",..: 5 4 2 4 4 2 4 4 4 4 ...
#>  $ Mas_Vnr_Area      : num [1:2930] 112 0 108 0 0 20 0 0 0 0 ...
#>  $ Exter_Cond        : Factor w/ 5 levels "Excellent","Fair",..: 5 5 5 5 5 5 5 5 5 5 ...
#>  $ Foundation        : Factor w/ 6 levels "BrkTil","CBlock",..: 2 2 2 2 3 3 3 3 3 3 ...
#>  $ Bsmt_Cond         : Factor w/ 6 levels "Excellent","Fair",..: 3 6 6 6 6 6 6 6 6 6 ...
#>  $ Bsmt_Exposure     : Factor w/ 5 levels "Av","Gd","Mn",..: 2 4 4 4 4 4 3 4 4 4 ...
#>  $ BsmtFin_Type_1    : Factor w/ 7 levels "ALQ","BLQ","GLQ",..: 2 6 1 1 3 3 3 1 3 7 ...
#>  $ BsmtFin_SF_1      : num [1:2930] 2 6 1 1 3 3 3 1 3 7 ...
#>  $ BsmtFin_Type_2    : Factor w/ 7 levels "ALQ","BLQ","GLQ",..: 7 4 7 7 7 7 7 7 7 7 ...
#>  $ BsmtFin_SF_2      : num [1:2930] 0 144 0 0 0 0 0 0 0 0 ...
#>  $ Bsmt_Unf_SF       : num [1:2930] 441 270 406 1045 137 ...
#>  $ Total_Bsmt_SF     : num [1:2930] 1080 882 1329 2110 928 ...
#>  $ Heating           : Factor w/ 6 levels "Floor","GasA",..: 2 2 2 2 2 2 2 2 2 2 ...
#>  $ Heating_QC        : Factor w/ 5 levels "Excellent","Fair",..: 2 5 5 1 3 1 1 1 1 3 ...
#>  $ Central_Air       : Factor w/ 2 levels "N","Y": 2 2 2 2 2 2 2 2 2 2 ...
#>  $ Electrical        : Factor w/ 6 levels "FuseA","FuseF",..: 5 5 5 5 5 5 5 5 5 5 ...
#>  $ First_Flr_SF      : int [1:2930] 1656 896 1329 2110 928 926 1338 1280 1616 1028 ...
#>  $ Second_Flr_SF     : int [1:2930] 0 0 0 0 701 678 0 0 0 776 ...
#>  $ Gr_Liv_Area       : int [1:2930] 1656 896 1329 2110 1629 1604 1338 1280 1616 1804 ...
#>  $ Bsmt_Full_Bath    : num [1:2930] 1 0 0 1 0 0 1 0 1 0 ...
#>  $ Bsmt_Half_Bath    : num [1:2930] 0 0 0 0 0 0 0 0 0 0 ...
#>  $ Full_Bath         : int [1:2930] 1 1 1 2 2 2 2 2 2 2 ...
#>  $ Half_Bath         : int [1:2930] 0 0 1 1 1 1 0 0 0 1 ...
#>  $ Bedroom_AbvGr     : int [1:2930] 3 2 3 3 3 3 2 2 2 3 ...
#>  $ Kitchen_AbvGr     : int [1:2930] 1 1 1 1 1 1 1 1 1 1 ...
#>  $ TotRms_AbvGrd     : int [1:2930] 7 5 6 8 6 7 6 5 5 7 ...
#>  $ Functional        : Factor w/ 8 levels "Maj1","Maj2",..: 8 8 8 8 8 8 8 8 8 8 ...
#>  $ Fireplaces        : int [1:2930] 2 0 0 2 1 1 0 0 1 1 ...
#>  $ Garage_Type       : Factor w/ 7 levels "Attchd","Basment",..: 1 1 1 1 1 1 1 1 1 1 ...
#>  $ Garage_Finish     : Factor w/ 4 levels "Fin","No_Garage",..: 1 4 4 1 1 1 1 3 3 1 ...
#>  $ Garage_Cars       : num [1:2930] 2 1 1 2 2 2 2 2 2 2 ...
#>  $ Garage_Area       : num [1:2930] 528 730 312 522 482 470 582 506 608 442 ...
#>  $ Garage_Cond       : Factor w/ 6 levels "Excellent","Fair",..: 6 6 6 6 6 6 6 6 6 6 ...
#>  $ Paved_Drive       : Factor w/ 3 levels "Dirt_Gravel",..: 2 3 3 3 3 3 3 3 3 3 ...
#>  $ Wood_Deck_SF      : int [1:2930] 210 140 393 0 212 360 0 0 237 140 ...
#>  $ Open_Porch_SF     : int [1:2930] 62 0 36 0 34 36 0 82 152 60 ...
#>  $ Enclosed_Porch    : int [1:2930] 0 0 0 0 0 0 170 0 0 0 ...
#>  $ Three_season_porch: int [1:2930] 0 0 0 0 0 0 0 0 0 0 ...
#>  $ Screen_Porch      : int [1:2930] 0 120 0 0 0 0 0 144 0 0 ...
#>  $ Pool_Area         : int [1:2930] 0 0 0 0 0 0 0 0 0 0 ...
#>  $ Pool_QC           : Factor w/ 5 levels "Excellent","Fair",..: 4 4 4 4 4 4 4 4 4 4 ...
#>  $ Fence             : Factor w/ 5 levels "Good_Privacy",..: 5 3 5 5 3 5 5 5 5 5 ...
#>  $ Misc_Feature      : Factor w/ 6 levels "Elev","Gar2",..: 3 3 2 3 3 3 3 3 3 3 ...
#>  $ Misc_Val          : int [1:2930] 0 0 12500 0 0 0 0 0 0 0 ...
#>  $ Mo_Sold           : int [1:2930] 5 6 6 4 3 6 4 1 3 6 ...
#>  $ Year_Sold         : int [1:2930] 2010 2010 2010 2010 2010 2010 2010 2010 2010 2010 ...
#>  $ Sale_Type         : Factor w/ 10 levels "COD","Con","ConLD",..: 10 10 10 10 10 10 10 10 10 10 ...
#>  $ Sale_Condition    : Factor w/ 6 levels "Abnorml","AdjLand",..: 5 5 5 5 5 5 5 5 5 5 ...
#>  $ Sale_Price        : int [1:2930] 215000 105000 172000 244000 189900 195500 213500 191500 236500 189000 ...
#>  $ Longitude         : num [1:2930] -93.6 -93.6 -93.6 -93.6 -93.6 ...
#>  $ Latitude          : num [1:2930] 42.1 42.1 42.1 42.1 42.1 ...