Data
Acorns

Acorns

active ARFF Publicly available Visibility: public Uploaded 07-10-2014 by Joaquin Vanschoren
0 likes downloaded by 1 people , 1 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Author: Source: Unknown - Date unknown Please cite: Datasets of Data And Story Library, project illustrating use of basic statistic methods, converted to arff format by Hakan Kjellerstrand. Source: TunedIT: http://tunedit.org/repo/DASL DASL file http://lib.stat.cmu.edu/DASL/Datafiles/Acorns.html Acorn Size Oak Distribution Reference: Aizen and Patterson. (1990). Journal of Biogeography, volume 17, p. 327-332. Authorization: contact authors Description: Interest lies is the relationship between the size of the acorn and the geographic range of the oak tree species. Note that the Quercus tomentella Engelm species in the California region grows only on the Channel Islands (total area 1014 sq. km) and the island of Guadalupe (total area 265 sq. km). All other species grow on the Continental United States. Number of cases: 39 Variable Names: Species: Latin name of the species Region: Atlantic or California region Range: The geographic area covered by the species in km2x100 Acorn_size: Acorn size (cm3) Tree_height: Tree Height (m)

4 features

Acorn_size (target)numeric27 unique values
0 missing
Species (ignore)nominal39 unique values
0 missing
Regionnominal2 unique values
0 missing
Rangenumeric38 unique values
0 missing
Tree_Heightnumeric19 unique values
0 missing

19 properties

39
Number of instances (rows) of the dataset.
4
Number of attributes (columns) of the dataset.
0
Number of distinct values of the target attribute (if it is nominal).
0
Number of missing values in the dataset.
0
Number of instances with at least one value missing.
3
Number of numeric attributes.
1
Number of nominal attributes.
Number of instances belonging to the least frequent class.
1
Number of binary attributes.
25
Percentage of binary attributes.
0
Percentage of instances having missing values.
-2.65
Average class difference between consecutive instances.
0
Percentage of missing values.
0.1
Number of attributes divided by the number of instances.
75
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
25
Percentage of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.

13 tasks

0 runs - estimation_procedure: 10 times 10-fold Crossvalidation - evaluation_measure: mean_absolute_error - target_feature: Acorn_size
0 runs - estimation_procedure: 10-fold Crossvalidation - evaluation_measure: mean_absolute_error - target_feature: Acorn_size
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
Define a new task