{ "data_id": "1058", "name": "humans_numeric", "exact_name": "humans_numeric", "version": 1, "version_label": null, "description": "**Author**: \n**Source**: Unknown - Date unknown \n**Please cite**: \n\n%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%\n\n1. Title: Assessing the Reliability of a Human Estimator\n%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%\nThis is a PROMISE Software Engineering Repository data set made publicly\navailable in order to encourage repeatable, verifiable, refutable, and\/or\nimprovable predictive models of software engineering.\n\nIf you publish material based on PROMISE data sets then, please\nfollow the acknowledgment guidelines posted on the PROMISE repository\nweb page http:\/\/promisedata.org\/repository .\n%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%\n\n%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%\n(c) 2007 : Gary Boetticher : boetticher AT uhcl DOT edu Phone: +1 (281) 283 8305\nThis data set is distributed under the\nCreative Commons Attribution-Share Alike 3.0 License\nhttp:\/\/creativecommons.org\/licenses\/by-sa\/3.0\/\n\nYou are free:\n\n* to Share -- copy, distribute and transmit the work\n* to Remix -- to adapt the work\n\nUnder the following conditions:\n\nAttribution. You must attribute the work in the manner specified by\nthe author or licensor (but not in any way that suggests that they endorse\nyou or your use of the work).\n\nShare Alike. If you alter, transform, or build upon this work, you\nmay distribute the resulting work only under the same, similar or a\ncompatible license.\n\n* For any reuse or distribution, you must make clear to others the\nlicense terms of this work.\n* Any of the above conditions can be waived if you get permission from\nthe copyright holder.\n* Apart from the remix rights granted under this license, nothing in\nthis license impairs or restricts the author's moral rights.\n\n\n2. Sources\n(a) Creator: Gary D. Boetticher\n(b) Date: February 20, 2007\n(c) Contact: boetticher AT uhcl DOT edu Phone: +1 (281) 283 8305\n\n3. Donor: Gary D. Boetticher\n\n4. Past Usage: This data was used for:\n\nBoetticher, G., Lokhandwala, N., James C. Helm, Understanding the Human\nEstimator, Second International Predictive Models in Software Engineering\n(PROMISE) Workshop co-located at the 22nd IEEE International Conference on\nSoftware Maintenance, Philadelphia, PA, September, 2006. More information is\navailable at http:\/\/nas.cl.uh.edu\/boetticher\/research.html\n\nSince PROMISE 2006, the data set expanded by about 50 percent. The additional\ntuples allowed us to divide the data into 3 major categories. Those who severely\nunderestimate (first 25 tuples). Those who accurately estimate (next 25 tuples).\nAnd those who severely overestimate (last 25 tuples). The PROMISE 2007 experiments\ncompare the underestimators with the accurate estimators and the overestimators with\nthe accurate estimators.\n\n5. Number of Instances: 75\n\n6. Number of Attributes: 14 independent variables and 1 dependent variable\n\n7. Attribute Information:\n\nNumeric Degree: This attribute refers to the level of education of the participant.\n2=High School, 3=Bachelors, 4=Masters,5=Ph.D.\n\nTechUGCourses: This refers to the number of technical undergraduate courses that\nthe participant has taken.\n\nTechGCourses: This refers to the number of technical graduate courses that\nthe participant has taken.\n\nMgmtUGCourses: This refers to the number of management undergraduate courses that\nthe participant has taken.\n\nMgmtGCourses: This refers to the number of management graduate courses that\nthe participant has taken.\n\nTotal Workshops: This refers to the total number of workshops that\nthe participant has attended.\n\nTotal Conferences: This refers to the total number of conferences that\nthe participant has attended.\n\nTotalLangExp: This refers to the total number of languages and experience in those\nlanguages that the participant has.\n\nHardware Proj Mgmt Exp: This corresponds to the total amount of time that the\nrespondant has been estimating hardware projects.\n\nSoftware Proj Mgmt Exp: This corresponds to the total amount of time that the\nrespondant has been estimating software projects.\n\nNo Of Hardware Proj Estimated: This refers to the total number of hardware projects\nthat the participant has estimated.\n\nNo Of Software Proj Estimated: This refers to the total number of software projects\nthat the participant has estimated.\n\nDomain Exp: The domain experience refers to how much experience the participant has\nin the oil and gas industry.\n\nProcurement Industry Exp: The procurement industry experience refers to the amount\nof time, in years, that the participant has regarding\nprocurement.\n\nABS((TotalEstimates-TotalActual)\/TotalActual): This is the class variable. It\nrepresents the overall relative error for the participant's\nestimates.", "format": "ARFF", "uploader": "Joaquin Vanschoren", "uploader_id": 2, "visibility": "public", "creator": null, "contributor": null, "date": "2014-10-06 23:57:28", "update_comment": "set target feature", "last_update": "2014-10-07 02:52:02", "licence": "Public", "status": "active", "error_message": null, "url": "https:\/\/www.openml.org\/data\/download\/53941\/humans_numeric.arff", "default_target_attribute": "ABS((TotalEstimates-TotalActual)\/TotalActual)", "row_id_attribute": null, "ignore_attribute": null, "runs": 0, "suggest": { "input": [ "humans_numeric", "%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% 1. Title: Assessing the Reliability of a Human Estimator %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage repeatable, verifiable, refutable, and\/or improvable predictive models of software engineering. If you publish material based on PROMISE data sets then, please follow the acknowledgment guid " ], "weight": 5 }, "qualities": { "NumberOfInstances": 75, "NumberOfFeatures": 15, "NumberOfClasses": 0, "NumberOfMissingValues": 0, "NumberOfInstancesWithMissingValues": 0, "NumberOfNumericFeatures": 15, "NumberOfSymbolicFeatures": 0, "PercentageOfBinaryFeatures": 0, "Quartile2StdDevOfNumericAtts": 4.017002154132643, "MaxAttributeEntropy": null, "MinKurtosisOfNumericAtts": 2.271363206893905, "PercentageOfInstancesWithMissingValues": 0, "Quartile3AttributeEntropy": null, "MaxKurtosisOfNumericAtts": 52.50276073314732, "MinMeansOfNumericAtts": 0.7333333333333334, "PercentageOfMissingValues": 0, "Quartile3KurtosisOfNumericAtts": 16.46951796248645, "MaxMeansOfNumericAtts": 11.466666666666667, "MinMutualInformation": null, "PercentageOfNumericFeatures": 100, "Quartile3MeansOfNumericAtts": 3.68, "MaxMutualInformation": null, "MinNominalAttDistinctValues": null, "PercentageOfSymbolicFeatures": 0, "Quartile3MutualInformation": null, "MaxNominalAttDistinctValues": null, "MinSkewnessOfNumericAtts": -0.05478511363912503, "Quartile1AttributeEntropy": null, "Quartile3SkewnessOfNumericAtts": 3.7868210168181866, "MaxSkewnessOfNumericAtts": 6.793355345721039, "MinStdDevOfNumericAtts": 0.59366019946804, "Quartile1KurtosisOfNumericAtts": 4.05942684542849, "Quartile3StdDevOfNumericAtts": 6.595603231091296, "MaxStdDevOfNumericAtts": 14.243380001736728, "MinorityClassPercentage": null, "Quartile1MeansOfNumericAtts": 1.5366666666666664, "StdvNominalAttDistinctValues": null, "MeanAttributeEntropy": null, "MinorityClassSize": null, "Quartile1MutualInformation": null, "MeanKurtosisOfNumericAtts": 10.87316958278109, "NumberOfBinaryFeatures": 0, "Quartile1SkewnessOfNumericAtts": 1.8379934112625027, "MeanMeansOfNumericAtts": 3.160452583032889, "Quartile1StdDevOfNumericAtts": 2.09658667258849, "AutoCorrelation": 0.8561246947027027, "MeanMutualInformation": null, "Quartile2AttributeEntropy": null, "ClassEntropy": null, "MeanNoiseToSignalRatio": null, "Quartile2KurtosisOfNumericAtts": 5.899033331129038, "Dimensionality": 0.2, "MeanNominalAttDistinctValues": null, "Quartile2MeansOfNumericAtts": 1.7901220788266667, "EquivalentNumberOfAtts": null, "MeanSkewnessOfNumericAtts": 2.7105181364724387, "Quartile2MutualInformation": null, "MajorityClassPercentage": null, "MeanStdDevOfNumericAtts": 4.838271188770876, "Quartile2SkewnessOfNumericAtts": 2.3947944434789177, "MajorityClassSize": null, "MinAttributeEntropy": null }, "tags": [], "features": [ { "name": "ABS((TotalEstimates-TotalActual)\/TotalActual)", "index": "14", "type": "numeric", "distinct": "72", "missing": "0", "target": "1", "min": "0", "max": "10", "mean": "2", "stdev": "2" }, { "name": "Numeric Degree", "index": "0", "type": "numeric", "distinct": "5", "missing": "0", "min": "1", "max": "5", "mean": "3", "stdev": "1" }, { "name": "TechUGCourses", "index": "1", "type": "numeric", "distinct": "31", "missing": "0", "min": "-2", "max": "72", "mean": "11", "stdev": "14" }, { "name": "TechGCourses", "index": "2", "type": "numeric", "distinct": "18", "missing": "0", "min": "0", "max": "30", "mean": "5", "stdev": "7" }, { "name": "MgmtUGCourses", "index": "3", "type": "numeric", "distinct": "9", "missing": "0", "min": "0", "max": "12", "mean": "2", "stdev": "3" }, { "name": "MgmtGCourses", "index": "4", "type": "numeric", "distinct": "9", "missing": "0", "min": "0", "max": "12", "mean": "1", "stdev": "2" }, { "name": "Total Workshops", "index": "5", "type": "numeric", "distinct": "17", "missing": "0", "min": "0", "max": "85", "mean": "4", "stdev": "10" }, { "name": "Total Conferences", "index": "6", "type": "numeric", "distinct": "14", "missing": "0", "min": "0", "max": "23", "mean": "3", "stdev": "5" }, { "name": "TotalLangExp", "index": "7", "type": "numeric", "distinct": "45", "missing": "0", "min": "-8", "max": "40", "mean": "7", "stdev": "8" }, { "name": "Hardware Proj Mgmt Exp", "index": "8", "type": "numeric", "distinct": "13", "missing": "0", "min": "0", "max": "25", "mean": "2", "stdev": "4" }, { "name": "Software Proj Mgmt Exp", "index": "9", "type": "numeric", "distinct": "18", "missing": "0", "min": "0", "max": "15", "mean": "2", "stdev": "3" }, { "name": "No Of Hardware Proj Estimated", "index": "10", "type": "numeric", "distinct": "10", "missing": "0", "min": "0", "max": "25", "mean": "2", "stdev": "4" }, { "name": "No Of Software Proj Estimated", "index": "11", "type": "numeric", "distinct": "14", "missing": "0", "min": "0", "max": "25", "mean": "4", "stdev": "6" }, { "name": "Domain Exp", "index": "12", "type": "numeric", "distinct": "10", "missing": "0", "min": "0", "max": "7", "mean": "1", "stdev": "1" }, { "name": "Procurement Industry Exp", "index": "13", "type": "numeric", "distinct": "9", "missing": "0", "min": "0", "max": "12", "mean": "1", "stdev": "2" } ], "nr_of_issues": 0, "nr_of_downvotes": 0, "nr_of_likes": 0, "nr_of_downloads": 0, "total_downloads": 0, "reach": 0, "reuse": 4, "impact_of_reuse": 0, "reach_of_reuse": 0, "impact": 4 }