{ "data_id": "538", "name": "colleges_usnews", "exact_name": "colleges_usnews", "version": 1, "version_label": null, "description": "**Author**: \n**Source**: Unknown - Date unknown \n**Please cite**: \n\nThe USNEWS dataset for the ASA Statistical Graphics Section's\n1995 Data Analysis Exposition contains information on over 1300\nAmerican colleges and universities. The data may be obtained in\neither of two formats.\n\nUSNEWS.DATA contains the raw data in comma delimited fields with\na single data line for each school. The order of variables is the\nsame as given below for the fixed column version, although the\nspacing varies for each school.\n\nUSNEWS3.DATA has the data arranged in fixed columns, with three\ndata lines for each school and a maximum line length of 80\ncharacters.\n\nThis dataset is taken from the 1995 U.S. News & World Report's\nGuide to America's Best Colleges. This dataset is protected by\ncopyright, is reproduced with permission of the copyright\nholder(s), and may not be downloaded or otherwise copied, except\nsolely for the purpose of analysis in connection with the\nAmerican Statistical Association's 1995 Data Analysis Exposition.\nThe data are reproduced with the permission of the publisher.\n\nMost of the data are for the 1993-94 school year. You may wish to\nconsult a copy of the U.S. 
News source for more detailed\ndescriptions of the variables.\n\n\nKEY FOR USNEWS3.DATA\nFixed column format with three data lines per school\n\nLine #1\n1 - 5 FICE (Federal ID number)\n7 - 51 College name\n53 - 54 State (postal code)\n\nLine #2\n1 - 2 Public\/private indicator (public=1, private=2)\n3 - 6 Average Math SAT score\n7 - 10 Average Verbal SAT score\n11 - 15 Average Combined SAT score\n16 - 18 Average ACT score\n19 - 22 First quartile - Math SAT\n23 - 26 Third quartile - Math SAT\n27 - 30 First quartile - Verbal SAT\n31 - 34 Third quartile - Verbal SAT\n35 - 37 First quartile - ACT\n38 - 40 Third quartile - ACT\n41 - 46 Number of applications received\n47 - 52 Number of applicants accepted\n53 - 57 Number of new students enrolled\n58 - 61 Pct. new students from top 10% of H.S. class\n62 - 65 Pct. new students from top 25% of H.S. class\n\nLine #3\n1 - 6 Number of fulltime undergraduates\n7 - 12 Number of parttime undergraduates\n13 - 18 In-state tuition\n19 - 24 Out-of-state tuition\n25 - 29 Room and board costs\n30 - 34 Room costs\n35 - 39 Board costs\n40 - 44 Additional fees\n45 - 49 Estimated book costs\n50 - 54 Estimated personal spending\n55 - 58 Pct. of faculty with Ph.D.'s\n59 - 62 Pct. 
of faculty with terminal degree\n63 - 67 Student\/faculty ratio\n68 - 70 Pct. alumni who donate\n71 - 76 Instructional expenditure per student\n77 - 80 Graduation rate\n\nMissing values are denoted with *\n\n**************************************************************\nTo obtain the dataset from Statlib, send one of the single line\nmessages below to the address statlib@lib.stat.cmu.edu\n\nsend usnews.data from colleges\nor\nsend usnews3.data from colleges\n\n\nFor more information on the ASA Statistical Graphics Section's\n1995 Data Analysis Exposition send the message\n\nsend readme from colleges\n\n%%%%%%%%%%%%%%\nINFORMATION %\n%%%%%%%%%%%%%%\n\nWHAT'S WHAT AMONG AMERICAN COLLEGES AND UNIVERSITIES?\n\nThis is the subject of the 1995 Data Analysis Exposition\nsponsored by the Statistical Graphics Section of the American\nStatistical Association. The purpose of the Exposition is to\nencourage statisticians to demonstrate techniques, especially\ngraphical, for analyzing data and displaying the results of an\nanalysis. Individuals and groups will work with the same set of\ndata and present their analyses at a special session as part of\nthe annual Joint Statistical Meetings in Orlando, Florida on\nAugust 13th-17th, 1995. The datasets for 1995 are drawn from two\nsources, U.S. News & World Report's Guide to America's Best\nColleges and the AAUP (American Association of University\nProfessors) 1994 Salary Survey which appeared in the March-April\n1994 issue of Academe.\n\nThe U.S. News data contains information on tuition, room & board\ncosts, SAT or ACT scores, application\/acceptance rates,\ngraduation rate, student\/faculty ratio, spending per student, and\na number of other variables for 1300+ schools. The AAUP data\nincludes average salary, overall compensation, and number of\nfaculty broken down by full, associate, and assistant professor\nranks.\n\nThe raw data and documentation are contained in the files\ndescribed below. 
To obtain any of these files send a message to\nstatlib@lib.stat.cmu.edu of the following form (substituting the\nfile you want for XXXXX)\n\nsend XXXXX from colleges\n\nAvailable files\n\nusnews.doc Documentation for the U.S. News data\nusnews.data U.S. News data in comma delimited format\nusnews3.data U.S. News data in fixed column format\n\naaup.doc Documentation for the AAUP salary data\naaup.data AAUP salary data in comma delimited format\naaup2.data AAUP salary data in fixed column format\n\nTwo versions of each dataset are provided to accommodate users\nwith different software constraints. The comma delimited\nversions (USNEWS.DATA and AAUP.DATA) contain information for each\ncollege on a separate line with values delimited by commas. The\nfixed column versions (USNEWS3.DATA and AAUP2.DATA) use 2 or 3\ndata lines per school and a maximum line length of 80 characters.\n\nTo participate in the 1995 Data Analysis Exposition you must send\nan abstract form to the American Statistical Association by\nFebruary 1st, 1995. Information is available from the ASA\nMeetings Department by e-mail (meetings@asa.mhs.compuserve.com),\nphone (703-684-1221), fax (703-684-2037), or surface mail (ASA,\n1429 Duke St., Alexandria, VA 22314). Your initial abstract may\nbe fairly general since you may do the bulk of your analysis\nafter the February 1 deadline.\n\nYou may choose your own path to proceed in analyzing the data or\nuse some of the suggested questions below to get started.\n\n... How well can we model tuition using the other variables?\n... How might we cluster colleges into similar comparison groups?\n... How can we best display faculty salary structure?\n... Can we find a reasonable way to rank the schools?\n\nYou may work on your own or put together a team. Show off the\ncapabilities of your favorite software package or use the data\nfor a class project and display your students' results. 
You may\nchoose to consider just a subset of schools or examine regional\npatterns. The main point is to find innovative ways to display\nthe interesting features of the data.\n\nFurther questions about the 1995 Exposition can be directed to\nRobin Lock, Mathematics Department, St. Lawrence University,\nCanton, NY 13617 e-mail rlock@vm.stlawu.edu\n\nIf you would like to be informed about any subsequent adjustments\nor error fixes to the 1995 Exposition data, please send an e-mail\nmessage to register your interest to rlock@vm.stlawu.edu.\n\nSpecial thanks for providing data for the 1995 Exposition to:\nRobert Morse, Director of Research for America's Best Colleges at\nU.S. News & World Report\nMaryse Eymonerie, Consultant to AAUP.\n\n\nInformation about the dataset\nCLASSTYPE: numeric\nCLASSINDEX: none specific", "format": "ARFF", "uploader": "Joaquin Vanschoren", "uploader_id": 2, "visibility": "public", "creator": null, "contributor": null, "date": "2014-09-29 00:09:00", "update_comment": "set targets, ignores", "last_update": "2014-10-07 01:29:41", "licence": "Public", "status": "active", "error_message": "Quality illegal value: AutoCorrelation, value: NaN,Quality illegal value: AutoCorrelation, value: NaN,Quality illegal value: AutoCorrelation, value: NaN,Quality illegal value: AutoCorrelation, value: NaN,Quality illegal value: AutoCorrelation, value: NaN", "url": "https:\/\/www.openml.org\/data\/download\/52650\/colleges_usnews.arff", "default_target_attribute": "Graduation_rate", "row_id_attribute": null, "ignore_attribute": "\"FICE\",\"College_name\"", "runs": 0, "suggest": { "input": [ "colleges_usnews", "The USNEWS dataset for the ASA Statistical Graphics Section's 1995 Data Analysis Exposition contains information on over 1300 American colleges and universities. The data may be obtained in either of two formats. USNEWS.DATA contains the raw data in comma delimited fields with a single data line for each school. 
The order of variables is the same as given below for the fixed column version, although the spacing varies for each school. USNEWS3.DATA has the data arranged in fixed columns, with thr " ], "weight": 5 }, "qualities": [], "tags": [], "features": [], "nr_of_issues": 0, "nr_of_downvotes": 0, "nr_of_likes": 0, "nr_of_downloads": 0, "total_downloads": 0, "reach": 0, "reuse": 5, "impact_of_reuse": 0, "reach_of_reuse": 0, "impact": 5 }