This course introduces students to basic concepts, theories, and methods by way of which political and other social scientists use statistics and microcomputers in making conclusions about evidence that are scientifically valid. This course is designed to serve as an intermediate introduction to the use of systematic quantitative methods in political science.
Satisfactory completion of POLS 515-A or permission of instructor, familiarity with SPSS or willingness to learn.
Required: |
Agresti, Alan, & Finlay Barbara. (1997). Statistical Methods for the Social Sciences. New York: Prentice Hall, 3rd ed. |
Xeroxed materials available through Aztec Shops or electronic reserves. |
Several 3.5" MS-DOS floppy disks for backup of data, setups, and work. |
Students will complete reading and exercises in a manner which maintains pace with the topics being discussed and the lectures. Students are required to attend all lectures and discussions, turn in all computer exercises at the beginning of the class when due, and to take examinations and quizzes in class. Students should expect to spend approximately three hours preparation or more each week for each hour of class time, i.e., at least nine hours or more for a three hour class each week.
Students may expect quizzes at each class meeting and three midterm examinations. No final examination will be given. I will discard the lowest quiz grade and the lowest midterm grade. Students will also be graded on successful completion of weekly exercises to be executed using the SPSS statistical package and data provided for the course by Agresti and Findlay. I recommend that students make two copies of most assignments (one to turn in for a grade and one to keep electronically on a floppy disk in the unlikely event that the assignment is mislaid). keep as a backup) in the unlikely event that work is mislaid. In order to receive credit, all assignments must be turned in on time, i.e., at beginning of class on the day when the problem is due. Meeting this requirement will require prior planning, since SDSU computers may be difficult to access during certain times in a semester. Students would be prudent to familiarize themselves to all campus labs that run SPSS. Instruction and introductory comments on how to use the SPSS and computer will be provided during special sessions in weeks two and three in the
If students do not know how to perform the computer work required for this class, it is their obligation to find out and to complete the work on time. All students in POLS 515 were advised to take the special workshop series on SPSS that SSRL offered last semester. Limited consulting, documentation, and machine access is available at the SSRL and at several other computer facilities on campus.
I expect students who are having difficulties
in this course to contact me during my office hours at the
beginning of the difficulties so that grades do
not suffer.Tutors will also be available during each week for additional help.
Graded practice exercises (homework) are also assigned.
Students would be wise to complete further exercises at the end of each
chapter.
GRADING:
Three Midterm Examinations 25%
Quizzes 25%
Computer problems 20%
Homework exercises 20%
Attendance 10%
Week 1 | Introduction to computers, science, statistics. | January 23. | |
Agresti & Finlay, "Introduction." | |||
Exercises 1-6, pp. 9-10. | |||
Last day to drop or add class is February 7. | |||
Week 2 | SPSS Workshop. | January 30. | |
Week 3 | SPSS Workshop. | February 6. | |
Computer Problem 2:1 due February 13. | |||
Last day to drop or add class is February 7. | |||
Week 4 | Sampling and Measurement, and Decriptive Statistics. | February 13. | |
Agresti & Finlay, Chapters 2-3. | |||
Exercises 7,12, pp. 29-31 and 4,8,18, pp. 67-71. | |||
Computer problem 3:1 due February 20. | |||
Computer Problem 2:1 due (Problems at end of syllabus). | |||
Week 5 | Descriptive Statistics. | February 20. | |
Agresti & Finlay, Continue Chapter 3. | |||
Exercises 20,32,37,57, pp. 71-78. | |||
Computer problems 3:2-3:5 due February 27. | |||
Computer problem 3:1 due. | |||
Week 6 | Probability Distributions. | February 27 | |
Agresti & Finlay, Chapter 4. | |||
Exercise 7,19,39,49, pp. 111-118. | |||
Computer problems 4:1-4:2 due March 6. | |||
Computer problems 3:2-3:5 due. | |||
Week 7 | Statistical Inferences: Estimation. | March 6 | |
Agresti & Finlay, Chapter 5. | |||
Exercise 5,9,19,27, pp. 145-150. | |||
Computer problems 5:1-5:4 due March 20. | |||
Computer Problems 4:1, 4.2 due. | |||
First Midterm Examination | |||
Week 8 | Holiday--Spring Vacation. | March 13. | |
Week 9 | Statistical Inference: Significance Tests. | March 20 | |
Agresti & Finlay, Chapter 6. | |||
Exercises 1,12,24,32, pp. 198-205. | |||
Computer problems 6:1-6:3 due March 27. | |||
Computer Problem 5:1-5:4 due. | |||
Week 9 | Comparison of Two Groups. | March 27 | |
Agresti & Finlay, Chapter 7. | |||
Exercises 2,13,30,52, pp. 234-242. | |||
Computer problems 7:1-7:4 due April 3. | |||
Computer Problem 6:1-6:3 due. | |||
Week 11 | Analyzing Association Between Categorical Variables. | April 3 | |
Agresti & Finlay, Chapter 8. | |||
Exercises 3,17,22,32, pp. 286-294. | |||
Computer problem 8:1 due April 10. | |||
Computer problems 7:1-7:4 due. | |||
Second Midterm Examination | |||
Week 12 | Linear Regression and Correlation. | April 10 | |
Agresti & Finlay, Chapter 9. | |||
Exercises 1,3,10,11, pp. 342-345. | |||
Computer problems 9:1-9:4 due April 17. | |||
Computer Problem 8:1 due. | |||
Week 13 | Linear Regression and Correlation continued. | April 17 | |
Agresti & Finlay, Chapter 9. | |||
Exercises 16,17,31, pp. 346-353. | |||
Computer problems 9:5-9:6 due April 24. | |||
Computer Problem 9:1-9:4 due. | |||
Week 14 | Introduction to Multivariate Relationships. | April 24 | |
Agresti & Finlay, Chapter 10. | |||
Exercises 3,8,21,29, pp. 373-378. | |||
Computer problems 10:1-10:2 due May 1. | |||
Computer problems 9:5-9:6 due. | |||
Week 15 | Multiple Regression and Correlation. | May 1 | |
Agresti & Finlay, Chapter 11. | |||
Exercises 5,6,7,8,10,14,36 pp. 421-431. | |||
Computer problems 11:1-11:4 due May 8. | |||
Optional Extra Credit Computer Problems 1-2 due May 8. | |||
Computer Problem 10:1-10:2 due. | |||
Week 16 | Discussion | May 8 | |
Computer problems 11:1-11:4 due. | |||
Optional Extra Credit Computer Problems 1-2 due. | |||
Third Midterm Examination. | |||
COMPUTER PROBLEMS:
Instructions: Use the designated command files to complete the analyses below. Be sure to write your name on each page of output and staple pages together (do not use paper clips since pages are sometimes lost if they are not attached to each other). You may hand write comments on the output legibly or edit the pages prior to printing to facilitate reading. Clearly label all responses, and describe clearly what the analyses represent. Grading will be based in part on explicit explanation of each item. Computer problem 2:1. (Use data from computer setup 1). Computer setup 1 includes social, demographic, and political data from a small survey of students. Read the data into SPSS, create an SPSS system file, and save the data set onto your floppy disk. Print the data using LIST VARIABLES=ALL. Write your name on the data printout and turn it in for credit. Describe in words what is on the printout. State three hypotheses that you might wish to test using these data. Computer problem 3:1. (Use data from computer setup 1). Use SPSS to produce statistical summaries for a) distance from home town; b) weekly hours of TV viewing; c) weekly times reading a newspaper; d) number of HIV/AIDs victims known. Computer problem 3:2. (Use data from computer setup 1). Find the 95% confidence interval for the proportion believing in life after death. Computer problem 3:3. (Use data from computer setup 1). Use SPSS to produce descriptive analyses of a) political affiliation; b) political ideology; c) religiosity (frequency of attendance); d) support for affirmative action. Prepare a report, interpreting and summarizing the findings. Computer problem 3:4. (Use data from computer setup 2). Use SPSS to summarize poverty data. Prepare a report and graphically summarize the data. Compute measures of central tendency and variation. Be sure to note outliers, if any. If any outliers, describe their influence on results Computer problem 3:5. (Use data from computer setup 2). Omitting Washington, D.C.: a. find the quartiles and interquartile range; b. According to the definition of an outlier in terms of the IQR, are any of the observations outliers? c. Construct a box plot for the distribution. (You may hand draw if you like.) d. Repeat a-c when Washington D.C. is included. Computer problem 4:1. (Use data from computer setup 1). a) Construct a histogram of the population distribution for high school and college grade point averages; b) construct a barchart representing the distribution for gender. c) What is are the similarities and differences between the two graphs? Computer problem 4:2. (Use data from computer setup 1). If data in computer problem 4:1 constitute a simple random sample of all university students, what is the a) estimated sample mean: and b) its standard error? Computer problem 5:1. (Use data from computer setup 1). Construct and interpret the 95% confidence interval for the mean hours spent watching TV, age, college GPA. Interpret. Computer problem 5:2. (Use data from computer setup 2). Construct and interpret the 95% confidence interval for the mean murder rate, mean violent crime rate, and percent below the poverty level. Interpret and summarize. Computer problem 5:3. (Use data from computer setup 4). Construct and interpret the 95% confidence interval for the mean crude birth rate, women's economic activity, and female life expectancy. Interpret and summarize. Note: -9 must be recoded to system missing [RECODE VARIABLE NAME(-9=SYSMIS).]. Computer problem 5:4. (Use data from computer setup 1). Construct and interpret the 95% confidence interval for the proportions who supporting affirmative action and for those who believe in life after death. Interpret and summarize. Note: Test of hypotheses require the seven step formal presentations of assumptions, results, and the conclusion. Computer problem 6:1. (Use data from computer setup 1). Test H0 that political ideology equations 4. Outline arguments (steps) and draw substantive conclusion in words explicitly. Computer problem 6:2. (Use data from computer setup 1). Test H0 that proportion favoring abortion is .50. Outline arguments (steps) and draw substantive conclusion in words explicitly. Computer problem 6:3. (Use data from computer setup 1). Test H0 that proportion who report a liberal ideology is .30. Outline arguments (steps) and draw substantive conclusion in words explicitly. Computer problem 7:1. (Use data from computer setup 1). Compute the 95% confidence interval for the difference between males and females in each of the below: a. Mean number of times reading newspaper. b. Opinion about legalized abortion. c. Mean ideology score. d. Belief in life after death. Computer problem 7:2. (Use data from computer setup 1). Test formally whether males and females differ in terms of: a. Mean number of times reading newspaper. b. Opinion about legalized abortion. c. Mean ideology score. d. Belief in life after death. Computer problem 7:3. (Use data from computer setup 3). Test formally whether new homes sell for more than old homes. Computer problem 7:4. (Use data from computer setup 4). Compute median of female adult literacy rate. Recode the sample into high and low female literacy using the median as the cut point to distinguish high from low. Note: -9 must be recoded to system missing [RECODE XXX(-9=SYSMIS)]. Test H0 that: a. Literacy is positively related to contraception use. b. Literacy is negatively related to crude birth rate. c. Literacy is positively related to life expectancy. Computer problem 8:1. (Use data from computer setup 1). Using contingency table analysis (present percentage tables correctly presented and labeled), test formally hypotheses (use chi square test and compute gamma as well) that attitude about abortion is related to: a. Party affilitation. b. Religiosity. c. Political ideology. d. Vegetarian status. Computer problem 9:1. (Use data from computer setup 1). Compute correlation between political ideology and frequency of newspaper reading. Then, interpret sign of correlation and square of correlation. Computer problem 9:2. (Use data from computer setup 1). Regress newspaper reading on religiosity and regress ideology on religiosity, and: a. Write regression equations (predicting newspaper reading from religiosity and ideology from religiosity). b. What is predicted newspaper reading for each value of religiosity (0,1,2,3)? c. Which (ideology or newspaper reading) is more strongly related to religiosity? Explain. Computer problem 9:3. (Use data from computer setup 2). Regress murder rate on percentage white, then: a. Compute and interpret prediction (regression) equation. b. What is the coefficient of determination? Interpret. c. What is the correlation? Interpret. d. What is error of the estimate? Interpret. e. Remove DC from the equation. Interpret differences in regression equation, correlation, and coefficient of determination. Explain differences. Computer problem 9:4. (Use data from computer setup 2). Print plot (using scatter plot) of the relationship between percent single parent family and percent white. Then: a. Mark two observations that are most discrepant from others in the plot. b. Compute prediction equation and correlation for entire data set, then recomputed deleting the first of the two outliers, then recomputed deleting the second of the two outliers, and finally recomputed deleting both outliers. Describe the results and interpret. Computer problem 9:5. (Use data from computer setup 4). Regress price on number of bathrooms. . Note: -9 must be recoded to system missing [RECODE VARIABLE NAME(-9=SYSMIS)]. Then: a. Print scatter plot. What is the effect of using a predictor with only three values? b. Write and interpret prediction equation. c. Write estimated selling price for 1, 2, and 3 bathrooms. d. Write and interpret correlation. e. Write and interpret coefficient of determination. f. Test H0: r=0 formally (seven steps). Computer problem 9:6. (Use data from computer setup 5). Regress crime rate on percent urban. Then: a. Print scatter plot. b. Write and interpret prediction equation. c. Write estimated selling price for 1, 2, and 3 bathrooms. d. Write and interpret correlation. e. Write and interpret coefficient of determination. f. Test H0: ?=0 formally (seven steps). Computer problem 10:1. (Use data from computer setup 1). Construct partial tables relating opinion about abortion to opinion about life after death, controlling for attendance at religious services (recoded to two categories, never or occasionally vs. most weeks or every week). Prepare a report that includes: a. An arrow diagram about which variables cause which other variables prior to analysis. b. Interpret sample associations in the bivariate table and in the partial tables. c. Revise if necessary, your arrow diagram based on evidence in the sample data. Describe in words. Computer problem 10:2. (Use data from computer setup 8). Construct partial tables relating income to race, controlling for education. Dichotomize income into high and low categories (at the median for income). Prepare a report that includes: a. An arrow diagram about which variables cause which other variables prior to analysis. b. Interpret sample associations in the bivariate table and in the partial tables. c. Revise if necessary, your arrow diagram based on evidence in the sample data. Describe in words. Computer problem 11:1. (Use data from computer setup 1). a. Regress political ideology on newspaper reading and religiosity. Prepare a report, summarizing graphical analyses, bivariate models and interpretations, and regression models inferences, checks of the effects of outliers, and overall summary of relationships. b. Regress college GPA on high school GPS and hours of exercise. Prepare a report, summarizing graphical analyses, bivariate models and interpretations, and regression models inferences, checks of the effects of outliers, and overall summary of relationships. Computer problem 11:2. (Use data from computer setup 2). Regress violent crime rate on percent single parent families and metropolitan rate. Prepare a report, summarizing graphical analyses, bivariate models and interpretations, and regression models inferences, checks of the effects of outliers, and overall summary of relationships. Computer problem 11:3. (Use data from computer setup 2). Repeat 11:2, adding percent white as a predictor. Computer problem 11:4. (Use data from computer setup 2). Repeat 11:3, deleting DC as a case. Interpret differences from 11:3. Extra Credit Computer Assignment. Grading will be placed on accuracy and completeness of analyses and reports. Extra Credit Problem 1 (Using Computer Setup 5). Using all of the data in a multivariate analysis, what are the causes of crime? Write a report. Extra Credit Problem 2 (Using Computer Setup 2). Using the data in a multivariate analysis, what are the three more important causes of murder? Write a report. Computer Setups: Computer Setup 1 (Agresti: Data for Table 7_1 ***. DATA LIST FREE / subj(F2.0) geN(A2) ag(F3.0) hiH(F3.1) co(F3.1) dh(F5.0) dr(F3.1) tv(F2.0) sp(F2.0) neX(F2.0) ah(F2.0) ve(A1) pa(A1) pi(F2.0) re(F2.0) ab(A1) aa(A1) ld(A1). BEGIN DATA 1 m 32 2.2 3.5 0 5.0 3 5 0 0 n r 6 2 n n y 2 f 23 2.1 3.5 1200 0.3 15 7 5 6 y d 2 1 y y u 3 f 27 3.3 3.0 1300 1.5 0 4 3 0 y d 2 2 y y u 4 f 35 3.5 3.2 1500 8 5 5 6 3 n i 4 1 y y n 5 m 23 3.1 3.5 1600 10 6 6 3 0 n i 1 0 y n n 6 m 39 3.5 3.5 350 3 4 5 7 0 y d 2 1 y y u 7 m 24 3.6 3.7 0 .2 5 12 4 2 n i 2 1 y y y 8 f 31 3.0 3.0 5000 1.5 5 3 3 1 n i 2 1 y y y 9 m 34 3.0 3.0 5000 2 7 5 3 0 n i 1 1 y y u 10 m 28 4.0 3.1 900 2 1 1 2 1 y i 3 0 n y y 11 m 23 2.3 2.6 253 1.5 10 15 1 1 n r 5 1 n y y 12 f 27 3.5 3.6 190 3 14 3 7 0 n d 2 1 y y u 13 m 36 3.3 3.5 245 1.5 6 15 12 5 n d 1 1 y y y 14 m 28 3.2 3.2 500 6 3 10 1 2 n i 4 1 y n y 15 f 28 3.0 3.5 3500 1 4 3 1 0 n d 1 0 y y y 16 f 25 3.8 3.3 210 10 7 6 1 0 y i 2 3 y y y 17 f 41 4.0 3.0 1000 15 6 7 3 10 n i 3 3 n u y 18 m 50 3.8 3.8 0 3 5 9 6 10 n d 2 0 y n n 19 m 71 4.0 3.5 5000 3 6 12 2 2 n i 2 0 y n n 20 f 28 3.0 3.8 120 1 25 0 0 2 y d 1 1 y y y 21 f 26 3.7 3.7 8000 8 4 4 4 1 n i 4 1 y y y 22 f 27 4.0 3.7 2 2.5 4 2 7 0 n i 2 1 y y y 23 m 31 2.7 3.5 1700 5 7 7 2 0 n r 7 3 n n y 24 f 23 3.7 3.7 2 2 7 4 2 0 n i 4 0 y y y 25 m 23 3.2 3.8 450 4 0 7 7 3 n i 1 0 y y y 26 f 44 3.0 3.0 0 2 2 3 2 3 y i 3 2 y y y 27 m 26 3.7 3.0 1000 3 8 2 7 0 n d 2 1 y y u 28 f 31 3.7 3.8 850 10 10 3 7 0 n r 5 2 y n y 29 m 24 3.3 3.1 420 2 10 6 5 0 n d 4 1 y y u 30 f 26 3.3 3.3 1200 .75 10 0 3 0 n r 2 1 y y u 31 m 26 3.3 3.5 1000 1.5 0 3 3 3 y d 2 1 y y n 32 f 32 3.5 3.9 150 12 10 2 0 0 n d 2 1 n n y 33 m 26 3.4 3.4 2000 1.5 2 7 14 0 n d 2 0 y y n 34 f 22 3.2 2.8 316 2 10 3 5 2 n i 2 1 y y u 35 f 24 3.5 3.9 900 1.75 8 0 0 1 n d 1 1 y y u 36 m 24 3.6 3.3 250 2 4 6 3 1 n r 5 3 n y y 37 m 23 3.8 3.7 180 .5 10 5 7 0 n i 2 0 y n u 38 m 33 3.4 3.4 6000 1.5 8 5 6 2 n i 2 0 y y n 39 m 23 2.8 3.2 950 2 37 10 5 0 n r 5 2 y n y 40 m 31 3.8 3.5 1100 .75 .5 3 5 2 n r 6 2 y n u 41 m 26 3.4 3.4 1300 1.2 0 8 2 0 n i 2 1 n y n 42 m 28 2.0 3.0 360 .25 10 8 3 0 n d 3 0 y y u 43 f 24 3.8 3.9 1800 2 2 5 4 1 n r 6 3 n y y 44 m 23 3.0 3.6 900 15 12 0 5 0 n r 5 0 y n n 45 f 25 3.0 4.0 5000 5 1.5 0 4 0 n i 4 1 y y n 46 f 24 3.0 3.5 300 1 10 5 5 0 n d 2 0 y y n 47 f 27 3.0 3.8 2000 20 28 7 14 2 y r 3 1 y y y 48 m 24 3.3 3.8 630 1.3 2 3 5 0 n r 7 3 n n y 49 f 26 3.8 4.0 1200 1 0 4 3 1 n d 2 0 y y n 50 f 27 3.0 4.0 580 2 5 15 1 2 n d 1 1 y y n 51 m 32 3.0 3.0 2000 5 5 5 2 1 n r 5 3 n y y 52 f 41 4.0 4.0 0 8 8 4 2 2 n r 4 1 n n y 53 f 29 3.0 3.9 300 3.7 2 5 1 11 n d 2 1 y y y 54 f 50 3.5 3.8 6 6 7 3 7 0 n d 2 1 y y u 55 f 22 3.4 3.7 80 7 10 1 2 2 n i 2 0 y y u 56 f 23 3.6 3.2 375 1.5 5 10 5 0 n r 6 3 n n y 57 m 26 3.5 3.6 2000 .3 16 8 3 0 n d 4 1 y y u 58 m 30 3.0 3.0 1 1.1 1 4 3 0 n i 3 3 y n y 59 f 23 3.0 3.0 112 .5 15 3 3 0 n i 4 2 y y y 60 f 22 3.4 3.0 650 4 8 16 7 1 n i 4 1 y y y END DATA. EXECUTE. VARIABLE LABELS GEN 'Gender'/ AG 'Age in years'/ HIH 'High School GPA (on a four-point scale)'/ CO 'College GPA'/ DH 'Distance (in miles) of the campus from your home town'/ DR 'Distance (in miles) of the classroom from your current residence'/ TV 'Average number of hours per week that you watch TV'/ SP 'Average number of hours per week that you participate in sport'/ NEX 'Number of times a week you read a newspaper'/ AH 'Number of people you know who have died from AIDS or who are HIV+'/ VE 'Whether you are a vegetarian (yes, no)'/ PA 'Political affiliation (D =Democrat, R = Republican, I = independent)'/ PI 'Political ideology (1 =very liberal, 7 = very conservative)'/ RE 'How often you attend religious services (never, occasionally, most weeks, every week)'/ AB 'Opinion about whether abortion should be legal in the first three months of pregnancy (yes, no)'/ AA 'Support affirmative action (yes, no)'/ LD 'Belief in life after death (yes, no)'. SAVE OUTFILE='A:\PROBLEM1.SAV'. *** Computer Setup 2 (Agresti: Data for Table 9_1 ***). DATA LIST FREE / State(A2) VR MR M W H P S. BEGIN DATA AK 761 9.0 41.8 75.2 86.6 9.1 14.3 AL 780 11.6 67.4 73.5 66.9 17.4 11.5 AR 593 10.2 44.7 82.9 66.3 20.0 10.7 AZ 715 8.6 84.7 88.6 78.7 15.4 12.1 CA 1078 13.1 96.7 79.3 76.2 18.2 12.5 CO 567 5.8 81.8 92.5 84.4 9.9 12.1 CT 456 6.3 95.7 89.0 79.2 8.5 10.1 DE 686 5.0 82.7 79.4 77.5 10.2 11.4 FL 1206 8.9 93.0 83.5 74.4 17.8 10.6 GA 723 11.4 67.7 70.8 70.9 13.5 13.0 HI 261 3.8 74.7 40.9 80.1 8.0 9.1 IA 326 2.3 43.8 96.6 80.1 10.3 9.0 ID 282 2.9 30.0 96.7 79.7 13.1 9.5 IL 960 11.4 84.0 81.0 76.2 13.6 11.5 IN 489 7.5 71.6 90.6 75.6 12.2 10.8 KS 496 6.4 54.6 90.9 81.3 13.1 9.9 KY 463 6.6 48.5 91.8 64.6 20.4 10.6 LA 1062 20.3 75.0 66.7 68.3 26.4 14.9 MA 805 3.9 96.2 91.1 80.0 10.7 10.9 MD 998 12.7 92.8 68.9 78.4 9.7 12.0 ME 126 1.6 35.7 98.5 78.8 10.7 10.6 MI 792 9.8 82.7 83.1 76.8 15.4 13.0 MN 327 3.4 69.3 94.0 82.4 11.6 9.9 MO 744 11.3 68.3 87.6 73.9 16.1 10.9 MS 434 13.5 30.7 63.3 64.3 24.7 14.7 MT 178 3.0 24.0 92.6 81.0 14.9 10.8 NC 679 11.3 66.3 75.2 70.0 14.4 11.1 ND 82 1.7 41.6 94.2 76.7 11.2 8.4 NE 339 3.9 50.6 94.3 81.8 10.3 9.4 NH 138 2.0 59.4 98.0 82.2 9.9 9.2 NJ 627 5.3 100.0 80.8 76.7 10.9 9.6 NM 930 8.0 56.0 87.1 75.1 17.4 13.8 NV 875 10.4 84.8 86.7 78.8 9.8 12.4 NY 1074 13.3 91.7 77.2 74.8 16.4 12.7 OH 504 6.0 81.3 87.5 75.7 13.0 11.4 OK 635 8.4 60.1 82.5 74.6 19.9 11.1 OR 503 4.6 70.0 93.2 81.5 11.8 11.3 PA 418 6.8 84.8 88.7 74.7 13.2 9.6 RI 402 3.9 93.6 92.6 72.0 11.2 10.8 SC 1023 10.3 69.8 68.6 68.3 18.7 12.3 SD 208 3.4 32.6 90.2 77.1 14.2 9.4 TN 766 10.2 67.7 82.8 67.1 19.6 11.2 TX 762 11.9 83.9 85.1 72.1 17.4 11.8 UT 301 3.1 77.5 94.8 85.1 10.7 10.0 VA 372 8.3 77.5 77.1 75.2 9.7 10.3 VT 114 3.6 27.0 98.4 80.8 10.0 11.0 WA 515 5.2 83.0 89.4 83.8 12.1 11.7 WI 264 4.4 68.1 92.1 78.6 12.6 10.4 WV 208 6.9 41.8 96.3 66.0 22.2 9.4 WY 286 3.4 29.7 95.9 83.0 13.3 10.8 DC 2922 78.5 100.0 31.8 73.1 26.4 22.1 END DATA. VARIABLE LABELS VR 'Violent crime rate'/ MR 'Murder rate'/ M 'Percent in metropolitan areas'/ W 'Percent white'/ H 'Percent high school graduates'/ P 'Percent below the poverty level'/ S 'Percent of families headed by a single parent'. SAVE OUTFILE='A:\PROBLEM2.SAV'. < *** Computer Setup 3 (Agresti: Data for Table 4 ***). DATA LIST FREE / P S Be Ba New. BEGIN DATA 48.5 1.10 3 1 0 55.0 1.01 3 2 0 68.0 1.45 3 2 0 137.0 2.40 3 3 0 309.4 3.30 4 3 1 17.5 .40 1 1 0 19.6 1.28 3 1 0 24.5 .74 3 1 0 34.8 .78 2 1 0 32.0 .97 3 1 0 28.0 .84 3 1 0 49.9 1.08 2 2 0 59.9 .99 2 1 0 61.5 1.01 3 2 0 60.0 1.34 3 2 0 65.9 1.22 3 1 0 67.9 1.28 3 2 0 68.9 1.29 3 2 0 69.9 1.52 3 2 0 70.5 1.25 3 2 0 72.9 1.28 3 2 0 72.5 1.28 3 1 0 72.0 1.36 3 2 0 71.0 1.20 3 2 0 76.0 1.46 3 2 0 72.9 1.56 4 2 0 73.0 1.22 3 2 0 70.0 1.40 2 2 0 76.0 1.15 2 2 0 69.0 1.74 3 2 0 75.5 1.62 3 2 0 76.0 1.66 3 2 0 81.8 1.33 3 2 0 84.5 1.34 3 2 0 83.5 1.40 3 2 0 86.0 1.15 2 2 1 86.9 1.58 3 2 1 86.9 1.58 3 2 1 86.9 1.58 3 2 1 87.9 1.71 3 2 0 88.1 2.10 3 2 0 85.9 1.27 3 2 0 89.5 1.34 3 2 0 87.4 1.25 3 2 0 87.9 1.68 3 2 0 88.0 1.55 3 2 0 90.0 1.55 3 2 0 96.0 1.36 3 2 1 99.9 1.51 3 2 1 95.5 1.54 3 2 1 98.5 1.51 3 2 0 100.1 1.85 3 2 0 99.9 1.62 4 2 1 101.9 1.40 3 2 1 101.9 1.92 4 2 0 102.3 1.42 3 2 1 110.8 1.56 3 2 1 105.0 1.43 3 2 1 97.9 2.00 3 2 0 106.3 1.45 3 2 1 106.5 1.65 3 2 0 116.0 1.72 4 2 1 108.0 1.79 4 2 1 107.5 1.85 3 2 0 109.9 2.06 4 2 1 110.0 1.76 4 2 0 120.0 1.62 3 2 1 115.0 1.80 4 2 1 113.4 1.98 3 2 0 114.9 1.57 3 2 0 115.0 2.19 3 2 0 115.0 2.07 4 2 0 117.9 1.99 4 2 0 110.0 1.55 3 2 0 115.0 1.67 3 2 0 124.0 2.40 4 2 0 129.9 1.79 4 2 1 124.0 1.89 3 2 0 128.0 1.88 3 2 1 132.4 2.00 4 2 1 139.3 2.05 4 2 1 139.3 2.00 4 2 1 139.7 2.03 3 2 1 142.0 2.12 3 3 0 141.3 2.08 4 2 1 147.5 2.19 4 2 0 142.5 2.40 4 2 0 148.0 2.40 5 2 0 149.0 3.05 4 2 0 150.0 2.04 3 3 0 172.9 2.25 4 2 1 190.0 2.57 4 3 1 280.0 3.85 4 3 0 END DATA. VARIABLE LABELS P 'Selling price'/ S 'Size of home'/ BE 'Number of bedrooms'/ BA 'Number of bathrooms'/ New 'Whether new (1 = yes, 0 = no)'. SAVE OUTFILE='A:\PROBLEM3.SAV'. *** Computer Setup 4 (Agresti: Data for TABLE_9_13 ***).
DATA LIST FREE / Nation(A15) B W C LI LEX HDI GNP N TV. BEGIN DATA Algeria 29.0 11 47 73 68 44 1.6 5 7 Argentina 19.5 38 -9 88 76 96 4.0 12 22 Australia 14.1 61 76 93 81 99 16.6 25 48 Brazil 21.2 38 66 80 69 81 2.6 5 21 Canada 13.7 63 -9 95 81 99 20.8 23 64 China 17.8 81 83 59 70 70 1.3 5 3 Cuba 14.5 50 70 77 77 94 1.6 17 16 Denmark 12.4 77 78 92 78 99 24.2 35 54 Egypt 28.7 12 46 61 65 36 .5 6 12 France 13.0 64 81 93 81 99 24.1 21 41 Germany 11.0 -9 75 92 79 99 19.8 59 56 India 27.8 34 43 44 60 35 .3 3 4 Iraq 43.6 29 14 62 67 41 .7 4 7 Israel 20.4 49 -9 91 78 95 13.6 26 27 Japan 10.7 64 64 94 82 99 27.3 59 61 Malaysia 28.0 55 48 75 73 82 2.5 14 15 Mexico 26.6 37 53 84 74 86 3.1 13 15 Nigeria 43.3 51 6 41 52 42 .2 -9 -9 Pakistan 41.8 16 12 48 63 63 .4 2 2 Philippines 30.4 44 40 68 68 94 .7 5 4 Russia 12.6 70 -9 85 74 99 8.6 -9 -9 SouthAfrica 33.4 54 50 -9 66 70 2.6 4 10 Spain 11.2 31 -9 98 80 93 13.4 8 40 UnitedKingdom 13.2 60 81 92 79 99 17.4 39 43 UnitedStates 15.2 65 74 94 79 99 22.6 25 81 Vietnam 26.3 82 53 89 67 54 -9 -9 -9 END DATA. RECODE B TO TV(-9=SYSMIS). VARIABLE LABELS B 'Crude birth rate (number of births per 1000 population size)'/ W 'Womens economic activity (female labor force as percent of male)'/ C 'Percent women using contraception'/ LI 'Female adult literacy rate'/ LEX 'Female life expectancy'/ HDI 'Human development index'/ GNP 'Gross national product (per capita, in thousands of dollars)'/ N 'Daily newspaper circulation per 100 people'/ TV 'Number of televisions per 100 people'. SAVE OUTFILE='A:\PROBLEM4.SAV'. *** Computer Setup 5 (Agresti: Data for TABLE_9_16 ***).
DATA LIST FREE / County(A10) C I HS U. BEGIN DATA ALACHUA 104 22.1 82.7 73.2 BAKER 20 25.8 64.1 21.5 BAY 64 24.7 74.7 85.0 BRADFORD 50 24.6 65.0 23.2 BREVARD 64 30.5 82.3 91.9 BROWARD 94 30.6 76.8 98.9 CALHOUN 8 18.6 55.9 0.0 CHARLOTTE 35 25.7 75.7 80.2 CITRUS 27 21.3 68.6 31.0 CLAY 41 34.9 81.2 65.8 COLLIER 55 34.0 79.0 77.6 COLUMBIA 69 22.0 69.0 31.1 DADE 128 26.9 65.0 98.8 DESOTO 69 21.0 54.5 44.6 DIXIE 49 15.4 57.7 0.0 DUVAL 97 28.5 76.9 98.8 ESCAMBIA 70 25.2 76.2 85.9 FLAGLER 34 28.6 78.7 63.1 FRANKLIN 37 17.2 59.5 30.2 GADSDEN 52 20.0 59.9 28.8 GILCHRIST 15 20.6 63.0 0.0 GLADES 62 20.7 57.4 0.0 GULF 19 21.9 66.4 35.2 HAMILTON 6 18.7 58.4 0.0 HARDEE 57 22.1 54.8 16.7 HENDRY 47 24.9 56.6 44.7 HERNANDO 44 22.7 70.5 61.3 HIGHLANDS 56 21.1 68.2 24.8 HILLSBOR 110 28.5 75.6 89.2 HOLMES 5 17.2 57.1 16.8 INDIAN_R 58 29.0 76.5 83.0 JACKSON 32 19.5 61.6 21.7 JEFFERSON 36 21.8 64.1 22.3 LAFAYETTE 0 20.7 58.2 0.0 LAKE 42 23.4 70.6 43.2 LEE 59 28.4 76.9 86.1 LEON 107 27.3 84.9 82.5 LEVY 45 18.8 62.8 0.0 LIBERTY 8 22.3 56.7 0.0 MADISON 26 18.2 56.5 20.3 MANATEE 79 26.0 75.6 88.7 MARION 64 22.5 69.6 39.6 MARTIN 53 31.8 79.7 83.2 MONROE 89 29.4 79.7 73.2 NASSAU 42 30.2 71.2 44.9 OKALOOSA 37 27.9 83.8 84.0 OKEECH. 51 21.4 59.1 30.1 ORANGE 93 30.3 78.8 93.1 OSCEOLA 78 27.3 73.7 66.4 PALM_B 90 32.5 78.8 94.7 PASCO 42 21.5 66.9 67.4 PINELLAS 70 26.3 78.1 99.6 POLK 84 25.2 68.0 70.3 PUTNAM 83 20.2 64.3 15.7 SANTA_R 43 27.6 79.9 57.2 SARASOTA 58 29.9 71.7 92.1 SEMINOLE 56 35.6 78.5 44.4 ST_JOHNS 54 29.9 81.3 93.2 ST_LUCIE 58 27.7 84.6 92.8 SUMTER 37 19.6 64.3 19.3 SUWANEE 37 19.8 63.8 23.6 TAYLOR 76 21.4 62.1 41.8 UNION 6 22.8 67.7 0.0 VOLUSIA 62 24.8 75.4 83.9 WAKULLA 29 25.0 71.6 0.0 WALTON 18 21.9 66.5 20.9 WASHING 21 18.3 60.9 22.9 END DATA. VARIABLE LABELS C 'Crime rate'/ I 'Median income'/ HS 'Percent completing high school'/ U 'Percent urban'. SAVE OUTFILE='A:\PROBLEM5.sav'. *** Computer Setup 8 (Agresti: Data for TABLE_13_1 ***).
DATA LIST FREE /inc(F2.0) educ(F2.0) race(A2) z1(F1.0) z2(F1.0). BEGIN DATA 8 10 b 1 0 9 7 b 1 0 13 9 b 1 0 8 11 b 1 0 17 14 b 1 0 11 12 b 1 0 21 16 b 1 0 21 16 b 1 0 8 9 b 1 0 10 10 b 1 0 33 16 b 1 0 13 12 b 1 0 10 10 b 1 0 15 15 b 1 0 10 10 b 1 0 15 19 b 1 0 16 16 h 0 1 8 11 h 0 1 10 10 h 0 1 29 16 h 0 1 15 12 h 0 1 13 10 h 0 1 10 8 h 0 1 20 12 h 0 1 16 10 h 0 1 11 11 h 0 1 10 10 h 0 1 28 14 h 0 1 16 12 h 0 1 15 11 h 0 1 15 14 w 0 0 24 14 w 0 0 20 7 w 0 0 42 18 w 0 0 25 10 w 0 0 19 12 w 0 0 15 12 w 0 0 38 16 w 0 0 24 16 w 0 0 18 11 w 0 0 20 11 w 0 0 22 12 w 0 0 15 10 w 0 0 30 15 w 0 0 12 9 w 0 0 44 17 w 0 0 23 16 w 0 0 25 16 w 0 0 25 14 w 0 0 11 11 w 0 0 13 12 w 0 0 23 16 w 0 0 11 9 w 0 0 12 9 w 0 0 32 14 w 0 0 31 16 w 0 0 12 10 w 0 0 25 13 w 0 0 16 10 w 0 0 17 16 w 0 0 26 18 w 0 0 12 12 w 0 0 11 14 w 0 0 10 13 w 0 0 15 14 w 0 0 12 13 w 0 0 60 18 w 0 0 11 10 w 0 0 41 16 w 0 0 9 12 w 0 0 13 12 w 0 0 52 14 w 0 0 14 12 w 0 0 16 12 w 0 0 19 14 w 0 0 22 12 w 0 0 11 12 w 0 0 9 10 w 0 0 12 12 w 0 0 28 20 w 0 0 END DATA. VARIABLE LABELS INC 'Annual income (thousands of dollars)'/ EDUC 'Number of years of education (where 12 = high school graduate, 16 = college graduate)' RACE 'Racial-ethnic group (Black, Hispanic, White) as a letter'/ Z1 'Racial-ethnic group (Black, Hispanic, White) as a dummy variables for black'/ Z2 'Racial-ethnic group (Black, Hispanic, White) as a dummy variables for Hispanic'. SAVE OUTFILE='A:\PROBLEM8.SAV'. |