The local school district wants to survey all sixth grade students and their school-aged siblings. There are three different types of surveys: one for the sixth graders, one for their younger siblings, and one for their older siblings. The SAS data set called SCHOOL SURVEY contains data for all sixth graders in the three middle schools in the district (Rachael Carson, Green Valley, and Redwood Grove). The data set also includes data for all their siblings attending schools in the district, which can be linked back to the sixth grader by Family_ID.
a. Examine this SAS data set including the variable labels and attributes. Add a comment to your program that notes the sort order of the variables in this data set.
b. Create a data set that has one observation for each sixth grader.
c. Combine the data set from part b) with the original data, and create a variable that is the age difference in years between the sixth grader and the sibling.
d. Count the number of older siblings and the number of younger siblings of the sixth grader, and add these variables to the data set of just the sixth graders. Be sure to label the variables appropriately.
e. So the district knows how many surveys to print and distribute to each school, compute the number of sixth graders, and the total number of younger and older siblings by school. Be sure to show the complete names for the schools.
f. For each sixth grader, add variables to your data set for the age difference between the sixth grader and their youngest sibling, and the age difference between the sixth grader and their oldest sibling.
g. For each school, compute the average, minimum, and maximum age difference between the sixth graders and their youngest and oldest siblings. Be sure to show the complete names for the schools.