In this unit the students will collect statistical data about their own class and school and learn how to compare it to data from students in the USA.
 plan an investigation;
 use spreadsheets to display and analyse data;
 discuss features of data display;
 compare features of data distributions;
By Level 4 students are able to take increasing responsibility for the planning and conducting of statistical investigations. Students should be capable now of incorporating a computer into their work. The mathematics explored in this unit includes tally charts, the use of Microsoft Excel to collect and analyse data (especially as histograms), and measures of mid point (mean, median and mode) and spread (using box and whisker plots).
Note that measures of centre are not introduced in The New Zealand Curriculum until level 5. This material may not be suitable for all of your students.
The mode is the number or event that appears most often in a set of data. Suppose that the eye colours of the students in the class are blue, blue, blue, brown, brown, brown, brown, green, green. Then the most common eye colour is brown. This is the mode.
The number that all the numbers in a data set cluster equally around is the mean. This is calculated by adding all the numbers together and dividing by the number of numbers. It is important that students understand that the mean of a group of numbers (or measurements) in our unit represents what would happen if we equally redistributed our measurements so that everyone had the same measure.
The number that comes in the middle of a set of numbers when they are arranged in order is the median. If we had the following set of numbers (4, 4, 4, 8, 9, 10, 10), then the middle number will be the fourth one. This number is 8, so the median of the numbers is 8. Now here we are lucky that there are an odd number of numbers. Otherwise there wouldn't be a precise "middle" number. If the numbers had been 4, 4, 4, 8, 9, 10, then we have to take the "three and a half" number as the middle number. As this is halfway between 4 and 8 we take the mean of 4 and 8 to be the median. So the median in this case is 6.
The three concepts of mode, mean and median, measure central tendency in some way. That is, they give some idea of the middle number in a set. However, they are often different numbers. The point of the mode is that its central tendency is the sameness of data, what is the most common "same" number. As for the mean, that tells which number is as close as possible to all numbers. When you add all the differences, both positive and negative, between the mean and the other numbers in the set, the result is zero. Finally the median is literally in the middle. When the data set is put in order, it's the actual number that is at the halfway point of the list (or as close as we can get in the even case).
 Computers with Microsoft Excel and internet access
 Paper for collecting data.
investigation, survey, census, variables, box and whisker plots, tally charts, spreadsheet, mode, median, mean, midpoints, range, distribution, spread, histogram, similarities, differences, comparison
Session 1: Measuring the class

Tell the students that school uniform or sports wear manufacturers may be interested in information about different body sizes. Introduce differences in sizes by asking for volunteers and standing two students up to discuss what about them could be measured to inform clothing or footwear producers.

Brainstorm things that could be measured and compared (height, head circumference, handspan, foot length). Note that some students may be sensitive about being measured. It is not appropriate to measure weight.

Students to form small groups and select one attribute to measure (each group to select a different attribute). Ensure that one group selects height.

Discuss ways to record the data (listing, stem and leaf plots, tally chart).

Students to survey class and collect data. As the students gather data, the teacher should circulate and provide advice or assistance as required. Ensure that accuracy of measurements is maintained.
Session 2: Displaying data
Students use data from previous session to produce graphs on Microsoft Excel.

Discuss the data collected in the previous session and explain that the students will be using Microsoft Excel to produce graphs of the data.

Initially students should be given freedom to experiment with what type of graph they feel best shows the information, but some may need assistance putting the data into a spreadsheet.

Once groups have produced two or three graphs each bring the class together and discuss the graphs produced. Talk about strengths and weaknesses of each. Make a chart listing strengths and weaknesses that students could refer to in the future.

The best graph to show this type of data is a histogram, ensure that each group has produced at least one histogram of their data by the end of the session.
Session 3: Comparing to the school
The class will collect height data from the rest of the school to compare to their class results.

Depending on the size of the school and the number of other classes available to be measured, students will work in groups or individually to collect height data for the school on a tally chart.

All data for the school is to be brought together and entered onto one Excel spreadsheet.

Students can work in small groups again to produce histograms of the whole school spreadsheet data.

As a class compare histograms of school heights with previously produced histogram of class heights. Focus the discussion on similarities and differences between the data.
Session 4: Comparing to data from the internet.
Students will produce histograms of heights of students in the USA using data from the internet.

Discuss the previous sessions’ work.
Who else could we compare heights with?
Would it be interesting to compare our school’s heights with heights of students from another country? 
Explain that there is a website with information collected from students at schools in the USA that includes information about heights of students

As a class, visit the CensusAtSchool USA site and access the results for heights of students the same age as those in class.

Students can work in small groups again to produce histograms of this data.

As a class compare the histograms produced with those for the class and for the school. Focus the discussion on similarities and differences between the results.
Session 5: Investigating midpoints/spread
Note that measures of centre are not introduced in The New Zealand Curriculum until level 5. This material may not be suitable for all of your students.

Students will use mean, median and mode to investigate midpoints of height data, and produce box and whisker plots to show the spread of data.

Discuss the results from the previous 4 sessions. Discuss what we can say to compare our class/school with the results from overseas.

Collect suggestions of ways we could find out more.

Discuss mean, median and mode, how we could find them out for this data. Find the mean, median and mode for heights of students in all three samples.

Discuss range, and produce box and whisker plots for all three sets of data.

Use the box and whisker plots to compare the three sets of data.

Discuss comparisons as a class.
Who is taller US students or us?
Dear Parents and Whānau,
At school this week we will be measuring the heights of the students in the class and comparing our results to the heights of children overseas. As another comparison we would like to be able to talk about the heights of family members. Could you please help your child measure the heights of their family members and record the information to bring to school?
How tall are you?
Tally 
Total 

90100cm 

>100110cm 

>110120cm 

>120130cm 

>130140cm 

>140150cm 

>150160cm 

>160170cm 

>170180cm 

>180190cm 
Thank you.