Blog member Laura Illingworth is presenting at the SUGUKI meeting in London on Feb 6

London SAS Meetup

The Hub @ SAS London, 7th Floor, 199 Bishopsgate, London EC2M 3TY

Details

• 18:00 – Arrive
• 18:30 – “Missing data and how to use multiple imputations to improve models” by Laura Illingworth
• 18:50 – Sasensei Quiz Break
• 19:00 – “Hidden Gems in SAS Enterprise Guide” by Peter Hobart
• 19:20 – Wrap up and drinks
• 20:00 – Vacate to a local watering hole

Entrance is free but you must RSVP to confirm attendance for security/capacity reasons.

Statisticians: Beware of the Datasaurus!!

Alberto Cairo created the Datasaurus, a scatterplot in the shape of a dinosaur, to demonstrate why data should be plotted rather than judged by its summary statistics alone. Justin Matejka and George Fitzmaurice then generated the Datasaurus Dozen: 12 very different scatterplots with almost identical statistics.

Plot of Datasaurus data set

  • N: 142
  • Mean: X=54.27, Y=47.83
  • Standard deviation: X=16.77, Y=26.94
  • Correlation x-y: -0.06
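
As a rough illustration of how different point clouds can share these statistics, here is a minimal Python sketch. It does not use the Datasaurus data: the points are synthetic, the helper names `summarize` and `reflect_through_mean` are made up for this example, and the standard deviation is assumed to be the sample (n-1) version. Reflecting every point through the mean point changes the picture but leaves the mean, standard deviation and correlation unchanged.

```python
import math
import random


def summarize(points):
    """Return N, means, sample standard deviations and Pearson correlation."""
    n = len(points)
    xs = [p[0] for p in points]
    ys = [p[1] for p in points]
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    return {
        "n": n,
        "mean_x": mean_x,
        "mean_y": mean_y,
        "sd_x": math.sqrt(sxx / (n - 1)),  # assumption: sample (n-1) SD
        "sd_y": math.sqrt(syy / (n - 1)),
        "corr": sxy / math.sqrt(sxx * syy),
    }


def reflect_through_mean(points):
    """Mirror each point through the mean point (a 180 degree rotation)."""
    s = summarize(points)
    return [(2 * s["mean_x"] - x, 2 * s["mean_y"] - y) for x, y in points]


random.seed(0)
# Synthetic "L"-shaped cloud of 142 points (NOT the Datasaurus data).
original = (
    [(random.uniform(0, 100), random.uniform(0, 10)) for _ in range(71)]
    + [(random.uniform(0, 10), random.uniform(0, 100)) for _ in range(71)]
)
mirrored = reflect_through_mean(original)

stats_original = summarize(original)
stats_mirrored = summarize(mirrored)

for key in stats_original:
    print(key, round(stats_original[key], 2), round(stats_mirrored[key], 2))
```

The two columns printed for each statistic agree, even though the "L" shape sits in opposite corners of the two plots. The Datasaurus Dozen takes the same idea much further, with shapes that look nothing alike.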

The 12 data sets with almost identical statistics to those above are plotted here, including the x and y means as reference lines:

More information about the Datasaurus Dozen, including how the Dozen were generated and how to download the data, can be found here.

The program to create these graphs, including the data, can be downloaded as a zip file from here.