sources of authentic data term 2 – week 9 vce it – unit 2

10

Click here to load reader

Upload: cordelia-moore

Post on 28-Dec-2015

215 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Sources of authentic data Term 2 – Week 9 VCE IT – UNIT 2

Sources of authentic data

Term 2 – Week 9

VCE IT – UNIT 2

Page 2: Sources of authentic data Term 2 – Week 9 VCE IT – UNIT 2

Sources of authentic data• Primary data• Secondary data• ABS census quality statement – outlines four principal

sources of error:– Respondent error: the person filling out the form

misunderstands a question and provides incorrect information.

– Processing error: when the technique to input data makes a mistake (can be via manual or automatic entry).

– Partial response: when the person only fills out part of the survey.

– Undercount: when not all respondents complete the survey.

Page 3: Sources of authentic data Term 2 – Week 9 VCE IT – UNIT 2

• Some sources of authentic data include:– The World Bank www.worldbank.org– Organisation for Economic Cooperation and

Development (OECD) www.oecd.org– CIA World Factbook

https://www.cia.gov/library/publications/the-world-factbook

– Australian Tax Office www.ato.gov.au

Page 4: Sources of authentic data Term 2 – Week 9 VCE IT – UNIT 2

• We need to start with raw facts before we can produce information.– Primary: Data obtained through measurement. Can

include surveys, interviews, direct observation etc. Best to have a quality process in place to avoid errors.

– Secondary: Data obtained from someone else or published work. Includes books, websites, newspapers etc.

Data sources and methods of acquisition

•When acquiring data, you need to be mindful of what to expect and what form the data might come in (i.e. handwritten surveys).

Page 5: Sources of authentic data Term 2 – Week 9 VCE IT – UNIT 2

Data integrity

• Data entered must have integrity (degree to which it is correct) for useful information to be presented. The more errors in data, the lower the integrity. This includes misspelled words. Users will not rely on this data.

• Why is it important?• To ensure integrity of data, we need to validate it as

it’s entered into the database. E.g. When purchasing something online, you will be prompted for the three-digit security code on the back of the card.

Page 6: Sources of authentic data Term 2 – Week 9 VCE IT – UNIT 2

Measurement

• The most common technique used for gathering data.

• Examples include retail stocktake, ascertaining the numbers in a crowd by counting admission tickets etc. Can also be electronic e.g. scanning a barcode, temperature-activated heating, hits on a website etc.

Page 7: Sources of authentic data Term 2 – Week 9 VCE IT – UNIT 2

Data types and structures relevant to selected software tools

• Many different software tools can be used to create data visualisations; however, they are often derived from spreadsheet or database tools.

• Data can be entered into ‘Many eyes’ website http://manyeyes.alphaworks.ibm.com/manyeyes

• Data needs to be separated into distinct fields or columns that have only one sort of data type in them.

Page 8: Sources of authentic data Term 2 – Week 9 VCE IT – UNIT 2

Data typesData type Description

Integer Represents whole numbers.

Characters Holds a single letter, number or symbol (i.e. (Y)es, (N)o, (M)aybe

Strings Sequences of characters

Boolean data Can only have two values: True and false

Decimal numbers Include a fractional or decimal part (often used for costs)

Date and time data Not recorded as the actual date, such as 12.01pm on 03/11/2010. The date is held in the form of an integer that counts the number of days from a specified reference point, and time data is stored as a decimal fraction.

Page 9: Sources of authentic data Term 2 – Week 9 VCE IT – UNIT 2

Questions

• What is a data visualisation and why are they a preferred method of information presentation?

• Explain the three processes that the analysis stage involves. What is the benefit of determining this information?

• Give a situation when quantitative data would be preferred over qualitative data, and vice versa.

• What are the four main ways data visualisation is used for?• What type of data visualisation would you use:

– To show the price increase of petrol over the last 5 years? – To show how much more money women spend of shoes than men?– To show a classes distribution of exam marks?

Page 10: Sources of authentic data Term 2 – Week 9 VCE IT – UNIT 2

• Give an example of when you would use a histogram, line chart and bubble chart.

• Describe the errors that can occur when acquiring data.

• Why would it be important to identify if a data set had an undercount?

• What is the difference between primary and secondary source data?

• Why is it important to ensure data presented has integrity?