Checking Data Entry

From PsychWiki - A Collaborative Psychology Wiki

Revision as of 03:51, 16 February 2008

Why check to see if your data have been entered correctly?

  1. People make mistakes. If your data were entered by hand (by a person), you need to double-check whether any mistakes were made when transferring them into a dataset (in software like SPSS, SAS, R, S, etc.).
  2. Computers make mistakes. If you collected your data online or through an online system (such as SurveyMonkey or your own hosted site), then the data *should* transfer into the dataset without error, unless the online system was set up incorrectly.
  3. Irrespective of how the mistake occurred, mistakes will misrepresent your true data. The purpose of conducting research is to discover reality, so incorrectly entered data thwart the purpose of research.
  4. Misrepresenting the data that were collected can significantly affect your findings. A single incorrectly entered number can create an outlier, reduce normality, or change the findings of your study.
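
To make the last point concrete, here is a minimal sketch of how one mistyped value shifts the summary statistics. The responses are made-up numbers on a 1 through 11 scale, chosen only for illustration; they are not from any real study.

```python
from statistics import mean, stdev

# Hypothetical responses on a 1-11 scale (made-up values for illustration only).
correct = [3, 4, 5, 4, 3, 5, 4, 3]
# The same data, except the last "3" was mistyped as "13" during entry.
mistyped = [3, 4, 5, 4, 3, 5, 4, 13]


def summarize(values):
    """Return the mean and sample standard deviation of a list of numbers."""
    return mean(values), stdev(values)


correct_mean, correct_sd = summarize(correct)
mistyped_mean, mistyped_sd = summarize(mistyped)

print(f"correct:  mean={correct_mean:.2f}, sd={correct_sd:.2f}")
print(f"mistyped: mean={mistyped_mean:.2f}, sd={mistyped_sd:.2f}")
```

With only eight responses, the single bad entry raises both the mean and the standard deviation, which is the kind of distortion described above.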


How do I check to see if the data have been entered correctly?

  1. Have two or more people enter the same data and look for discrepancies. If you enter the data in Excel or SPSS, you can have two or more people enter the data into separate files, then merge the files and look for differences between them.
  2. Have someone enter the data, and then double-check by randomly picking different segments and looking for incorrectly entered data.
  3. Statistical software (like SPSS, SAS, R, S, etc.) can use descriptive analysis to look for numbers that are out of range or other errors in data entry. For example, SPSS output for the variable "system1" shows that a subject put a "13" for the question even though the only valid responses were 1 through 11.
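
The three checks above could be sketched in Python roughly as follows. This is an illustration under stated assumptions, not a prescribed procedure: the variable name "system1" and its valid range of 1 through 11 come from the example in the text, while the subject IDs, the entered values, and the helper names (`find_discrepancies`, `pick_spot_check`, `find_out_of_range`) are hypothetical.

```python
import random

# Two hypothetical entry passes of the same paper forms, keyed by subject ID.
# "system1" and its 1-11 range come from the text; IDs and values are made up.
entry_a = {101: {"system1": 4}, 102: {"system1": 13}, 103: {"system1": 7}}
entry_b = {101: {"system1": 4}, 102: {"system1": 3}, 103: {"system1": 7}}


def find_discrepancies(first, second):
    """List (subject_id, variable, value_in_first, value_in_second) where the passes differ."""
    diffs = []
    for subject_id in sorted(set(first) | set(second)):
        row_a = first.get(subject_id, {})
        row_b = second.get(subject_id, {})
        for variable in sorted(set(row_a) | set(row_b)):
            if row_a.get(variable) != row_b.get(variable):
                diffs.append(
                    (subject_id, variable, row_a.get(variable), row_b.get(variable))
                )
    return diffs


def pick_spot_check(subject_ids, k, seed=None):
    """Randomly choose k subject IDs to re-check against the hard copies."""
    rng = random.Random(seed)
    return sorted(rng.sample(sorted(subject_ids), k))


def find_out_of_range(dataset, variable, low, high):
    """List (subject_id, value) pairs whose value falls outside low..high."""
    return [
        (subject_id, row[variable])
        for subject_id, row in sorted(dataset.items())
        if variable in row and not (low <= row[variable] <= high)
    ]


discrepancies = find_discrepancies(entry_a, entry_b)          # check 1: double entry
spot_check_ids = pick_spot_check(entry_a.keys(), k=2, seed=0)  # check 2: random segments
out_of_range = find_out_of_range(entry_a, "system1", 1, 11)    # check 3: range check

print(discrepancies)
print(spot_check_ids)
print(out_of_range)
```

Note that the double-entry comparison and the range check catch different things: a value can be within range and still disagree between the two passes, so the checks complement each other.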


What do I do when I find data that have been entered incorrectly?

  1. The first step is to identify why the value was entered incorrectly. For example, the SPSS output for the variable "system1" shows a "13". Since 13 is an invalid number, you need to identify why "13" was entered. Did the person entering the data make a mistake? Or did the subject respond with a "13" even though the question indicated that only numbers 1 through 11 are valid? You can identify the source of the error by looking at the hard copies of the data.
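
The decision described in this step can be written down as a small helper. This is only a sketch: the function name `classify_error` and the returned labels are hypothetical, and the default range of 1 through 11 is taken from the "system1" example.

```python
def classify_error(entered_value, hard_copy_value, low=1, high=11):
    """Classify an entered value by comparing it with the hard copy.

    Only out-of-range entered values are treated as errors here; the labels
    are illustrative and not part of any statistical package.
    """
    if low <= entered_value <= high:
        return "entered value is in range"
    if hard_copy_value != entered_value:
        # The paper form says something else, so the typist introduced the error.
        return "data entry mistake"
    # The paper form shows the same invalid number, so the subject wrote it.
    return "subject gave an invalid response"


print(classify_error(13, 3))
print(classify_error(13, 13))
```

The first call corresponds to a typing mistake (the hard copy shows a valid 3), and the second to a subject who actually wrote 13 on the form.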







