# Interaction between categorical and continuous variables

### From PsychWiki - A Collaborative Psychology Wiki

(10 intermediate revisions not shown) | |||

Line 1: | Line 1: | ||

- | + | Testing interactions between categorical and continuous variables follows the same basic steps as testing [[Interaction between two continuous variables | interactions between two continuous variables]] so there is content overlap between this page and the page describing [[Interaction between two continuous variables | interactions between two continuous variables]]. | |

Line 45: | Line 45: | ||

===► '''Create the interaction term'''=== | ===► '''Create the interaction term'''=== | ||

*How to create the interaction term? | *How to create the interaction term? | ||

- | *#Simply | + | *#Simply multiply together the newly centered variable and the categorical variable. |

*#In our example, multiple IQ_c x study (e.g., "study" is the variable name for whether the subjects studied for the exam or not). | *#In our example, multiple IQ_c x study (e.g., "study" is the variable name for whether the subjects studied for the exam or not). | ||

- | *#In SPSS this is accomplished using the "compute" command and typing IQ_c * study in the open box. | + | *#In SPSS this is accomplished using the "compute" command and typing "IQ_c * study" in the open box. |

Line 54: | Line 54: | ||

*How to conduct the regression analysis? | *How to conduct the regression analysis? | ||

*#In SPSS, click on "linear regression" and enter the test score variable as the DV. | *#In SPSS, click on "linear regression" and enter the test score variable as the DV. | ||

- | *#Enter the newly centered continuous variable and the categorical variable as the IVs in the regression analysis | + | *#Enter the newly centered continuous variable and the categorical variable as the IVs in the regression analysis. |

*#Click "next" and enter the same two variables AND the new interaction variable as the IVs. | *#Click "next" and enter the same two variables AND the new interaction variable as the IVs. | ||

*#Run the analysis. | *#Run the analysis. | ||

- | *#In the output, look at the second model in the "Coefficients" box. An interaction is depicted as a significant value for the interaction variable | + | *#In the output, look at the second model in the "Coefficients" box. An interaction is depicted as a significant value for the interaction variable. A significant value for the centered variables can be conceptualized as a "main effect". |

*#If your interaction term is then significant it is recommended you produce plots to assist the interpretation of your interaction. | *#If your interaction term is then significant it is recommended you produce plots to assist the interpretation of your interaction. | ||

Line 65: | Line 65: | ||

==Interaction! software== | ==Interaction! software== | ||

*Given the tedious nature of using the [[#Three Steps using SPSS | three steps described above]] every time you need to test interactions between categorical and continuous variables, I was happy to find Windows-based software which analyzes statistical interactions between dichotomous, categorical, or continuous variables, AND plots the interaction graphs. | *Given the tedious nature of using the [[#Three Steps using SPSS | three steps described above]] every time you need to test interactions between categorical and continuous variables, I was happy to find Windows-based software which analyzes statistical interactions between dichotomous, categorical, or continuous variables, AND plots the interaction graphs. | ||

- | *The software is called [http://www.danielsoper.com/Interaction/default.aspx Interaction!] from a graduate student in the Information Systems department at Arizona State University. I found it very easy to use. | + | *The software is called [http://www.danielsoper.com/Interaction/default.aspx Interaction!] from a graduate student in the Information Systems department at Arizona State University. I found it very easy to use. There is also a good [http://www.danielsoper.com/Interaction/help.aspx Help section] on the website. |

- | + | *When using the software to test the interaction between a categorical and continuous variable, you should center the continuous variable first in SPSS before using the Interaction! software to analyze the data. | |

Line 75: | Line 75: | ||

---- | ---- | ||

- | ◄ Back to [[ | + | ◄ Back to [[Analyzing Data]] page |

## Latest revision as of 20:55, 7 September 2009

Testing interactions between categorical and continuous variables follows the same basic steps as testing interactions between two continuous variables so there is content overlap between this page and the page describing interactions between two continuous variables.

Two approaches are described below:

(1) ** three steps to conduct the interaction using commands within SPSS**, and

(2) ** Interaction! software** by Daniel S. Soper that performs statistical analysis and graphics for interactions between dichotomous, categorical, and continuous variables.

*For a description of what is an interaction and main effects, please see the accompanying page about What is an Interaction?.

## Contents |

## Three Steps using SPSS

There are three steps involved to calculate the interaction between two continuous variables.

### ► **Center** the continuous variable

- Why center the variable?
- To increase interpretability of interactions numerous researchers (e.g. (Aiken and West, 1991); (Judd and McClelland, 1989)) have recommended centering the continuous predictor variable).
- If the variable is not centered there are possible problems with multicolinearity, which means that if the IVs are not centered their product (used in computing the interaction) is highly correlated with the original IV.

- How to center the variable?
- You center the continuous variable by subtracting the mean score from each data-point. In other words, use SPSS, or another statistical program, to find the mean value of the variable. Then, use the "Compute" command in SPSS to create a new variable that is the original values minus the mean.

- As a concrete example,
- Suppose you have 200 subjects (N=200) for which you have their IQ score and whether or not they studied for an exam. Thus, there is one continuous variables (X1=IQ) and one categorical variable(X2=studied or not studied), and your dependent variable is the test score (Y=test score).
- Imagine that the average IQ score is 100. To center the IQ variable, 100 needs to be subtracted from every every subject's IQ score. So if a subject has an IQ of 115, their centered IQ score is 15. If a subject has an IQ of 90, their centered IQ score is -10. For easy reference, lets called the newly centered IQ score as "IQ_c".
- To check your transformation has been performed correctly you should compute the mean of your IQ_c variable. If the centering process has worked the mean score for IQ_c should be 0. It is important that the mean score you subtract is as accurate as possible. Typically this means your mean score should be entered to say at least 4 decimal places (though the number of decimal places needed will depend on your data). If you have rounded your mean score your centered variable may not have a mean of zero.

- There is a macro available that will center the variables
- Macros are useful when you need to perform the same statistical procedure for lots of variables or imagine in the future you will be performing the same analysis over and over again. In other words macros may take some initial time to learn but in the long run will save you time.
- See this website and download the file.
- Open and select run all from the pull-down menu.
- At the bottom of the downloaded file is the following text
- /* --------------------------------------------------------- */.

- /* The macro is called by:

- /* Center IDVar = variable containing casenumbers

- /* /VARS = variables

- /* /DVARS = new variables.

- /* --------------------------------------------------------- */.

- /* --------------------------------------------------------- */.
- You should re-write that text to reflect your current study. For example, remove the "/*" because that is telling SPSS to ignore the enclosed text. Then, insert your variable names into the text, such as
- CenterIDVar = subjects
- /VARS = IQ, study

- In the above example, ‘IQ’ is the variable names in SPSS given to the IQ. ‘Subjects’ indicates the variable containing the case numbers, in this case 1-200 as there were 200 subjects in the study.
- Highlight the text, and click run selection. A new SPSS data editor window should be created at the end of which should be the new SPSS variables IQ_c. You should now save this spss file with a new name.

### ► **Create the interaction term**

- How to create the interaction term?
- Simply multiply together the newly centered variable and the categorical variable.
- In our example, multiple IQ_c x study (e.g., "study" is the variable name for whether the subjects studied for the exam or not).
- In SPSS this is accomplished using the "compute" command and typing "IQ_c * study" in the open box.

### ► **Conduct Regression**

- How to conduct the regression analysis?
- In SPSS, click on "linear regression" and enter the test score variable as the DV.
- Enter the newly centered continuous variable and the categorical variable as the IVs in the regression analysis.
- Click "next" and enter the same two variables AND the new interaction variable as the IVs.
- Run the analysis.
- In the output, look at the second model in the "Coefficients" box. An interaction is depicted as a significant value for the interaction variable. A significant value for the centered variables can be conceptualized as a "main effect".
- If your interaction term is then significant it is recommended you produce plots to assist the interpretation of your interaction.

## Interaction! software

- Given the tedious nature of using the three steps described above every time you need to test interactions between categorical and continuous variables, I was happy to find Windows-based software which analyzes statistical interactions between dichotomous, categorical, or continuous variables, AND plots the interaction graphs.
- The software is called Interaction! from a graduate student in the Information Systems department at Arizona State University. I found it very easy to use. There is also a good Help section on the website.
- When using the software to test the interaction between a categorical and continuous variable, you should center the continuous variable first in SPSS before using the Interaction! software to analyze the data.

◄ Back to Analyzing Data page