Exploratory Factor Analysis: Discovering Measurement Structure

Exploratory factor analysis discovers how many latent factors a set of items represents. Learn factor retention, rotation, loadings, and how it differs from confirmatory analysis.

Dr. Sarah Mitchell

June 13, 2026

Key Takeaways

Exploratory factor analysis discovers how many latent factors a set of items represents when you have no prior measurement model

It is the discovery stage of scale development and comes before confirmatory factor analysis, which tests a known structure

Decide the number of factors with parallel analysis against the 95th percentile of random eigenvalues, cross-checked by a scree plot and Velicer's minimum average partial test, not the eigenvalue-greater-than-one rule alone

Use an oblique rotation when constructs are expected to correlate, which is usual for social and psychological measures

A discovered structure must be confirmed on an independent sample before it is treated as final

Factor analysis and principal components analysis are different models: the common factor model splits shared from unique variance and is what scale development needs, while components reproduce total variance including error

Ordered Likert items should be factored on polychoric correlations with a robust estimator, because Pearson correlations attenuate and can create spurious difficulty factors

Under oblique rotation interpret the pattern matrix and report the factor correlation matrix; remember factor scores are indeterminate, so model the latent variable directly when you can

Exploratory factor analysis is a statistical method that examines the correlations among a set of observed variables to discover how many underlying latent factors they represent and which items belong to each. You use it when you do not yet have a theory about your measurement structure and you want the data to suggest one. It is the discovery stage of scale development, the step that comes before you ever try to confirm a structure.

The contrast with confirmatory factor analysis defines both methods. Confirmatory analysis tests a structure you specified in advance, with each item assigned to its factor before the data are seen. Exploratory factor analysis fixes no such assignments: every item is free to load on every factor, so the pattern of correlations reveals how the items cluster, how many dimensions exist, and where items load weakly or on more than one factor. When you have a brand-new pool of questionnaire items and no prior evidence about their structure, exploration comes first.

When to use exploratory factor analysis

Reach for exploratory factor analysis when you are developing a new scale, when you are adapting an existing instrument so heavily that its original structure may no longer hold, or when you have a large set of items and suspect they reduce to a smaller number of underlying dimensions. It answers three questions: how many factors the items represent, which items hang together, and which items are redundant or do not belong.

It is the wrong tool once you already have a hypothesized structure. If theory or a previous study tells you the factors and their items, you should be confirming that structure, not rediscovering it. Running exploration when you already know the answer wastes the opportunity to test it formally.

The key decisions

Several choices shape the result, and reviewers scrutinize each one.

The first is how many factors to retain. The old habit of keeping every factor with an eigenvalue above one (the Kaiser criterion) tends to over-extract and is now considered unreliable on its own. Better approaches include parallel analysis, which compares your eigenvalues to those from random data of the same size, and the scree plot, read alongside interpretability and theory. Retaining too many factors splinters the structure; retaining too few merges distinct constructs.

The second is the rotation. Rotation makes the loadings easier to interpret. An oblique rotation, which allows factors to correlate, is usually more realistic for psychological and social constructs than an orthogonal rotation that forces them to be independent, because real constructs are rarely uncorrelated.

The third is the extraction method, such as principal axis factoring or maximum likelihood. Maximum likelihood assumes multivariate normality and in return provides model-fit statistics; principal axis factoring makes no distributional assumption and is the safer choice when items are clearly non-normal. These decisions interact, and documenting them transparently is part of a defensible analysis.

Reading the output

The heart of the output is the factor loading matrix, showing how strongly each item relates to each retained factor. You look for items that load cleanly, with a strong loading on one factor of roughly 0.40 or higher and weak loadings elsewhere. An item that cross-loads, loading moderately on two factors, is ambiguous and may need to be revised or dropped. An item that loads weakly everywhere is not measuring any of the factors and is a candidate for removal.
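The screening rule described above can be sketched in a few lines of base R. This is only an illustration: the loading matrix is made up, the helper name classify_items is hypothetical, and the 0.30 cut-off for a "moderate" secondary loading is an assumption (the text fixes only the 0.40 primary threshold).

```r
# Hypothetical pattern matrix: 5 items (rows) by 2 factors (columns)
loadings <- matrix(
  c(0.72, 0.05,
    0.65, 0.12,
    0.45, 0.41,
    0.10, 0.68,
    0.18, 0.22),
  ncol = 2, byrow = TRUE,
  dimnames = list(paste0("item", 1:5), c("F1", "F2"))
)

# Flag each item as clean, cross-loading, or weak.
# primary   = minimum loading on the item's strongest factor (0.40 in the text)
# secondary = assumed cut-off for a loading that counts as "moderate"
classify_items <- function(L, primary = 0.40, secondary = 0.30) {
  a <- abs(L)
  strongest <- apply(a, 1, max)
  n_salient <- rowSums(a >= secondary)
  status <- ifelse(strongest < primary, "weak",
                   ifelse(n_salient >= 2, "cross-loading", "clean"))
  data.frame(
    item = rownames(L),
    primary_factor = colnames(L)[apply(a, 1, which.max)],
    status = unname(status),
    row.names = NULL
  )
}

result <- classify_items(loadings)
print(result)
```

Here item3 is flagged as cross-loading and item5 as weak; whether to revise or drop them remains a substantive judgment, not something the thresholds decide.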

Before trusting any of this, check that the data are suitable for factoring at all. The Kaiser-Meyer-Olkin measure of sampling adequacy should be acceptable, generally above 0.60, and Bartlett's test of sphericity should be significant, indicating the items are correlated enough to factor. Skipping these checks risks extracting structure from data that has none.
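To make the two suitability checks concrete, here is a minimal base-R sketch of how they are computed. The data are simulated (two factors, loadings of 0.7, 500 cases, all assumptions chosen for illustration), and the variable names are hypothetical. In practice the psych package's KMO() and cortest.bartlett() should report the same quantities.

```r
set.seed(42)

# Simulated items (an assumption for illustration): 8 items, 2 factors
n <- 500
lambda <- rbind(cbind(rep(0.7, 4), 0), cbind(0, rep(0.7, 4)))   # 8 x 2 loadings
latent <- matrix(rnorm(n * 2), n, 2)
items  <- latent %*% t(lambda) + matrix(rnorm(n * 8), n, 8) * sqrt(1 - 0.7^2)

R <- cor(items)
p <- ncol(R)

# Kaiser-Meyer-Olkin: squared correlations relative to squared correlations
# plus squared partial correlations (off-diagonal elements only)
S <- solve(R)
partial <- -S / sqrt(outer(diag(S), diag(S)))
off <- row(R) != col(R)
kmo <- sum(R[off]^2) / (sum(R[off]^2) + sum(partial[off]^2))

# Bartlett's test of sphericity: is R distinguishable from an identity matrix?
chi2 <- -(n - 1 - (2 * p + 5) / 6) * log(det(R))
df <- p * (p - 1) / 2
p_value <- pchisq(chi2, df, lower.tail = FALSE)

cat("KMO:", round(kmo, 2), " Bartlett chi-square:", round(chi2, 1),
    " df:", df, " p:", signif(p_value, 3), "\n")
```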

Sample size

Exploratory factor analysis needs enough cases for the correlations to be stable, and how many is enough depends on the data: strong, clean structures, with high communalities and several items per factor, require fewer cases than weak, muddy ones. Fixed cases-per-item ratios are rough guidance only. The reliable path is to plan the sample with the same care you would for any analysis, using the reasoning in our power analysis and sample size guide and our sample size calculator to sanity-check feasibility before collecting data.

What comes next

Exploratory analysis suggests a structure; it does not confirm one. The standard workflow is to explore on one sample and then confirm the resulting structure on an independent sample with a confirmatory measurement model, and ultimately to build that validated measurement model into a structural equation model if your study tests relationships among the constructs. Treating an exploratory result as final, without confirming it on fresh data, is a common shortcut that weakens a scale-development paper.
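When a second sample cannot be collected, one common way to approximate the explore-then-confirm workflow is to split a single sample at random before looking at the data. The sketch below assumes a placeholder data frame of 400 simulated responses; the object names are hypothetical, and a random split is a substitute for, not the equivalent of, a genuinely independent sample.

```r
set.seed(2026)

# Placeholder data (an assumption): 400 respondents, 6 five-point items
items <- as.data.frame(matrix(sample(1:5, 400 * 6, replace = TRUE), ncol = 6))

# Randomly assign half of the rows to exploration, the rest to confirmation
explore_rows <- sample(nrow(items), size = floor(nrow(items) / 2))
explore_half <- items[explore_rows, ]    # run the exploratory analysis here
confirm_half <- items[-explore_rows, ]   # hold out for the confirmatory model

c(explore = nrow(explore_half), confirm = nrow(confirm_half))
```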

Common mistakes

Using the eigenvalue-greater-than-one rule alone. It over-extracts. Use parallel analysis and a scree plot together.
Forcing orthogonal rotation on correlated constructs. Allow factors to correlate when theory says they should.
Keeping ambiguous items. Cross-loading and weakly loading items muddy the structure and should be revised or removed.
Skipping the suitability checks. Run the sampling adequacy measure and the sphericity test before interpreting factors.
Treating exploration as confirmation. A discovered structure must be confirmed on an independent sample.

The common factor model, and why it is not principal components

The most consequential decision is made before any rotation: factor analysis and principal components analysis are different models, and the software offers both under confusingly similar menus. The common factor model partitions each item's variance into a part shared with the other items (the communality) and a part unique to that item (measurement error plus item-specific variance), and it models only the shared part:

x_i = l_i1*F1 + l_i2*F2 + ... + l_im*Fm + e_i

Here the latent factors F explain the correlations among items, the loadings l are the regression weights of items on factors, and e is the unique factor. Principal components analysis makes no such split: components are weighted sums of the items that reproduce total variance, error included, so they are summaries, not latent causes. For scale development you almost always want the common factor model (principal axis factoring or maximum likelihood extraction), because the question is what unobserved construct generates the responses, not how to compress the items. Reporting "factor analysis" while having run principal components is one of the quiet errors a measurement reviewer catches immediately.
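The practical consequence of the distinction can be seen in a small simulation. The sketch below is an illustration under stated assumptions: six items generated from one factor with true loadings of 0.6, and 2000 simulated cases. It compares maximum likelihood factor loadings from base R's factanal() with first-component loadings from an eigendecomposition of the same correlation matrix.

```r
set.seed(7)

# Simulated data (an assumption): 6 items, one factor, true loading 0.6 each
n <- 2000
p <- 6
latent <- rnorm(n)
items <- sapply(1:p, function(j) 0.6 * latent + sqrt(1 - 0.6^2) * rnorm(n))

# Common factor model: models only the shared variance
fa_fit <- factanal(items, factors = 1)
fa_loadings <- abs(as.numeric(fa_fit$loadings))
communality <- 1 - fa_fit$uniquenesses

# Principal components: reproduces total variance, unique variance included
e <- eigen(cor(items), symmetric = TRUE)
pc_loadings <- abs(e$vectors[, 1] * sqrt(e$values[1]))

round(cbind(factor = fa_loadings, component = pc_loadings,
            communality = communality), 2)
```

The factor loadings should sit near the generating value of 0.6, while the component loadings come out larger because each component absorbs part of every item's unique variance.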

Deciding the number of factors without the eigenvalue rule

Parallel analysis is the right default, but it has its own choices. Horn's parallel analysis compares each observed eigenvalue to the distribution of eigenvalues from many random datasets of the same size; retain a factor only while the observed eigenvalue exceeds the 95th percentile (not merely the mean) of the random eigenvalues, which is the more conservative and reproducible threshold. Run it on the factor model, not on components, so the random baseline matches what you are estimating. Pair it with Velicer's minimum average partial test, which stops adding factors when the average squared partial correlation among items starts to rise, and treat strong disagreement between the two as a signal to examine the borderline factor for interpretability rather than to trust a single rule. When you extract by maximum likelihood you additionally get a formal model-fit chi-square and approximate fit indices, which give an independent check on whether a given number of factors reproduces the correlation matrix.
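The logic of the 95th-percentile rule can be sketched in base R. This is a simplified illustration, not the psych implementation: the data are simulated (two factors, loadings of 0.8, 1000 cases), the helper name reduced_eigenvalues is hypothetical, and the factor-model eigenvalues are approximated by placing squared multiple correlations on the diagonal of the correlation matrix, which is one of several ways to run parallel analysis on factors rather than components.

```r
set.seed(123)

# Simulated data (an assumption for illustration): 8 items, 2 uncorrelated factors
n <- 1000
p <- 8
lambda <- rbind(cbind(rep(0.8, 4), 0), cbind(0, rep(0.8, 4)))   # 8 x 2 loadings
latent <- matrix(rnorm(n * 2), n, 2)
noise  <- matrix(rnorm(n * p), n, p) * sqrt(1 - 0.8^2)
items  <- latent %*% t(lambda) + noise

# Eigenvalues of the reduced correlation matrix:
# squared multiple correlations replace the ones on the diagonal
reduced_eigenvalues <- function(x) {
  R <- cor(x)
  diag(R) <- 1 - 1 / diag(solve(R))
  eigen(R, symmetric = TRUE, only.values = TRUE)$values
}

observed <- reduced_eigenvalues(items)

# Same computation on 200 random normal datasets of the same size
random <- replicate(200, reduced_eigenvalues(matrix(rnorm(n * p), n, p)))
threshold <- apply(random, 1, quantile, probs = 0.95)   # 95th percentile per position

# Retain factors only while the observed eigenvalue beats the random threshold
exceeds <- observed > threshold
n_factors <- if (all(exceeds)) p else which(!exceeds)[1] - 1

round(cbind(observed, threshold), 3)
n_factors
```

Counting only the leading run of eigenvalues that exceed the threshold, rather than every eigenvalue that does, matches the "retain a factor only while" wording above.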

Likert items need polychoric correlations

Most questionnaires use ordered categories (strongly disagree to strongly agree), yet the default factor analysis runs on Pearson correlations, which assume continuous, normally distributed variables. With five or fewer categories, or visibly skewed items, Pearson correlations are attenuated and can manufacture spurious "difficulty factors" that merely group items by how extreme their response distributions are. The correct treatment is to factor the matrix of polychoric correlations, which estimate the association between the continuous traits assumed to underlie each pair of ordinal items, and to extract with a robust estimator such as weighted least squares or unweighted least squares. Treating ordinal data as interval is defensible only when items have many categories and roughly symmetric distributions.
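The attenuation is easy to demonstrate. The sketch below is an illustration with assumed values: two latent traits correlated at 0.6, cut into five ordered categories at made-up thresholds that skew the two items in opposite directions. It shows only the problem; estimating the polychoric correlation itself is left to a dedicated routine such as polychoric() in the psych package.

```r
set.seed(11)

# Two continuous latent traits with a true correlation of 0.6 (an assumption)
n <- 5000
rho <- 0.6
z1 <- rnorm(n)
z2 <- rho * z1 + sqrt(1 - rho^2) * rnorm(n)

# Cut each trait into 5 ordered categories; the thresholds are made up
# so that one item is skewed toward low scores and the other toward high
item1 <- cut(z1, breaks = c(-Inf, -0.2, 0.5, 1.0, 1.6, Inf), labels = FALSE)
item2 <- cut(z2, breaks = c(-Inf, -1.6, -1.0, -0.5, 0.2, Inf), labels = FALSE)

r_latent  <- cor(z1, z2)         # correlation between the underlying traits
r_pearson <- cor(item1, item2)   # Pearson correlation of the 1-5 codes

round(c(latent = r_latent, pearson_on_codes = r_pearson), 2)
```

The Pearson correlation computed on the category codes falls below the latent correlation, and the shortfall grows as the items become coarser or more unevenly skewed.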

Factor scores are indeterminate, and the rotation output has two matrices

Two technical points routinely trip people up when they move from extraction to use. First, under an oblique rotation the software prints both a pattern matrix and a structure matrix. The pattern matrix holds the unique loading of each item on each factor, controlling for the other factors, and is the one to interpret. The structure matrix holds the item-factor correlations, which also carry the overlap among correlated factors and so look larger. Always report the factor correlation matrix alongside, because correlations near or above 0.7 suggest the factors may not be distinct. Second, factor scores are indeterminate: the model posits more latent variables (the common factors plus one unique factor per item) than there are observed items, so infinitely many sets of scores are consistent with the same loadings, and a factor score is an estimate, not the latent value. If you must carry scores forward, Bartlett scores are unbiased for the factor they target and Anderson-Rubin scores are mutually uncorrelated, properties that simple regression scores lack, although forcing scores to be uncorrelated fits an orthogonal solution better than an oblique one. The cleaner path is to model the latent variable directly in a structural equation model rather than scoring and regressing in two error-prone steps.
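The relationship between the two matrices is a single matrix product: the structure matrix equals the pattern matrix multiplied by the factor correlation matrix. The numbers below are made up purely to show the arithmetic.

```r
# Hypothetical pattern matrix: 3 items by 2 factors
pattern <- rbind(
  item1 = c(0.70, 0.00),
  item2 = c(0.60, 0.10),
  item3 = c(0.00, 0.80)
)
colnames(pattern) <- c("F1", "F2")

# Hypothetical factor correlation matrix: the two factors correlate 0.5
phi <- matrix(c(1.0, 0.5,
                0.5, 1.0), nrow = 2,
              dimnames = list(c("F1", "F2"), c("F1", "F2")))

# Structure matrix: item-factor correlations
structure <- pattern %*% phi
structure
```

item1 has a pattern loading of zero on F2 yet correlates 0.35 with it, purely because F1 and F2 are correlated. That is why the structure matrix overstates how many factors an item belongs to.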

A worked exploration in R

library(psych)          # KMO(), cortest.bartlett(), fa.parallel(), vss(), fa()
library(GPArotation)    # supplies the oblimin rotation that fa() calls

# `items` is assumed to be a data frame with one numeric column per item

# Suitability: sampling adequacy and sphericity before anything else
KMO(items)
cortest.bartlett(items)

# How many factors: parallel analysis on the FACTOR model, plus Velicer's MAP
fa.parallel(items, fm = 'ml', fa = 'fa', n.iter = 500, quant = 0.95)   # 95th-percentile rule
vss(items, n = 6)                                         # MAP and very-simple-structure

# Extraction by maximum likelihood with an oblique (oblimin) rotation
# nfactors = 3 is illustrative; use the number the retention checks support
f <- fa(items, nfactors = 3, fm = 'ml', rotate = 'oblimin')
print(f$loadings, cutoff = 0.30)   # pattern matrix
f$Phi                              # factor correlation matrix

# Ordinal items: factor the polychoric correlations instead
f_ord <- fa(items, nfactors = 3, fm = 'wls', rotate = 'oblimin', cor = 'poly')

Bringing it together

Exploratory factor analysis is the disciplined discovery of measurement structure: let the data suggest how many factors exist and which items belong, document every decision, and then confirm the result on fresh data. It is the first step in building a scale that will eventually survive peer review.

Pro Tip

Check suitability first

Run the Kaiser-Meyer-Olkin measure of sampling adequacy and Bartlett's test of sphericity before interpreting factors. They tell you whether the data can be factored at all.

Pro Tip

Resolve ambiguous items

Items that cross-load on two factors or load weakly everywhere muddy the structure. Revise or remove them rather than forcing them into a factor.

Pro Tip

Make sure you ran factor analysis, not principal components

Principal components reproduce total variance including error and are summaries, not latent constructs. For scale development extract with principal axis factoring or maximum likelihood so the model explains the shared variance that a construct would generate.

Pro Tip

Factor polychoric correlations for Likert items

Pearson correlations assume continuous normal data and are attenuated for five-point items, which can produce factors that merely group items by response extremity. Use polychoric correlations with a weighted or unweighted least squares estimator.

Frequently Asked Questions

What is the difference between exploratory and confirmatory factor analysis?

Exploratory factor analysis lets the data suggest how many factors exist and which items load on them when you have no prior model. Confirmatory factor analysis tests a structure you specified in advance. Exploration is the discovery stage; confirmation tests a hypothesized structure, ideally on a separate sample.

How do I decide how many factors to retain?

The most reliable approaches are parallel analysis, which compares your eigenvalues to those from random data, and the scree plot read alongside interpretability and theory. The older rule of keeping every factor with an eigenvalue above one tends to over-extract and is no longer recommended on its own.

What counts as a good factor loading?

A clean item loads strongly on one factor, generally around 0.40 or higher, and weakly on the others. Items that cross-load moderately on two factors are ambiguous, and items that load weakly everywhere are not measuring any factor; both are candidates for revision or removal.

Should I use an oblique or an orthogonal rotation?

Use oblique rotation when you expect the factors to correlate, which is typical for social and psychological constructs, because it produces a more realistic and interpretable solution. Orthogonal rotation forces factors to be independent and is appropriate only when the constructs are genuinely unrelated.

Written by

Dr. Sarah Mitchell

PhD, Biostatistics & Research Methodology

Systematic Review Methodology, Meta-Analysis, Biostatistics

Dr. Sarah Mitchell holds a PhD in Biostatistics from Johns Hopkins Bloomberg School of Public Health and has over 15 years of experience in systematic review methodology and meta-analysis. She has authored or co-authored 40+ peer-reviewed publications in journals including the Journal of Clinical Epidemiology, BMC Medical Research Methodology, and Research Synthesis Methods. A former Cochrane Review Group statistician and current editorial board member of Systematic Reviews, Dr. Mitchell has supervised 200+ evidence synthesis projects across clinical medicine, public health, and social sciences.
