the assessment of performance and self-report validity in persons claiming pain-related disability

31
This article was downloaded by: [Dalhousie University] On: 13 November 2014, At: 02:21 Publisher: Routledge Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered office: Mortimer House, 37-41 Mortimer Street, London W1T 3JH, UK The Clinical Neuropsychologist Publication details, including instructions for authors and subscription information: http://www.tandfonline.com/loi/ntcn20 The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability Kevin W. Greve a b , Kevin J. Bianchini a b & Steve T. Brewer a a Department of Psychology , University of New Orleans , New Orleans , LA , USA b Jefferson Neurobehavioral Group , Metairie , LA , USA Published online: 16 Nov 2012. To cite this article: Kevin W. Greve , Kevin J. Bianchini & Steve T. Brewer (2013) The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability, The Clinical Neuropsychologist, 27:1, 108-137, DOI: 10.1080/13854046.2012.739646 To link to this article: http://dx.doi.org/10.1080/13854046.2012.739646 PLEASE SCROLL DOWN FOR ARTICLE Taylor & Francis makes every effort to ensure the accuracy of all the information (the “Content”) contained in the publications on our platform. However, Taylor & Francis, our agents, and our licensors make no representations or warranties whatsoever as to the accuracy, completeness, or suitability for any purpose of the Content. Any opinions and views expressed in this publication are the opinions and views of the authors, and are not the views of or endorsed by Taylor & Francis. The accuracy of the Content should not be relied upon and should be independently verified with primary sources of information. Taylor and Francis shall not be liable for any losses, actions, claims, proceedings, demands, costs, expenses, damages, and other liabilities whatsoever or howsoever caused arising directly or indirectly in connection with, in relation to or arising out of the use of the Content. This article may be used for research, teaching, and private study purposes. Any substantial or systematic reproduction, redistribution, reselling, loan, sub-licensing, systematic supply, or distribution in any form to anyone is expressly forbidden. Terms & Conditions of access and use can be found at http://www.tandfonline.com/page/terms- and-conditions

Upload: steve-t

Post on 18-Mar-2017

213 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

This article was downloaded by: [Dalhousie University]On: 13 November 2014, At: 02:21Publisher: RoutledgeInforma Ltd Registered in England and Wales Registered Number: 1072954 Registeredoffice: Mortimer House, 37-41 Mortimer Street, London W1T 3JH, UK

The Clinical NeuropsychologistPublication details, including instructions for authors andsubscription information:http://www.tandfonline.com/loi/ntcn20

The Assessment of Performance andSelf-report Validity in Persons ClaimingPain-related DisabilityKevin W. Greve a b , Kevin J. Bianchini a b & Steve T. Brewer aa Department of Psychology , University of New Orleans , NewOrleans , LA , USAb Jefferson Neurobehavioral Group , Metairie , LA , USAPublished online: 16 Nov 2012.

To cite this article: Kevin W. Greve , Kevin J. Bianchini & Steve T. Brewer (2013) The Assessmentof Performance and Self-report Validity in Persons Claiming Pain-related Disability, The ClinicalNeuropsychologist, 27:1, 108-137, DOI: 10.1080/13854046.2012.739646

To link to this article: http://dx.doi.org/10.1080/13854046.2012.739646

PLEASE SCROLL DOWN FOR ARTICLE

Taylor & Francis makes every effort to ensure the accuracy of all the information (the“Content”) contained in the publications on our platform. However, Taylor & Francis,our agents, and our licensors make no representations or warranties whatsoever as tothe accuracy, completeness, or suitability for any purpose of the Content. Any opinionsand views expressed in this publication are the opinions and views of the authors,and are not the views of or endorsed by Taylor & Francis. The accuracy of the Contentshould not be relied upon and should be independently verified with primary sourcesof information. Taylor and Francis shall not be liable for any losses, actions, claims,proceedings, demands, costs, expenses, damages, and other liabilities whatsoever orhowsoever caused arising directly or indirectly in connection with, in relation to or arisingout of the use of the Content.

This article may be used for research, teaching, and private study purposes. Anysubstantial or systematic reproduction, redistribution, reselling, loan, sub-licensing,systematic supply, or distribution in any form to anyone is expressly forbidden. Terms &Conditions of access and use can be found at http://www.tandfonline.com/page/terms-and-conditions

Page 2: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

http://dx.doi.org/10.1080/13854046.2012.739646

The Assessment of Performance and Self-report Validity inPersons Claiming Pain-related Disability

Kevin W. Greve1,2, Kevin J. Bianchini1,2, and Steve T. Brewer1

1Department of Psychology, University of New Orleans, New Orleans, LA, USA2Jefferson Neurobehavioral Group, Metairie, LA, USA

One third of all people will experience spinal pain in their lifetime and half of these will

experience chronic pain. Pain often occurs in the context of a legally compensable event with

back pain being the most common reason for filing a Workers Compensation claim in the

United States. When financial incentives to appear disabled exist, malingered pain-related

disability is a potential problem. Malingering may take the form of exaggerated physical,

emotional, or cognitive symptoms and/or under-performance on measures of cognitive and

physical capacity. Essential to the accurate detection of Malingered Pain-related Disability is

the understanding that malingering is an act of will, the goal of which is to increase the

appearance of disability beyond that which would naturally arise from the injury in

question. This paper will review a number of Symptom Validity Tests (SVTs) that have been

developed to detect malingering in patients claiming pain-related disability and will conclude

with a review of studies showing the diagnostic benefit of combining SVT findings from a

comprehensive malingering assessment. The utilization of a variety of tools sensitive to the

multiple manifestations of malingering increases the odds of detecting invalid claims while

reducing the risk of rejecting a valid claim.

Keywords: Chromic pain; Malingering; Assessment.

INTRODUCTION

About a third of all people will experience spinal pain in their lifetimes andhalf of these will experience chronic pain (Von Korff et al., 2005). The medianprevalence of benign chronic pain is 15% (ranging from 2% to 40%, depending onthe specific study) with the most frequent pain sites being low back, neck, andshoulder (Verhaak, Kerssens, Dekker, Sorbi, & Bensing, 1998). Pain often occurs inthe context of a legally compensable event such as a work-related injury or incidentin which some other party is potentially liable, and in the United States back pain isthe most common reason for filing a Workers Compensation claim (Guo, Tanaka,Halperin, & Cameron, 1999).

Malingering is a potential problem in contexts where there are financialincentives to appear disabled. Malingering is ‘‘the intentional production of false orgrossly exaggerated physical or psychological symptoms, motivated by externalincentives such as avoiding military duty, avoiding work, obtaining financialcompensation, evading criminal prosecution, or obtaining drugs’’ (DSM-IV-TR;American Psychiatric Association, 2000, p. 739). Malingered Pain-Related

The Assessment of Performance and Self-report Validity inPersons Claiming Pain-related Disability

Kevin W. Greve1,2, Kevin J. Bianchini1,2, and Steve T. Brewer1

1Department of Psychology, University of New Orleans, New Orleans, LA, USA2Jefferson Neurobehavioral Group, Metairie, LA, USA

One third of all people will experience spinal pain in their lifetime and half of these will

experience chronic pain. Pain often occurs in the context of a legally compensable event with

back pain being the most common reason for filing a Workers Compensation claim in the

United States. When financial incentives to appear disabled exist, malingered pain-related

disability is a potential problem. Malingering may take the form of exaggerated physical,

emotional, or cognitive symptoms and/or under-performance on measures of cognitive and

physical capacity. Essential to the accurate detection of Malingered Pain-related Disability is

the understanding that malingering is an act of will, the goal of which is to increase the

appearance of disability beyond that which would naturally arise from the injury in

question. This paper will review a number of Symptom Validity Tests (SVTs) that have been

developed to detect malingering in patients claiming pain-related disability and will conclude

with a review of studies showing the diagnostic benefit of combining SVT findings from a

comprehensive malingering assessment. The utilization of a variety of tools sensitive to the

multiple manifestations of malingering increases the odds of detecting invalid claims while

reducing the risk of rejecting a valid claim.

Keywords: Chromic pain; Malingering; Assessment.

INTRODUCTION

About a third of all people will experience spinal pain in their lifetimes andhalf of these will experience chronic pain (Von Korff et al., 2005). The medianprevalence of benign chronic pain is 15% (ranging from 2% to 40%, depending onthe specific study) with the most frequent pain sites being low back, neck, andshoulder (Verhaak, Kerssens, Dekker, Sorbi, & Bensing, 1998). Pain often occurs inthe context of a legally compensable event such as a work-related injury or incidentin which some other party is potentially liable, and in the United States back pain isthe most common reason for filing a Workers Compensation claim (Guo, Tanaka,Halperin, & Cameron, 1999).

Malingering is a potential problem in contexts where there are financialincentives to appear disabled. Malingering is ‘‘the intentional production of false orgrossly exaggerated physical or psychological symptoms, motivated by externalincentives such as avoiding military duty, avoiding work, obtaining financialcompensation, evading criminal prosecution, or obtaining drugs’’ (DSM-IV-TR;American Psychiatric Association, 2000, p. 739). Malingered Pain-Related

Address correspondence to: Kevin W. Greve, Ph.D., Jefferson Neurobehavioral Group, 2901 N.

I-10 Service Road East, Metairie, LA 70002, USA. E-mail: [email protected]

Accepted for publication: October 5, 2012. First published online: November 16, 2012.

� 2013 Taylor & Francis

The Clinical Neuropsychologist, 2013

Vol. 27, No. 1, 108–13 ,7

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 3: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

Disability (MPRD) is the ‘‘intentional exaggeration or fabrication of cognitive,emotional, behavioral, or physical dysfunction attributed to pain for the purposesof obtaining financial gain, to avoid work, or to obtain drugs’’ (Bianchini, Greve, &Glynn, 2005, p. 407).

Conceptually, what is being malingered is disability: the altered capacity tocare for oneself or others, or to work. The two primary vectors by which malingereddisability can manifest are (1) diminished or reduced capability and/or(2) exaggerated subjective symptoms. Clinically, pain patients may claim disabilitydue to emotional or cognitive problems in addition to the physical limitations thatare typically attributed to injuries and illness. Malingered disability may alsomanifest in these domains. Thus malingering may take the form of exaggeratedphysical, emotional, or cognitive symptoms and/or under-performance on measuresof cognitive and physical capacity (Bianchini et al., 2005).

Thus disability can potentially be achieved via a variety and mixture ofpresentations. Malingering is an act of will and its specific manifestation depends onthe idiosyncrasies of the individual case, including the claimant’s beliefs regardingthe effects of their injury and efforts to avoid detection. A variety of ‘‘SymptomValidity Tests’’1 (SVT) have been developed to detect different forms of themalingering presentation. However, because of the varying malingering presenta-tions, it is unreasonable to expect any one of these SVTs to have perfect sensitivity(detection of all malingering cases). All SVTs will fail to detect some cases whenproper attention is paid to minimizing false positive (FP) errors (Greve & Bianchini,2004). Fortunately, perfect sensitivity is not essential for an SVT to be a valuabletool. Larrabee, Greiffenstein, Greve, and Bianchini (2007, p. 338, emphasis added)made this point clearly:

It is not necessary to rely on single events that unequivocally demonstrate intent

because the MND [Malingered Neurocognitive Dysfunction; Slick, Sherman, &

Iverson, 1999] and MPRD criteria are based on behaviors and symptom report that are

atypical for and not representative of expected clinical findings in legitimate,

unequivocal neurological, psychiatric or developmental disorders. Thus intent is

inferred as a result of the combined improbability of events rather than relying on a single

definitive indication of intent.

This passage emphasizes two points that are critical to the proper assessmentof malingering. First is the idea of ‘‘improbability.’’ Generally speaking, ‘‘improb-ability’’ is determined empirically by comparing clinical patients who have beendiagnosed as malingering using explicit criteria (usually based on the Slick et al.,1999, or Bianchini et al., 2005, systems) to etiologically similar patients, who oftenhave greater objective pathology than the malingering group. This method isreferred to as ‘‘known-groups’’ (Greve & Bianchini, 2004; Rogers, 1997) or

1 It is becoming a more common practice to refer to those measures assessing the validity of performance

on ability tests as ‘‘performance validity tests’’ (PVT) and reserving the term ‘‘symptom validity test’’ for

those indicators designed to assess exaggeration of subjectively reported symptoms. These latter

indicators may also be referred to as ‘‘self-report validity tests.’’ However, for simplicity of explication

and in keeping with long-standing usage, this paper will refer to any psychometric test, score, or indicator

used to detect invalid performance on measures of cognitive or physical capacity or exaggeration of

subjective symptoms as a ‘‘symptom validity test’’.

109VALIDITY ASSESSMENT IN CHRONIC PAIN

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 4: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

‘‘criterion-groups’’ (Dawes & Meehl, 1966; Frederick, 2000) validation.2 Criteriongroups validation is considered the preferred term (Heilbronner et al., 2009). Usingthis approach one can identify score levels beyond which few or no non-malingeringpatients perform. The degree of ‘‘improbability’’ is expressed concretely as the FPrate at a given score level.

The second point is the importance of combining SVTs. When a variety ofSVTs sensitive to the potential idiosyncrasies of the malingering presentation areused in combination, the likelihood that a malingerer will go undetected (FalseNegative [FN] error rate) will be significantly reduced. Moreover, as will bediscussed later, when these SVTs have low inter-correlations in non-malingeringcases then the FP rate of the battery as a whole may actually be lower than the FPrate of individual SVTs (Larrabee, 2003a; Meyers & Volbrecht, 2003; Vickery et al.,2004; Victor, Boone, Serpa, Buehler, & Ziegler, 2009). Thus an appropriatelydesigned comprehensive assessment of performance validity and symptom exag-geration will increase the confidence that the clinician or researcher has that anindividual case’s malingering status has been clearly and correctly characterized.

In this paper we will review a variety of SVTs developed to detectmalingering in patients claiming pain-related disability, all of which have beenstudied using criterion groups methodology. These SVTs assess the primarybehavioral domains relevant to MPRD (and probably all compensable injury/illness malingering presentations regardless of etiology). In most cases dataspecific to pain patients will be presented; when available and appropriate, dataderived from other clinical groups (primarily traumatic brain injury [TBI]) will bepresented for contrast. The paper will conclude with a review of the studiesshowing the diagnostic benefit of combining SVT findings from a comprehensivemalingering assessment.

METHODS OF MALINGERING DETECTION

Over the last 15 years or more, numerous SVTs have been developed andempirically validated, mostly in TBI.3 A number of cognitive, emotional, andhealth-related SVTs have now been validated in patients with chronic pain. Inkeeping with the idea that malingering can occur on two vectors, the discussion ofSVTs in pain will be divided into two sections: (1) those assessing diminished orreduced capacity (i.e., performance validity) and (2) those measuring exaggerationof subjective symptoms (self-report validity).

Assessing performance validity

Measures of performance validity are intended to determine if an individualhas under-performed on a tests of perceptual, cognitive, or physical capacity

2 ‘‘Mixed-group’’ validation (Dawes & Meehl, 1966) has certain advantages over ‘‘criterion-groups’’

validation and may result in more precise estimates of classification accuracy (Frederick, 2000; Frederick

& Bowden, 2009).3A review of this research is beyond the scope of this paper and the reader is referred to three recent

publications on this topic for more information (Boone, 2007; Larrabee, 2007; Morgan & Sweet, 2009).

3110 KEVIN W. GREVE ET AL.

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 5: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

or ability. These may take two general forms (1) stand-alone SVTs; and(2) embedded indicators which are derived from clinical measures of cognitiveability.

Stand-alone SVTs. Stand-alone SVTs are independent tests specificallydesigned to detect efforts to misrepresent the status of a cognitive or perceptualfunction. Among the earliest stand-alone SVTs were those devised by Andre Reysuch as the Dot Counting Test and the 15-Item Test (for discussion, see Lezak,Howieson, & Loring, 2004). The 15-Item Test is one of the most frequently givenSVTs (Slick, Tan, Strauss, & Hultsch, 2004) despite its limited diagnostic value(Resnick, 2005). These and similar SVTs have been subjected to extensive modernresearch to determine and improve the diagnostic accuracy (for recent review, seeBoone & Lu, 2007; Nitch & Glassmire, 2007).

A second variety of stand-alone SVT, the forced-choice SVT, is of more recentvintage but has come to have a central place in the batteries of most psychologistsengaged in the assessment of persons with incentive to appear disabled. Initiallydescribed for use among patients presenting suspicious sensory-perceptual deficits(Binder, 1992a, 1992b; Brady & Lind, 1961; Grosz & Zimmerman, 1965; Pankratz,Fausti, & Peed, 1975),4 Pankratz (1983) is credited with naming this procedure‘‘symptom validity testing’’ and, in collaboration with Binder, with adapting it toquestions of exaggerated memory complaints (Binder & Pankratz, 1987).

The procedure for the forced-choice SVT involves the presentation of ato-be-remembered stimulus (target) followed by a (usually) two-choice recognitiontrial. In the absence of memory a patient should select 50% of the targets by chancealone. Significantly below chance performance on an SVT requires active avoidanceof the correct response (Frederick & Speed, 2007; Reynolds, 1998). Numerousstand-alone forced-choice SVTs using a variety of stimuli and methodologicalvariations have been published (Bianchini, Mathias, & Greve, 2001) although only ahandful are commonly used in clinical practice (Slick et al., 2004).

Among the more popular and well-studied tests are the Test of MemoryMalingering (TOMM; Tombaugh, 1996), Word Memory Test (WMT; Green, 2005;Green, Allen, & Astner, 1996), and the Portland Digit Recognition Test (PDRT;Binder, 1990, 1993). We have been using stand-alone SVTs with pain patients in ourpractice since the early to mid-1990s when these tests became commerciallyavailable, and routinely use the PDRT, TOMM, and WMT in some combination inall our pain assessments. All three have been subjected to intense scientific scrutinyincluding validation in patients with pain.

Individually the PDRT, TOMM, and WMT are equally capable of detectingMPRD at cutoffs associated with comparable FP rates (Greve, Ord, Curtis,Bianchini, & Brennan, 2008). At cutoffs recommended in their respective manuals(Binder, 1990; Tombaugh, 1996), the PDRT and TOMM were very specific(FP rate¼ 2%) in non-malingering patients with pain but they failed to detect about50% of malingerers; in contrast, at the author-recommended cutoffs (Green, 2005;Green et al., 1996), the WMT was very sensitive (80%) but prone to FP errors

4The forced-choice methodology has recently been applied to persons with primary pain complaints who

also reported impaired somatosensory function (Greve, Bianchini, & Ameduri, 2003).

111VALIDITY ASSESSMENT IN CHRONIC PAIN

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 6: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

(FP rate¼ 30%). These findings emphasize the importance of proper calibrationand of reporting classification accuracy for a range of scores rather than relying ona single cutting scores as well as the reliance on appropriate research methods fordetermining classification accuracy. Moreover, failure to consider FP rates whencomparing sensitivity will result in meaningless and possibly misleading conclusions.

In replications with larger samples, at the 2% FP cutoffs identified by Greveet al. (2008), the individual subtests of the PDRT and TOMM detected 30–45% ofMPRD cases at FP rates of less than 5% (Greve, Etherton, Ord, Bianchini, &Curtis, 2009; Greve, Bianchini, Etherton, Ord, & Curtis, 2009). The WMTdemonstrated sensitivity of about 55% but the FP rates were between 7 and 15%(Greve, Ord, Etherton, Bianchini, & Curtis, 2009). The reader is referred to thesevarious papers for detailed classification accuracy data.

Greve et al. (2008) also examined the accuracy of these three SVTs incombination in pain patients (see Table 1). At cutoffs associated with low FP(e.g., 2%) for individual subtests, the joint FP rate never exceeded 5% for any pairof SVTs or all three together. However, at cutoffs associated with higher individualFP rates (i.e., 10%), the joint FP rates sometimes doubled. Requiring at least twopositive findings at this more aggressive cutoff resulted in FP rates of 10% or less).At the same time, failure on any one SVT at the 2% cutoff resulted in equal or betterclassification accuracy compared to failure of any two SVTs at the 10% level. So itmay be that little is gained by using the more aggressive cutoffs.

When used in combination, failure on one or more of these three SVTs atcutoffs associated with an individual subtest FP rate of 2% results in the detectionof 69% of MPRD cases while maintaining a joint FP rate of 5%. Also included inTable 1 is the Likelihood Ratio (LR; LR¼ sensitivity/FP error rate). The LRindicates the likelihood that a score was produced by a malingerer relative to non-malingerers. An LR value of 1.0 indicates that the score does not differentiatebetween groups whereas a higher LR value indicates a higher probability thatsomeone is malingering versus not (Grimes & Schulz, 2005). At the 2% FP cutoffs,the LR’s are extremely high, indicating that the tests are excellent at discriminatingmalingering from non-malingering cases.

The value of stand-alone SVTs in the detection of MPRD is evident: they areprobably among the most accurate of SVTs. However, stand-alone SVTs do havesome disadvantages. First, even when used jointly they fail to detect 30% or more ofMPRD cases. Being sensitive primarily to feigned memory impairment, theyprovide limited coverage of relevant behavioral domains. So if an individualmalingerer chooses to exaggerate physical or psychological problems rather thanmemory problems, they will go undetected. Moreover, these SVTs may bevulnerable to active efforts to avoid detection (e.g., coaching). Since they arereadily identifiable by their format and because there is considerable informationavailable about them on the internet (Bauer & McCaffrey, 2006) some malingerersmay go undetected. Finally, stand-alone SVTs require additional testing time andthey do not speak directly to the validity of performance on standard clinicalmeasures.

Embedded SVTs. Embedded or internal SVTs are derived from standardclinical tests and procedures that are used as a routine part of the psychological test

112 KEVIN W. GREVE ET AL.

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 7: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

battery rather than being an independent test. They may consist of a single speciallydeveloped test score (e.g., Reliable Digit Span [RDS], Greiffenstein, Baker, & Gola,1994), multiple scores combined in various ways (e.g., Vocabulary minus Digit Spanfrom the Wechsler Adult Intelligence Scale [WAIS], Mittenberg, Theroux-Fichera,Zielinski, & Heilbronner, 1995; Linear Shrinkage Model for the California VerbalLearning Test [CVLT], Millis & Volinsky, 2001), or simply a standard clinical scorethat has demonstrated validity in differentiating malingering from non-malingeringcases (e.g., Working Memory Index [WMI] from the WAIS-III, Etherton,Bianchini, Ciota, Heinly, & Greve, 2006).

Table 1. Sensitivity, False Positive Error Rate, Positive Predictive Power, and Likelihood

Ratio for each SVT individually and in combination using cutoffs associated with

approximately 2% and 10% false positive error rates in chronic pain patients

�2% FP cutoff Sens FP LR

PDRT (E� 21, H� 18,T� 44) 47 2 24

TOMM (T2� 48, Ret� 44) 50 2 25

WMT (IR, DR� 77.5, CNS� 70.0) 53 2 27

PDRT and/or TOMM 60 2 30

PDRT and/or WMT 64 5 32

TOMM and/or WMT 62 5 31

PDRT and TOMM 36 2 18

PDRT and WMT 36 0 36

TOMM and WMT 41 0 36

Failed any one 69 5 14

Failed any two 48 2 24

Failed all three 33 0 33

�10% FP cutoff Sens FP LR

PDRT (E� 24, H� 19,T� 44) 59 12 5

TOMM (T2, Ret5 48) 60 21 6

WMT (IR� 70.0, DR� 77.5, CNS� 72.5) 64 17 4

PDRT and/or TOMM 73 26 3

PDRT and/or WMT 76 21 4

TOMM and/or WMT 71 31 2

PDRT and TOMM 47 7 7

PDRT and WMT 47 7 7

TOMM and WMT 53 7 8

Failed any one 79 33 2

Failed any two 60 12 5

Failed all three 43 5 9

E, H, T¼Easy, Hard, and Total trials of the Portland Digit Recognition Test (PDRT);

FP¼ approximate false positive error rate for the indicated cutoff; IR, DR,

CNS¼ Immediate Recognition, Delayed Recognition, and Recognition Consistency of

the Word Memory Test (WMT); LR¼ likelihood ratio; Sens¼ sensitivity at the indicated

cutoff; T2, Ret¼Trial 2 and Retention Trial of the Test of Memory Malingering (TOMM).

113VALIDITY ASSESSMENT IN CHRONIC PAIN

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 8: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

Embedded SVTs have drawn considerable research and clinical interestbecause (1) they enhance the sensitivity of the entire malingering battery withoutrequiring extra administration time; (2) they can provide direct information aboutthe validity of performance on specific tests (Mathias, Greve, Bianchini, Houston, &Crouch, 2002; Meyers & Diep, 2000; Meyers & Volbrecht, 1998); (3) they may beless susceptible to coaching than SVTs (Ashendorf, O’Bryant, & McCaffrey, 2003;Mathias et al., 2002); and (4) they can be used to evaluate performance validity inthe absence of specialized techniques.5 The development of malingering detectiondata for standard clinical measures has become a popular enterprise in the last 15years as the content of the Boone (2007) and Larrabee (2007) texts demonstrate.

Our full psychological pain evaluation battery includes two measures ofcognitive ability: the Wechsler Adult Intelligence Scale-III (WAIS-III) and theCalifornia Verbal Learning Test-II (CVLT-II). These measures are included becausepain patients regularly complain of cognitive changes which they attribute to theirinjuries, and because cognitive status, whether altered or not, has implications forfuture vocational prospects if the patient cannot return to their job of injury. Wealso include the Finger Tapping Test (FTT) as a measure of physical capacity.Arguably this test might be considered a stand-alone SVT in this battery but it is aroutine clinical measure in neuropsychological test batteries (Rabin, Barr, &Burton, 2005), so we are including it in this section on embedded indicators.

The Wechsler Adult Intelligence Scale-III (WAIS-III; Wechsler, 1997) is themost commonly used intelligence test in North America (Rabin et al., 2005) and ithas received considerable attention as a source of evidence of malingering.Mittenberg et al. (1995) have developed two embedded indicators from WAISsubtests. The first, referred to as the Mittenberg formula, was derived usingdiscriminant function analysis. The second was theoretically derived and is simplythe difference between the Vocabulary and Digit Span subtest scores. Originallydeveloped in the WAIS-R, these two SVTs have now been tested in the WAIS-III(Greve, Bianchini, Mathias, Houston, & Crouch, 2003; Mittenberg et al., 2001).Recent research (Curtis, Greve, & Bianchini, 2009) in traumatic brain injury hasraised concerns about the ability of these SVTs to accurately detect malingering.They have not been tested in chronic pain. In contrast, the Reliable Digit Span test(Greiffenstein et al., 1994) has proven effective in detecting malingering in a varietyof patient populations including chronic pain. The Working Memory Index andPossessing Speed Index, though not as well studied, have also proven useful.

Reliable Digit Span (RDS) is calculated by summing the longest forward andbackward digit strings (both trials must be completed without error) from the DigitSpan test. RDS and Digit Span (from which it is derived) are generally preserved inpersons with brain dysfunction (Greiffenstein et al., 1994; Heinly, Greve, Bianchini,Love, Brennan, 2005; Iverson & Franzen, 1994, 1996; Iverson & Tulsky, 2003)including those with amnestic disorders (Butters & Cermak, 1980). RDS may be oneof the most-well-validated clinical malingering indicators, with more than a dozen

5Being able to apply embedded indicators retrospectively is very helpful in research projects using

archival data in which stand-alone SVTs may not have been used. See Larrabee, Millis, and Meyers

(2008) for a good example of this application. Use of embedded indicators may also be helpful when

examining data from a forensic evaluation when stand-alone SVTs were not used or were inadequate.

114 KEVIN W. GREVE ET AL.

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 9: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

validation studies in the published literature (for a review of many of these studies,see Suhr & Barrash, 2007). Overall, existing research demonstrates that RDS is asensitive and specific indicator of intentional under-performance and malingering ina variety of patient populations including traumatic brain injury (TBI),toxic exposure, and chronic pain (for comparisons of RDS in these groups seeGreve et al., 2007).

In pain samples RDS differentiates non-malingering pain patients from thosewho meet the Bianchini et al. (2005) criteria for MPRD. However, in contrast tosome other measures of performance validity, RDS is relatively coarse because thereare so few score levels, the accuracy rate jumps substantially from score to score.For example, the LR goes from 1.90 to 3.29 to 21.4 at cutoffs of 5¼8, 5¼7, and5¼6, respectively. In patients with pain and most other populations of interest(e.g., TBI, toxic exposure), cutoffs greater than 7 are too non-specific and those lessthan 6 are unnecessarily insensitive. We have typically argued against identifyingany specific cutoff for a test. However, in this case recommending a cutoff of RDS5¼6 as evidence of intentional under-performance seems justified and is consistentwith previous findings in patients with pain (Meyers & Volbrecht, 2003).

RDS is unaffected by chronic pain, associated clinical litigation-relatedfactors, and all but the most severe brain trauma. However, in patients with seriousobjective neuropathology (e.g., stroke and dementia) scores of 4 or even 3 may benecessary to raise concerns about performance validity (Heinly et al., 2005). WhileRDS of 7 may raise concern about performance validity, the accumulated researchindicates that RDS5¼6 is most consistently associated with FP rates less than 10%in populations commonly seen in a medico-legal context.

Processing Speed Index (PSI) and Working Memory Index (WMI) derivedfrom the WAIS-III have proven effective in detecting under-performance in patientswith pain. The PSI of the WAIS-III consists of two subtests, Digit Symbol andSymbol Search, which involve processing visual information and responding undertime constraint. The WMI taps both simple and complex attention processes andmeasures an individual’s ability to retain and manipulate information mentallythrough tasks involving attention to increasing amounts of information or attendingto more than one task at a time. The WMI is a composite of performance on threeWAIS-III subtests: Arithmetic, Letter-Number Sequencing, and Digit Span tests.

Etherton and colleagues (Etherton, Bianchini, Ciota, et al., 2006; Etherton,Bianchini, Heinly, & Greve, 2006) studied the effect of pain-related disability onWAIS WMI and PSI scores and found that these two indices were capable ofaccurately detecting MPRD. They also reported FP rate data for moderate-severeTBI and dementia patients. PSI scores of less than 75 detect over 60% of MPRDcases with FP rates of less than 10%. However, PSI is affected by objective severebrain dysfunction associated with TBI and dementia. WMI scores in the same rangedetect slightly fewer MPRD cases but are also less sensitive to genuine brainpathology. These findings have now been replicated in concussion patients and inpatients with stroke (Curtis et al., 2009).

The WAIS-IV (Wechsler, 2008) included embedded indicators as part of itsdevelopment. As yet, the primary research focus has been on RDS (Jasinski, Berry,Shandera, & Clark, 2011; Reese, Suhr, & Riddle, 2012; Young, Sawyer, Roper, &Baughman, 2012). Young et al. (2012) and Reese et al. (2012) both demonstrated

115VALIDITY ASSESSMENT IN CHRONIC PAIN

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 10: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

that RDS scores of 6 or less from the WAIS were associated with acceptablespecificity replicated the FP results from the WAIS-3 and WAIS-R. The meta-analysis of Jasinski et al. (2011) supports that conclusion. Although the validationof RDS in the WAIS-IV is limited in its direct application to patient with chronicpain, the fact that the WAIS-IV-derived RDS has similar accuracy to RDS from theWAIS-R and WAIS-3 means that RDS validation data from those tests canreasonably be applied to RDS derived from the latest WAIS revision.

The California Verbal Learning Test (CVLT; Delis, Kramer, Kaplan, & Ober,1987) is among the most frequently used methods for evaluating learning andmemory in North America (Rabin et al., 2005). The detection of malingering withthe CVLT has been approached in two ways. The first method uses the raw testscores individually (e.g., Total Correct from Trials 1 through 5 [Total 1–5] andRecognition Hits, see Ashendorf et al., 2003; Millis, Putnam, Adams, & Ricker,1995; Sweet et al., 2000). The second approach involves the development ofempirically derived composites of the original raw scores. Millis and colleagues(Martens, Donders, & Millis, 2001; Millis & Putnam, 1997; Millis et al., 1995; Millis& Volinsky, 2001). Curtis, Greve, Bianchini, and Brennan (2006) recommended theLinear Shrinkage Model (Millis & Volinsky, 2001) and Recognition Hits for usebased on a number of considerations. Recognition Hits is produced by the scoringprogram and easily calculated by hand. However, as a single score it may bevulnerable to coaching. The derivation of the Linear Shrinkage Model is morecomplex, but its complexity may make it more resistant to coaching. Both were ofcomparable accuracy in detecting malingering.

A revision of the CVLT (CVLT-II) became available in 2000 (Delis, Kramer,Kaplan, & Ober, 2000). Greve, Curtis, Bianchini, & Ord (2009) examined theequivalence of the CVLT-1 and CVLT-2 in the detection of malingering in TBI andchronic pain. This is the only study to date involving pain patients. The two CVLTversions were roughly equivalent in TBI. However, pain patients generallyperformed more poorly on the CVLT-II and the differences necessitated the useof different normative data. These findings mean that at the same cutoffs,malingering indicators on the CVLT-2 will be associated with a higher rate of FPerrors than on the CVLT-1 in chronic pain. Table 2 presents classification accuracydata for Recognition Hits and Linear Shrinkage Model in patients with pain.

As can be seen, the difference between the two versions was most pronouncedwhen cutoffs associated with lower FP rates were examined. CVLT-1 cutoffsassociated with FP error rates of approximately 10% (a conservative upper bound)always resulted in CVLT-2 FP error rates of 15% or more, even in TBI. In the TBIpatients cutoffs associated with a 5% FP error rate in the CVLT-1 resulted insimilar FP rates in the CVLT-2. In the pain sample Recognition Hit accuracy wascomparable but the cutoffs for the Linear Shrinkage score needed to be adjustedupward to maintain a comparable FP rate. These findings mean that CVLT-1decision rules can be safely applied to the CVLT-2 in patients with TBI claims. Inpatients with chronic pain only the decision rules for Recognition Hits can bedirectly applied.

Overall, these data suggest that, in pain, the Linear Shrinkage Model is moreaccurate than Recognition Hits. The classification accuracy of these indicators inthe CVLT-2 is comparable to that of the CVLT-1, but the cutoffs are different. The

116 KEVIN W. GREVE ET AL.

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 11: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

norms are not interchangeable. Moreover, genuine neurocognitive impairment caninfluence both indicators so one must take care to use the most appropriatenormative comparisons for a given case. A number of new validity indicatorsderived from the CVLT-2 have been reported and studied (Wolfe et al., 2010) buthave yet to be applied to chronic pain samples.

A variety of motor tasks including grip strength measurements, peg boardtasks, and finger tapping have studied in the context of malingering. Only a limitedamount of research on the accuracy of motor tasks for the detection of malingeringin pain patients has been done. Greiffenstein (2007) and Arnold and Boone (2007)have reviewed many of these techniques. More research has been done outside ofpsychology/neuropsychology although this research is not as conceptually orstatistically sophisticated (for an illustration of the breadth and problems in thisliterature, see Fishbain, Cutler, Rosomoff, & Rosomoff, 1999; Lechner, Bradbury,& Bradley, 1998). For example, many of these studies are limited in that they usesimulator designs rather than clinical malingering samples, report only groupstatistics (e.g., mean comparisons), and do not describe individual classificationaccuracy. When classification accuracy is reported there has often been a misplacedemphasis on sensitivity, which has led to rejection of techniques that mightotherwise be of value. Validation of physical capacity measures for the detection ofmalingering is currently under way in our laboratory and in Europe (J. Kool,personal communication, January 28, 2009).

Table 2. Cumulative percentages of pain patients with scores equal to or lower than the indicated

score Recognition Hits and Linear Shrinkage Model on the California Verbal Learning Test (original

and 2nd eds.)

Recognition Hits Linear Shrinkage Model

CVLT-1 CVLT-2 CVLT-1 CVLT-2

Not

Mal Mal LR

Not

Mal Mal LR

Not

Mal Mal LR

Not

Mal Mal LR

0 0 0

1 2 2

2 2 2 11

3 0 2 – 2 10 0

4 1 8 8 2 9 2

5 2 9 4.5 5 8 0 2 – 0

6 2 11 5.5 15 7 1 4 4 2

7 2 13 6.5 0 15 – 6 1 6 6 2

8 3 17 5.7 3 24 8 5 1 9 9 0 10 –

9 3 21 7 5 37 7.4 4 2 11 5.5 3 17 5.7

10 3 25 8.3 11 44 4 3 3 13 4.3 3 27 9

11 6 34 5.7 18 56 3.1 2 3 19 6.3 5 42 8.4

12 10 49 4.9 32 63 1.9 1 5 34 6.8 18 54 3

13 23 60 2.6 47 81 1.7 0 10 45 4.5 34 81 2.4

14 40 76 1.9 63 85 1.3 �1 26 62 2.4 58 90 1.6

15 59 85 1.4 90 95 1.1 �2 66 83 1.3 84 100 1.2

16 100 100 1 100 100 1 �3 91 100 1.1 95 – –

CVLT¼California Verbal Learning Test; LR¼Likelihood Ratio; Mal¼malingering.

117VALIDITY ASSESSMENT IN CHRONIC PAIN

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 12: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

Grip strength measured with the JAMAR hand dynamometer has beenstudied extensively and appears to be a good indicator of poor effort (see Greve &Bianchini, 2012). Robinson, Geisser, Hanson, and O’Connor (1993), demonstratedsensitivity of 39% in simulators. Unfortunately the more important false-positiveerror rate was not reported so the value of the reported sensitivity level could not bedetermined. In contrast, Smith, Nelson, Sadoff, and Sadoff (1989) and Chengalur,Smith, Nelson, and Sadoff (1990) established cutoffs based on specificity: ‘‘thecriterion values for the sincere and faking categorization, were determined to be atthe 95% levels of the sincere trials of the injured and non-injured hands’’ (Chengaluret al., 1990, p. 151, italics added). They reported sensitivity ranging from 23% and85% depending on the particular variable or combination of variables and thegender of the participant, even when the false-positive error rate was held to 5%.When classification accuracy is calculated from reported group statistics, sensitivityvaried widely as a function of the variable examined, but it was as high as 60% forsome variables as cutoffs associated with 100% specificity.

Larrabee (2003a) found that a combined bilateral finger tapping raw score ofless than 63 detected 40% of malingering personal injury claimants compared toonly 6.5% of non-malingering patients. Arnold et al. (2005) reported a variety offinger tapping scores separately for men and women in patients with a variety ofconditions. At Larrabee’s cutoff, 17% of non-malingering women and 8% of non-malingering men failed. However, sensitivity was similar, ranging from 30% to50%. Shifting the cutoff for women to 58 resulted in an FP rate of 9% whilemaintaining sensitivity at 50%. Curtis, Brasseux, Greve, and Bianchini (2008) gotcomparable results in TBI claimants. Aguerrevere, Guise, Greve, and Bianchini(2009) studied tapping in patients with pain. Among the men scores of 80 or lesswere associated with FP rates of 10% and sensitivity of 68%. Using Larrabee’scutoff, the FP rate was 3% and sensitivity was 58%. In short, finger tapping datafrom criterion groups research are consistent in their ability to detect malingering.

These findings indicate that finger tapping is an accurate indicator of pooreffort and malingering. Of course, it is important to consider objective physicalpathology that might affect finger tapping such as conditions and pathologyinvolving the motor regions of the brain or the peripheral nerves (e.g., cervicalcompressions, carpal tunnel syndrome) as well as orthopedic or rheumatoidconditions affecting the upper extremities. With the exception of Larrabee’s study,finger tapping diagnostic accuracy data are available for dominant and non-dominant hands separately. Gender is also accounted for in those studies. Insummary, finger tapping is a valuable addition to the SVT battery not only becauseit accurately detects malingering but because it is one of the only measures ofphysical capacity validated for this purpose.

Assessing self-report validity. Measures of self-report validity have beendeveloped to assess exaggeration of emotional/psychological, somatic/health-related, and cognitive symptoms and complaints. These may take the form ofstand-alone scales and validity scales that are components of omnibus personalityor health scales. The focus in this section is on two major personality inventories(i.e., the Millon Multiaxial Clinical Inventory-III and the Minnesota MultiphasicPersonality Inventory) and two brief screens (the Modified Somatic Perception

118 KEVIN W. GREVE ET AL.

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 13: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

Questionnaire and the Pain Disability Inventory) because these have been subjected

to criterion groups validation in patients with pain.The Minnesota Multiphasic Personality Inventory (MMPI and MMPI-2;

Butcher, Dahlstrom, Graham, Tellegen, & Kaemmer, 1989) is one of the most

widely used clinical psychological assessment instruments (Lubin, Larsen, &

Matarazzo, 1984; Rabin et al., 2005). The MMPI includes numerous scales

designed to detect and quantify over- and under-reporting of symptoms. The

effectiveness of MMPI validity scales in detecting exaggeration and/or malingering

of symptoms in a range of conditions has been examined in a number of previous

studies (for reviews, see in particular Lees-Haley, Iverson, Lange, Fox, & Allen,

2003; Rogers, Sewell, Martin, & Vitacco, 2003).The MMPI has been used with pain patients for decades (Deardorff, Chino, &

Scott, 1993; Gatchel, Polatin, & Kinney, 1995; Keller & Butcher, 1991; Robinson,

Swimmer, & Rallof, 1989; Slesinger, Archer, & Duane, 2002). Only recently has its

ability to detect malingered pain-related disability been assessed (Bianchini,

Etherton, Greve, Heinly, & Meyers, 2008). The following paragraphs briefly

review the findings of four sets of MMPI scales (F-family, Symptom Validity Scale,

Meyers Index, Hypochondriasis & Hysteria) relative to the detection of malingering

in general and patients with pain in particular.The MMPI-2F-Family validity scales include F (Infrequency), Fb (Back

Infrequency), and Fp (Infrequency Psychopathology) and detect exaggeration using

a ‘‘rare-symptoms’’ (or infrequency) strategy. The general features of F and Fb are

well-known. Fp, however, is a newer scale (Arbisi & Ben-Porath, 1995) designed to

detect intentional exaggeration of psychiatric symptoms in a way that is not affected

by general psychological distress the way F and Fb are. Thus Fp contains only items

rarely endorsed by known psychiatric patients. In criterion-groups studies

(Bianchini et al., 2008; Greve, Bianchini, Love, Brennan, & Heinly, 2006;

Larrabee, 2003b; McCusker, Moran, Serfass, & Peterson, 2003), Fp is more

sensitive to malingering in settings in which severe psychological disturbances may

be more valuable (psychiatric forensic settings) than in TBI and chronic pain

samples where cognitive and/or physical symptoms may be more valuable. Overall,

for Fp, critical T-scores cutoffs associated with FP rates of 10% are in the range of

65 to 70 while T-scores of 70 to 75 are required to reach recommended conservative

FP levels in patients with TBI and pain. See Table 3 for detailed classification

accuracy data.The MMPI-2 Symptom Validity Scale (FBS; Lees-Haley, English, & Glenn,

1991) was developed based on observations that many personal injury malingerers

present with a mixed fake-good/fake-bad presentation. In August 2006 FBS was

added to the standard Pearson MMPI-2 materials. FBS has been studied extensively

and there are published specificity and sensitivity data on the FBS that have been

derived from methodologically sound studies. Most of the literature on the validity

of FBS has been reviewed in three papers: Ben-Porath, Greve, Bianchini, and

Kaufmann (2009); Greiffenstein, Fox, and Lees-Haley (2007); and Nelson, Sweet,

and Demakis (2006). A full discussion of all aspects of FBS is beyond the scope of

this paper; the reader is referred to those papers for detailed reviews of the FBS

literature.

119VALIDITY ASSESSMENT IN CHRONIC PAIN

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 14: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

Regarding the use of FBS in the detection of malingering of patients with pain,Bianchini et al. (2008) found that FBS scores of 28 or higher were associated withan FP error rate of 5% while detecting almost 70% of definitivelydiagnosed malingerers. Overall, existing research indicates that FBS is generallynot elevated (i.e., 428) in patients without external incentive regardless of theseverity of their injury/illness and related disability. In TBI, FBS scores tend to be

Table 3. Cumulative percentages of patients with scores above the indicated t-score for MMPI-2

F-Family validity scales (F, Fb, Fp)

Clinical pain patients Published no incTBI

No

Inc

Inc

Only

All Not

MPRD

Def

MPRD Sim TBI1 Pain2 Clin1Def

MND1

N 23 34 57 32 26 18 100 133 14

F

490 41 35 0 1 21

485 41 42 1 1 21

480 0 0 50 46 1 3 29

475 0 3 2 50 50 6 5 43

470 13 6 9 66 58 13 7 64

465 13 15 14 72 65 0 18 11 71

460 35 24 28 81 77 11 30 23 79

Fb 132

490 50 35 – 2 50

485 0 0 53 35 – 3 57

480 3 2 59 39 – 4 64

475 3 2 59 46 0 – 5 71

470 0 9 7 69 54 6 – 12 71

465 4 12 12 78 62 6 – 16 71

460 4 27 18 81 65 38 – 23 79

455 13 35 26 84 73 33 – 32 79

Fp 130

485 0 0 19 15 2 5 21

480 4 2 25 23 3 5 21

475 4 2 31 35 3 5 21

470 4 0 2 34 35 6 9 29

465 4 6 5 38 54 0 8 9 29

460 9 12 11 53 58 11 14 18 50

450 30 29 30 72 81 39 38 35 57

The tabled value is the percentage of patients performing ABOVE (more extreme than) the given score.

For non-malingering patients this is the False Positive error rate (complement of specificity); for

malingering-patients this is the Sensitivity rate.

DEF¼Definite MPRD or MND; Inc Only¼ Incentive-Only; MPRD¼Malingered Pain-Related

Disability; No Inc¼No-Incentive; MND¼Malingered Neurocognitive Dysfunction; Clin¼mixed

clinical sample; TBI¼ traumatic brain injury.1From Greve et al. (2006).2From Meyers et al. (2002).

120 KEVIN W. GREVE ET AL.

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 15: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

higher in patients with less severe injuries. Similarly, pain patients diagnosed asdefinitively malingering had less objective physical pathology than non-malingeringpain patients and scored higher on FBS. In contrast, the probability of having apositive FBS score increases with the strength of independent evidence ofmalingering. See Table 4 for detailed classification accuracy data.

TheMeyers validity scale composite index (Meyers, Millis, & Volkert, 2002) forthe MMPI-2 was designed to assess exaggeration of self-reported psychologicalsymptoms by weighting and integrating seven individual MMPI-2 validity scales.Each scale score is weighted 0, 1, or 2 depending on how strongly the score indicatesexaggeration. The Meyers Index was originally developed comparing chronic painpatients with external incentive to exaggerate symptoms, chronic pain patientswithout incentive, and medically sophisticated but uninjured simulators (e.g.,nurses).

An abbreviated version of this index which uses only the scores produced bythe standard Pearson print-out has also been validated (Aguerrevere, Greve,Bianchini, & Meyers, 2008). This paper reports the included scales and score weightsfor both versions as well as accuracy data from several published studies. Themaximum score is 14 for the original Meyers Index and 10 for the abbreviated

Table 4. Cumulative percentages of patients with scores above the indicated raw score for MMPI-2

Scale FBS (Symptom Validity Scale)

Clinical pain patients No-incentiveTBI

No

Inc

Inc

Only

All Not

MPRD

Def

MPRD Sim TBI1 Pain2 Clin1Def

MND1

N 23 34 57 32 26 18 100 132 14

FBS

�34 22 23 14

�33 0 0 31 31 21

�32 3 2 34 31 36

�31 3 2 44 39 50

�30 3 2 53 50 0 64

�29 3 2 62 62 0 2 0 71

�28 9 5 69 65 11 9 1 71

�27 12 7 72 69 17 11 2 79

�26 0 12 7 81 69 17 16 4 86

�25 4 21 14 91 73 17 16 6 93

�24 4 32 21 94 77 22 22 8 93

�23 13 38 28 94 81 22 28 9 93

�22 17 47 35 94 89 22 34 14 93

�21 26 53 42 97 89 22 43 19 100

The tabled value is the percentage of patients performing ABOVE (more extreme than) the given score.

For non-malingering patients this is the False Positive error rate (complement of specificity); for

malingering-patients this is the Sensitivity rate.

DEF¼Definite MPRD or MND; Inc Only¼ Incentive-Only; MPRD¼Malingered Pain-Related

Disability; No Inc¼No-Incentive; MND¼Malingered Neurocognitive Dysfunction; Clin¼mixed

clinical sample; TBI¼ traumatic brain injury.1From Greve et al. (2006).2From Meyers et al. (2002).

121VALIDITY ASSESSMENT IN CHRONIC PAIN

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 16: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

version. Raw scores above 5 or 6 for the original and 3 or 4 for the abbreviated

Meyers Index are strong indications that symptoms are being intentionally

exaggerated regardless of the alleged cause of the claimed disability. About half

of malingerers are detected at these levels. See Table 5 for detailed classificationaccuracy data.

Hypochondriasis (Hs), Hysteria (Hy) Scales are indicators of somatization

(Friedman, Glesser, Smeltzer, Wakefield, & Schwartz, 1983;Marks & Seeman, 1963),

Table 5. Cumulative frequencies for the four TBI groups, the three chronic pain group, the two student

simulator group by Meyer Index score

Traumatic brain injury Chronic pain

No

Inc

Clin

No

Inc

Inc

Only

All Not

MND

Prob

MND

Def

MND

All

MND

No

Inc1Inc

Only

All Not

MPRD

Def

MPRD Sim2

N 18 73 91 80 14 94 123 34 157 32 56 129

Original Meyers Index

14 1 7 2 6 5

�13 4 7 4 9 16

�12 11 14 12 16 23

�11 13 21 14 31 35

�10 14 21 15 38 50

�9 0 0 15 43 19 38 59 0

�8 1 1 24 43 27 47 61 2

�7 1 1 29 43 31 0 0 47 63 2

�6 3 2 38 50 39 3 2 47 70 3

�5 7 6 44 71 48 0 3 2 63 74 5

�4 11 9 53 86 55 2 12 8 75 74 6

�3 0 25 20 61 86 65 6 18 13 81 77 9

�2 11 33 29 66 93 69 15 32 24 88 84 15

�1 28 52 47 79 93 81 37 50 45 97 90 29

�0 100 100 100 100 100 100 100 100 100 100 100 100

Abbreviated Meyers Index

10 3 7 3 6 9

�9 8 21 10 9 23

�8 13 21 14 31 39

�7 0 0 14 21 15 38 47 0

�6 1 1 21 36 23 47 55 1

�5 4 3 34 43 35 0 0 47 61 2

�4 4 3 44 57 46 0 6 4 66 72 4

�3 0 14 11 55 86 60 2 6 4 78 75 7

�2 6 27 23 65 86 68 11 23 17 88 82 11

�1 28 51 46 78 93 80 31 49 42 97 87 26

�0 100 100 100 100 100 100 100 100 100 100 100 100

Inc¼ Incentive; Prob¼Probable; Def¼Definite; Sim¼Simulators; Clin¼Clinical;

MND¼Malingering Neurocognitive Dysfunction; MPRD¼Malingering Pain-Related Dysfunction.1combined No-Inc chronic pain groups of Bianchini et al. (2008; n¼ 23) and Meyers et al., (2002;

n¼ 100).2combined simulators of Bianchini et al (2008; n¼ 26) and Meyers et al., 2002; n¼ 30).

122 KEVIN W. GREVE ET AL.

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 17: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

the tendency to express emotional distress in terms of physical symptoms and/ordevelop physical symptoms in response to psychosocial stressors. Hs and Hy indicatethe presence of greater diversity of physical symptoms than can be explained bygenuine injury and illness alone. Elevations on these scales do not necessarily indicateintentional exaggeration, but non-malingering patients seen in a compensation-seeking setting were three to four times more likely to have T-scores greater than 80on these scales compared to persons not seeking compensation (Bianchini et al.,2008). Persons independently identified as malingering pain-related disability scoredeven higher on these two scales. Malingerers and Simulators scored about 10 pointshigher than these compensation-seeking non-malingering patients and were almostfour times more likely to score 90 or higher. When a certain score level is observed inpatients who are malingering, but rarely or never in patients who are physicallyinjured and/or experiencing the interpersonal circumstances, psychosocial forces, andstresses of litigation, this argues that scores at that level and higher are the result ofintentional exaggeration and not the psychosocial processes associated with injury inthe context of litigation, including somatization (Bianchini et al., 2008). See Table 6for detailed classification accuracy data.

The MMPI-2 Restructured Form (MMPI-2-RF; Ben-Porath & Tellegen,2008) includes updated version of older validity scales (e.g., F, FBS) as well as theintroduction of a number of newer scales (Infrequent Somatic Responses [Fs],Response Bias Scale [RBS]). See Ben-Porath (2012) for a complete review of thisrevised and updated MMPI. Fs may be of particularly value in the chronic painpopulation. Fs may be particularly valuable in the chronic pain population. Thisscale consists of 16 items rarely endorsed by medical patients and was designedspecifically to detect over-reporting of somatic symptoms (Ben-Porath & Tellegen,2008). Using a simulator design, Sellbom, Wygant, and Bagby (2012) examined theability of these scales to differentiate feigned somatic symptoms from genuinemedical illness and somatoform disorder. This study found that the MMPI-2-RFvalidity scales (particularly Fp-r and Fs) readily discriminated college studentsasked to simulate physical health symptoms from patients with genuine medicalillness but were less accurate in differentiate the simulators from persons withsomatoform disorder. In a criterion-groups validation study using neuropsycholo-gical (brain injury) samples (not chronic pain), sensitivity ranged from 48% forFBS-r down to 10% (K-r) when the false positive error rate was held to 10%(Schroeder et al., 2012).

The Millon Multiaxial Clinical Inventory-III (MCMI-III) is the latest versionof the Millon Clinical Multiaxial Personality Inventories (Millon, 1994). TheMCMI-III contains scales intended to assess selected Axis I and II disorders fromthe fourth edition of the Diagnostic and Statistical Manual of Mental Disorders(DSM-IV; American Psychiatric Association, 2000). To determine response bias theMCMI inventories include three modifier indices; Disclosure, which assesses howmuch information the patient is revealing on the test; Desirability, which assessesunder-reporting of psychiatric symptoms; and Debasement, which assesses over-reporting of psychological and somatic problems (Millon, Davis & Millon, 1997).

Research on the detection of malingering has generally focused on traumaticbrain injury and psychological distress; both the MCMI-II and MCMI-III havebeen studied. The MCMI-II modifier indices appear to be reliable indicators of

123VALIDITY ASSESSMENT IN CHRONIC PAIN

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 18: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

symptom exaggeration (Bagby, Gillis, Toner, & Goldberg, 1991; Grossman &Craig, 1995; Lees-Haley, 1992; Retzlaff, Sheehan, & Fiel, 1991). For example, Lees-Haley (1992) demonstrated that Disclosure and Debasement differentiated pseudo-Post Traumatic Stress Disorder (pseudo-PTSD) patients from those with genuinepsychological impairment. At scores of 60 or higher, both scales had FP rates of 3%with sensitivity of 73% for Disclosure 87% for Debasement.

Using the MCMI-III modifier indices, Daubert and Metzler (2000) found thatcutoffs at FP rates ranging from 11% to 19% in adult psychiatric outpatientsdetected 55% to 74% of non-clinical simulators. Schoenberg, Dorr, and Morgan(2003) examined hospitalized psychiatric patients and student simulators. Theyreported FP rates ranging from 12% to 34% and sensitivity ranged from 35% to59%. These findings reflect the higher modifier index scores of psychiatric inpatients.

Table 6. Cumulative percentages of patients with scores below the indicated t-score for MMPI-2

Hypochondriasis (Hs) and Hysteria (Hy) scales

Clinical pain patients No-incentiveTBI

No

Inc

Inc

Only

All Not

MPRD

DefMPR

D Sim TBI1 Pain2 Clin1Def

MND1

N 23 34 57 32 26 18 100 133 14

Hs

4105 0 0

4100 0 0 6 0 0 7

495 4 0 2 22 15 1 0 7

490 4 9 7 44 39 0 3 1 29

485 4 18 12 59 46 6 8 2 57

480 9 29 21 78 62 6 14 4 79

475 17 47 35 84 77 17 25 11 93

470 35 65 49 94 85 22 36 21 93

Hy

4105 3 2 9 8 0 29

4100 6 4 19 15 2 29

495 6 4 25 26 0 4 0 43

490 12 7 47 46 6 4 2 43

485 0 27 11 59 62 6 13 4 50

480 4 41 26 72 77 6 18 10 71

475 9 53 35 84 81 17 28 17 86

470 22 59 44 91 85 33 42 26 93

The tabled value is the percentage of patients performing ABOVE (more extreme than) the given score.

For non-malingering patients this is the False Positive error rate (complement of specificity); for

malingering-patients this is the Sensitivity rate.

DEF¼Definite MPRD or MND; Inc Only¼ Incentive-Only; MPRD¼Malingered Pain-Related

Disability; No Inc¼No-Incentive; MPRD¼Malingered Pain Related Disability. MND¼Malingered

Neurocognitive Dysfunction; Clin¼mixed clinical sample.1From the MMPI-2s of patients reported in Greve et al. (2006).2From the MMPI-2s of the patients reported in Meyers et al. (2002)

124 KEVIN W. GREVE ET AL.

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 19: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

Aguerrevere, Greve, Bianchini, and Ord (2011) studied the MCMI-IIImodified indices in traumatic brain injury patients divided into malingering andnon-malingering groups. All three scales accurately differentiated malingering fromnon-malingering TBI patients. Debasement was the most accurate index, detecting55% of malingerers at the cutoff associated with 4% FP rate (BR470) and 51% ofmalingerers at the cutoff associated with no FP errors (BR� 75). When indices wereexamined together at cutoffs associated with an FP rate of approximately 5%,sensitivity increased to 68% but the joint FP rate also increased to 12%. These datasupport the use of the MCMI-III in TBI. However, as with the Schoenberg et al.(2003) study, the MCMI-III was unable to differentiate malingering TBI patientsfrom psychiatric inpatients.

In our chronic pain sample (Greve, Aguerrevere, Bianchini, Etherton, &Heinly, 2009) FP rates were higher than at the same cutoffs in TBI patients. ForDebasement the cutoff had to be increased to 85 to achieve an FP rate comparableto that seen at 70 in the TBI patients. At cutoffs associated with FP rates of 10% orless, LRs were in the 3 to 4 range. Given the high rate of psychopathology inpatients with chronic pain, one has to wonder about the pre-morbid and co-morbidpsychiatric influences in even non-malingering chronic pain patients. It appears thatthe MCMI-III modifier indices need to be used with caution in patients with chronicpain, particularly if they have a history of serious psychiatric illness.

Numerous questionnaires assessing pain-related problems and health concernare in use and some have built in validity scales (e.g., Battery for HealthImprovement-2 [BHI]; Bruns & Disorbio, 2003) or have cutoffs related to symptomexaggeration (e.g., Dallas Pain Questionnaire; Lawlis, Cuencas, Selby, & McCoy,1989). However, there is presently no research demonstrating the ability of most ofthese measures to differentiate between the exaggeration of symptoms associatedwith somatization (Gatchel, 2004) and malingering. Thus scores on these measuresmay be capable of determining whether psychosocial factors are influencing/complicating the clinical pain presentation but they cannot define the mechanismmore clearly.

However, Larrabee (2003c) has specifically addressed the exaggeration of paincomplaints by probable malingerers. He reported cut-offs, sensitivity, and predictivepower associated with FP rates of 10%, 5%, and 1% for the McGill PainQuestionnaire (MPQ; Melzak, 1975), the Pain Disability Index (PDI; Tait, Chibnall,& Krause, 1990), and the Modified Somatic Perception Questionnaire (MSPQ;Main, 1983). The patients in the malingering group met the Slick et al. (1999)criteria for Probable MND and none had objective medical findings that couldaccount for their pain complaints. The MPQ was unable to accurately identifymalingering at acceptable FP levels. The PDI performed better, identifying 38% ofmalingerers at the cutoff associated with the 5% FP rate (4¼57). The MSPQ wasextremely accurate, identifying 69% of the MND cases with only a 1% FP rate(4¼14). The relatively small sample size of this study and the fact that injurycharacteristics were not reported limits its usefulness with patients with knownphysical injuries.

Bianchini et al. (2012) examined the diagnostic accuracy of the MSPQ and PDIin 51 not-MPRD pain patients and 105 patients classified as MPRD. Studentsimulators (n¼ 36) were included for comparison. The not-MPRD group scored

125VALIDITY ASSESSMENT IN CHRONIC PAIN

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 20: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

lower than the allMPRDand Simulator groups, which did not differ from each other.Sensitivity at particular cutoffs was similar to that reported by Larrabee (2003c)but FP rates at those cutoffs were substantially higher. In this study, at the cutoffsassociated with a 10% FP rate, sensitivity was 48% for the MSPQ (4¼16) and38% for the PDI (4¼59). The higher cutoffs in patients seen specifically forpersisting pain reflect the high rate of somatization in the chronic pain populationand the need to differentiate exaggeration due to malingering from that related tosomatization.

Combined improbability

The preceding section demonstrated that it is possible to accuratelydifferentiate malingering from non-malingering chronic pain cases using SVTs.Also evident is the fact that no single SVT is perfectly accurate. In general,sensitivity ranges from about 40% (sometimes lower) to 60% (sometimes higher) atreasonable FP levels (e.g., 5–10%). Thus the reliance on a single SVT will result insome malingering cases going undetected. At the same time, single SVT failures arecommon, particularly when there is severe pathology (Dean, Victor, Boone, &Arnold, 2008; Meyers & Volbrecht, 2003). In the context of external incentive, nosingle failure is sufficient to meet published criteria for malingering.6

It has been the convention in the study of cognitive malingering to require twopositive SVT findings or inconsistencies for a diagnosis of malingering(Greiffenstein et al., 1994; Greiffenstein, Gola, & Baker, 1995; Millis, Ross, &Ricker, 1998; Mittenberg et al., 2001; Slick et al., 1999). Larrabee (2008) andReynolds (1998) have made similar points about the relevance of increasingevidence of inconsistency for the determination that disability is being malingered.Thus SVTs must be used in combination to reliably detect malingering. This is notmerely a conceptual or theoretical position, but one which has been justified onstatistical grounds (Boone & Lu, 2003; Larrabee, 2003a) and which is supported byempirical data compiled over the past decade demonstrated empirically.

Meyers and Volbrecht (2003) studied nine different (mostly embedded)validity indicators in a diverse sample of 796 patients. Only 5% of the 399 non-litigating patients failed more than two validity checks compared to 83% ofgraduate students asked to simulate brain damage. Of the litigating patients 30%failed more than two checks, a finding consistent with estimates of the prevalence ofmalingering in litigating brain injured samples (Larrabee, 2005). Similar results werereported by Vickery et al. (2004) who randomly assigned non-litigating persons witha history of moderate-severe TBI and community volunteers to either a control orsimulator test condition (n¼ 23 per group). Simulators were given a monetarybonus if they successfully avoided detection. Each participant was administeredthree stand-alone SVTs. Failure on any one SVT was associated with an FP rate of6.5% and sensitivity of 89.1% (Likelihood ratio¼ 13.7).

6 The exception is the case of a significantly below chance performance on a forced-choice SVT, a finding

which is considered definitive evidence of intentional under-performance (Frederick & Speed, 2007;

Reynolds, 1998) and which along with the presence of external incentive meets criteria for a diagnosis of

malingering (Bianchini et al., 2005; Slick et al., 1999).

126 KEVIN W. GREVE ET AL.

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 21: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

Larrabee (2003a) studied the individual and combined accuracy of fiveindicators (four embedded indicators, one self-report indicator) in non-malingeringpatients (n¼ 54) and clinically diagnosed malingering cases (n¼ 41) in two studies.The average FP rate across all indicators was 9.3% and sensitivity was 53.6%.However, failure on any two or more indicators results in a decrease in the FP rateto 5.6% and an increase in sensitivity to 87.8%. In a similar study using patientreferrals defined as presenting with ‘‘credible’’ (n¼ 66) or ‘‘noncredible’’ (n¼ 37)cognitive deficits, Victor et al. (2009) studied the accuracy of four embeddedindicators. The average FP rate across all indicators was 14.4% and sensitivity was64.4%. However, as with Larrabee’s (2003a) study, when a failure was defined astwo or more positive SVT results, the FP rate dropped to 5.9% while sensitivityincreased to 87.5%. Chafetz (2011) similarly demonstrated that in Social SecurityDisability evaluations, the FP error rate decreased with increasing numbers of failedSVTs and with three or more failed SVTs the FP error rate was essentially 0.

These studies demonstrate that combining indicators sensitive to malingeringof different aspects of disability results in FP rates that are improved over those ofthe individual indicators and increased sensitivity. These studies support the idea ofusing multiple SVTs in the assessment of malingering and requiring two or morepositive findings as a minimum criterion for to diagnose malingering. However, thepreceding studies have focused on patients with or claiming brain dysfunction. Dothe findings hold in patients with pain? There is limited research in this regard.Meyers and Diep (2000) examined the effectiveness of six indicators in litigating(n¼ 55) and non-litigating (n¼ 53) patients claiming pain-related cognitive deficits.As with the Meyers and Volbrecht (2003) study described above, 29% of thelitigants failed two or more SVTs while none of the non-litigants did.

Approaching the same question from a different angle, Greve et al. (2008)found that pain patients classified as malingering using only embedded indicatorsand self-report measures did not differ from simulators reported in the publishedliterature on RDS, PDRT, TOMM, WMT, and FBS. At the same time, the painpatients with incentive who passed all selected SVTs did not differ from chronicpain patients described in the published literature who had no known externalincentive to appear disabled.

In a similar study of chronic pain patients (n¼ 536), Greve, Aguerrevere, et al.(2009) found essentially no difference between patients identified as malingering onthe basis of multiple SVT failures (Probable MPRD), those who scored significantlybelow chance (Definite MPRD) on a forced choice SVT, and from simulators onclinical tests of memory (CVLT), psychological state (MCMI-III), and motorspeed (finger tapping). Moreover, the pain patients who were classified as notMPRD by virtue of having passed all selected SVTs always performed significantlybetter than those determined to be intentionally exaggerating or faking theirdisability.

Taken together, these findings support Larrabee’s (2008) conclusion that wasbased on statistical arguments ‘‘that multiple SVT failures . . . provide strongevidence for probable malingering when two SVTs are failed, and very strongevidence for probable malingering, if not definite malingering, when three SVTs arefailed’’ (p. 677). The one caveat is that the SVTs must be relatively independent innon-malingering populations; the likelihood of non-independence can be increased

127VALIDITY ASSESSMENT IN CHRONIC PAIN

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 22: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

by a priori selection of SVTs from different behavioral and cognitive domains.Moreover, with properly selected SVTs that assess these different behavioraldomains, passing all indicators at cutoffs associated with high sensitivity is a strongindication that a patient is not malingering.

CONCLUSIONS

The purpose of this paper was to review the science on SVTs available for usein patients claiming disability associated with chronic pain as well as the empiricaland conceptual basis for particular choices of methods. The forgoing text suggests anumber of general conclusions and recommendations.

(1) Assess malingering and validity comprehensively with well-calibrated tools.Many of these tools and their calibration data have been described in this articleand the primary source material is readily available. There remains aconsiderable amount of work to be done in the calibration of SVTs in chronicpain and physical effort measures which offers particularly fertile ground forresearch.

(2) An appropriately designed assessment with appropriately calibrated tools willincrease the likelihood that real malingering will be detected and decrease thelikelihood of an erroneous determination of malingering. The focus should beon the combined improbability of failed SVTs. Failure on two or three relativelyindependent SVTs is strong evidence that pain-related disability is beingintentionally exaggerated. In the context of multiple failed SVTs, passing otherSVTs does not significantly decrease the probability of malingering.

(3) Just because a person does not meet the minimum criteria for a diagnosis ofmalingering doesn’t mean they are not intentionally exaggerating theirdisability. There may be some positive findings that are so strong (e.g., achance [not below chance] level forced-choice SVT finding, FBS of 37) that it isnecessary to explicitly note that malingering could be influencing a patient’sclinical presentation.

(4) A valid presentation in one behavioral domain does not imply that thepresentation in other domains is also valid. In our experience, it is notuncommon for a pain patient to emphasize their physical disabilities whileminimizing psychological problems. Often cognitive impairment is emphasizedless in pain than in TBI. Malingering is a means to an end (apparent increaseddisability) and the specific approach is individualized.

(5) The more atypical or improbable the findings, the more likely the patient’s painrelated disability is being intentionally exaggerated. In this context, it isimportant to be sensitive to the qualitative inconsistencies (as opposed to thepsychometric inconsistencies which have been the subject of this paper). Thesequalitative inconsistencies can complement psychometric findings but need tobe used with caution.

(6) Conversely, if validity is comprehensively assessed and the patient is negative onALL indicators at cutoffs associated with high sensitivity (e.g.,460%), then onecan reasonably rule out malingering. However, any positive SVT result meansthat malingering is a concern.

128 KEVIN W. GREVE ET AL.

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 23: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

SUMMARY

Essential to the accurate detection of intentional exaggeration of disability isthe understanding that malingering is an act of will the goal of which is to increasethe appearance of disability beyond that which would naturally arise from the injuryor illness in question. Unlikely genuine clinical pathology whose related symptomsand disability are limited by natural underlying processes, the clinical presentationof a person choosing to malinger has no boundaries. In practice the presentation islimited only by the knowledge and beliefs of the individual and may be broadlyshaped by the nature of the claim. Therefore the clinician cannot make assumptionsabout the presentation (e.g., evaluation for cognitive malingering in TBI andexaggerated physical symptoms in pain) of a given case when one develops anassessment strategy. One simply does not know how the next case will present.Assessing validity with a variety of tools sensitive to the multiple manifestations ofmalingering will increase the odds of detecting invalid disability claims whilesimultaneously reducing the risk of falsely rejecting a valid claim.

ACKNOWLEDGMENT

The material in this article was originally published in German as the following:Greve, K. W., & Bianchini, K. J. (2009). Schmerz und Beschwerdenvalidierrung[Symptom validity testing in pain]. In T. Merten & D. Dettenborn (Eds.), Diagnostikder Beschwerdenvaliditat (pp. 193–229. Berlin: Deutsch Psychologen Verlag GmbH.This updated article is now published for the first time in English with the kindpermission of Deutscher Psychologen Verlag, Berlin.

REFERENCES

Aguerrevere, L., Greve, K. W., Bianchini, K. J., & Meyers, J. E. (2008). Detectingmalingering in traumatic brain injury and chronic pain with an abbreviated version ofthe Meyers Index for the MMPI-2. Archives of Clinical Neuropsychology, 23, 831–838.

Aguerrevere, L. E., Greve, K. W., Bianchini, K. J., & Ord, J. S. (2011). Classification

Accuracy of the Millon Clinical Multiaxial Inventory-III Modifier Indices in theDetection of Malingering in Traumatic Brain Injury. Journal of Clinical andExperimental Neuropsychology, 33, 497–504.

Aguerrevere, L. E., Guise, B. J., Greve, K. W., & Bianchini, K. J. (2009). Detectingmalingered pain-related disability with the Finger Tapping Test. The ClinicalNeuropsychologist, 23, 567.

American Psychiatric Association (2000). Diagnostic and statistical manual (4th ed. textrevised). Washington, DC: American Psychiatric Association.

Arbisi, P. A., & Ben-Porath, Y. S. (1995). An MMPI-2 infrequent response scale for use with

psychopathological populations: The infrequency-psychopathology scale, ‘‘F(p)’’.Psychological Assessment, 7, 424–431.

Arnold, G., & Boone, K. B. (2007). Use of motor and sensory tests as measures of effort.In K. B. Boone (Ed.), Assessment of feigned cognitive impairment: A neuropsychological

perspective (pp. 178–209). New York, NY: Guilford Press.Arnold, G., Boone, K. B., Lu, P., Dean, A., Wen, J., Nitch, S., & McPherson, S. (2005).

Sensitivity and specificity of finger tapping test scores for the detection of suspect effort.

The Clinical Neuropsychologist, 19, 105–120.

. . 129VALIDITY ASSESSMENT IN CHRONIC PAIN

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 24: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

Ashendorf, L., O’Bryant, S. E., & McCaffrey, R. J. (2003). Specificity of malingeringdetection strategies in older adults using the CVLT and WCST. The Clinical

Neuropsychologist, 17, 255–262.Bagby, R. M., Gillis, J. R., Toner, B. B., & Goldberg, J. (1991). Detecting fake-good and

fake-bad responding on the Millon Clinical Multiaxial Inventory–II. Journal of

Counseling and Clinical Psychology, 3, 496–498.Bauer, L., & McCaffrey, R. J. (2006). Coverage of the Test of Memory Malingering, Victoria

Symptom Validity Test, and Word Memory Test on the Internet: Is test security

threatened? Archives of Clinical Neuropsychology, 21, 121–126.Ben-Porath, Y. S. (2012). Interpreting the MMPI-2-RF. Minneapolis, MN: University of

Minnesota Press.

Ben-Porath, Y. S., Greve, K. W., Bianchini, K. J., & Kaufmann, P. M. (2009). The MMPI-2Symptom Validity Scale (FBS) is an empirically-validated measure of over-reporting inpersonal injury litigants and claimants: Reply to Butcher et al. (2008). PsychologicalInjury and the Law, 1, 62–85.

Ben-Porath, Y. S., & Tellegen, A. (2008). MMPI-2 Restructured form: Manual foradministration, scoring, and interpretation. Minneapolis, MN: University of MinnesotaPress.

Bianchini, K. J., Aguerrevere, L. E., Guise, B. J., Ord, J. S., Etherton, J. L., Meyers, J. E.,et al. (2012). Accuracy of the modified somatic perception questionnaire and pain disabilityindex in the detection of malingered pain-related disability in chronic pain. Manuscript

submitted for publication.Bianchini, K. J., Etherton, J. L., Greve, K. W., Heinly, M. T., & Meyers, J. E. (2008).

Classification accuracy of MMPI-2 validity scales in the detection of pain-relatedmalingering: A known-groups approach. Assessment, 15, 435–449.

Bianchini, K. J., Greve, K. W., & Glynn, G. (2005). On the diagnosis of malingered pain-related disability: Lessons from cognitive malingering research. The Spine Journal, 5,404–417.

Bianchini, K. J., Mathias, C. W., & Greve, K. W. (2001). Symptom validity testing: A criticalreview. The Clinical Neuropsychologist, 15, 19–45.

Binder, L. M. (1990). Malingering following minor head trauma. The Clinical

Neuropsychologist, 4, 25–36.Binder, L. M. (1992a). Malingering detected by forced choice testing of memory and tactile

sensation: A case report. Archives of Clinical Neuropsychology, 7, 155–163.

Binder, L. M. (1992b). Forced choice testing provides evidence of intentional exaggerationof tactile sensation deficits. Archives of Physical Medicine and Rehabilitation, 73, 377–380.

Binder, L. M. (1993). Portland digit recognition test manual (2nd ed.). Private publication:Portland, Oregon, USA.

Binder, L. M., & Pankratz, L. (1987). Neuropsychological evidence of a factitious memorycomplaint. Journal of Clinical and Experimental Neuropsychology, 9, 167–171.

Boone, K. B. (2007). Assessment of feigned cognitive impairment: A neuropsychological

perspective. New York, NY: Guilford Press.Boone, K. B., & Lu, P. H. (2003). Noncredible cognitive performance in the context of severe

brain injury. The Clinical Neuropsychologist, 17, 244–254.

Boone, K. B., & Lu, P. H. (2007). Non-forced-choice effort measures. In G. J. Larrabee (Ed.),Assessment of malingered neuropsychological deficits (pp. 27–43). New York, NY: OxfordUniversity Press.

Brady, J. P., & Lind, D. L. (1961). Experimental analysis of hysterical blindness. Archives ofGeneral Psychiatry, 4, 331–339.

Bruns, D., & Disorbio, J. M. (2003). Battery for health improvement 2: Manual. Minneapolis,MN: Pearson.

130 KEVIN W. GREVE ET AL.

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 25: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

Butcher, J. N., Dahlstrom, W. G., Graham, J. R., Tellegren, A., & Kaemmer, B. (1989).MMPI-2: Manual for administration and scoring. Minneapolis, MN: University of

Minnesota.Butters, N., & Cermak, L. S. (1980). Alcoholic Korsakoff’s syndrome. An information-

processing approach to amnesia. New York, NY: Academic Press.

Chafetz, M. (2011). Reducing the probability of false positives in malingering detection ofsocial security disability claimants. Clinical Neuropsychologist, 25, 1239–1252.

Chengalur, S. N., Smith, G. A., Nelson, R. C., & Sadoff, A. M. (1990). Assessing sincerity of

effort in maximal grip strength tests. Archives of Physical Medicine and Rehabilitation,69, 148–153.

Curtis, K. L., Brasseux, R., Greve, K. W., & Bianchini, K. J. (2008, February). Classification

accuracy of finger tapping and grooved pegboard indicators to malingering in traumaticbrain injury. A poster presented at the 36th annual meeting of the InternationalNeuropsychological Society, Waikoloa, Hawaii.

Curtis, K. L., Greve, K. W., & Bianchini, K. J. (2009). The Wechsler Adult Intelligence Test –

III and malingering in traumatic brain injury: Classification accuracy in known groups.Assessment, 16, 401–414.

Curtis, K. L., Greve, K. W., Bianchini, K. J., & Brennan, A. (2006). California Verbal

Learning Test indicators of Malingered Neurocognitive Dysfunction: Sensitivity andspecificity in traumatic brain injury. Assessment, 13, 46–61.

Daubert, S. D., & Metzler, A. E. (2000). The detection of fake-bad and fake-good

responding on the Millon Clinical Multiaxial Inventory III. Psychological Assessment,12, 418–424.

Dawes, R. M., & Meehl, P. E. (1966). Mixed group validation: A method for determining thevalidity of diagnostic signs without using criterion groups. Psychological Bulletin, 66,

63–67.Dean, A. C., Victor, T. L., Boone, K. B., & Arnold, G. (2008). The relationship of IQ to

effort test performance. The Clinical Neuropsychologist, 22, 705–722.

Deardorff, W. W., Chino, A. F., & Scott, D. W. (1993). Characteristics of chronic painpatients: Factor analysis of the MMPI-2. Pain, 54, 153–158.

Delis, D. C., Kramer, J. H., Kaplan, E., & Ober, B. A. (1987). California verbal learning test:

Adult version. Manual. San Antonio, TX: Psychological Corporation.Delis, D. C., Kramer, J. H., Kaplan, E., & Ober, B. A. (2000). California verbal learning test –

Second edition. New York, NY: Psychological Corporation.

Etherton, J. L., Bianchini, K. J., Ciota, M. A., Heinly, M. T., & Greve, K. W. (2006).Pain, malingering, and the WAIS-III working memory index. The Spine Journal, 6,61–71.

Etherton, J. L., Bianchini, K. J., Heinly, M. T., & Greve, K. W. (2006). Pain, malingering,

and performance on the WAIS-III processing speed index. Journal of Clinical andExperimental Neuropsychology, 28, 1218–1237.

Fishbain, D. A., Cutler, R., Rosomoff, H. L., & Rosomoff, R. S. (1999). Chronic pain

disability exaggeration/malingering and submaximal effort research. The Clinical Journalof Pain, 15, 244–274.

Frederick, R. I. (2000). Mixed group validation: A method to address the limitations of

criterion group validation in research on malingering detection. Behavioral Science andthe Law, 18, 693–718.

Frederick, R. I., & Bowden, S. C. (2009). The Test Validation Summary. Assessment, 16,

215–236.Frederick, R. I., & Speed, F. M. (2007). On the interpretation of below-chance responding in

forced-choice tests. Assessment, 14, 3–11.

. . 131VALIDITY ASSESSMENT IN CHRONIC PAIN

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 26: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

Friedman, A. F., Gleser, G. C., Smeltzer, D. J., Wakefield, J. A., & Schwartz, M. S. (1983).MMPI overlap item scales for differentiating psychotics, neurotics, and nonpsychiatric

groups. Journal of Consulting and Clinical Psychology, 51, 629–631.Gatchel, R. J. (2004). Comorbidity of chronic pain and mental health disorders: The

biopsychosocial perspective. The American Psychologist, 59, 795–805.

Gatchel, R. J., Polatin, P. B., & Kinney, R. K. (1995). Predicting outcome of chronic backpain using clinical predictors of psychopathology: A prospective analysis. HealthPsychology, 14, 415–420.

Green, P. (2005). Green’s word memory test for window’s: User’s manual. Edmonton, Canada:Green’s Publishing, Inc.

Green, P., Allen, L. M., & Astner, K. (1996). The word memory test: A user’s guide to the oral

and computer-administered forms, US Version 1.1. Durham, NC: CogniSyst.Greiffenstein, M. F. (2007). Motor, sensory, and perceptual-motor pseudoabnormalities.

In G. J. Larrabee (Ed.), Assessment of malingered neuropsychological deficits(pp. 100–130). New York, NY: Oxford University Press.

Greiffenstein, M. F., Baker, W. J., & Gola, T. (1994). Validation of malingered amnesiameasures with a large clinical sample. Psychological Assessment, 6, 218–224.

Greiffenstein, M. F., Fox, D., & Lees-Haley, P. R. (2007). The MMPI-2 Fake Bad Scale in

detection of noncredible brain injury claims. In K. Boone (Ed.), Assessment of feignedcognitive impairment: A neuropsychological perspective (pp. 210–235). New York, NY:Guilford Publications.

Greiffenstein, M., Gola, T., & Baker, J. (1995). MMPI-2 validity scales versus domainspecific measures in detection of factitious traumatic brain injury. The ClinicalNeuropsychologist, 9, 230–240.

Greve K. W., Aguerrevere, L. E., Bianchini, K. J., Etherton J. L., & Heinly, M. T. (2009,

November). Validation of the Bianchini et al. (2005) criterion for malingered pain-relateddisability. A poster presented at the 29th annual meeting of the National Academy ofNeuropsychology, New Orleans, Louisiana.

Greve, K. W., & Bianchini, K. J. (2004). Setting empirical cut-offs on psychometric indicatorsof negative response bias: A methodological commentary with recommendations.Archives of Clinical Neuropsychology, 19, 533–541.

Greve, K. W., & Bianchini, K. J. (2012). The psychological assessment of pain-relateddisability. In D. Faust (Ed.), Coping with psychiatric and psychological testimony (6th ed.,pp. 587–609). New York, NY: Oxford University Press.

Greve, K. W., Bianchini, K. J., & Ameduri, C. J. (2003). The use of a forced-choice test oftactile discrimination in the evaluation of functional sensory loss. Archives of PhysicalMedicine and Rehabilitation, 84, 1233–1236.

Greve, K. W., Bianchini, K. J., Etherton, J. L., Ord, J. S., & Curtis, K. L. (2009). Detecting

malingered pain-related disability: Classification accuracy of the Portland DigitRecognition Test. The Clinical Neuropsychologist, 23, 850–869.

Greve, K. W., Bianchini, K. J., Love, J. M., Brennan, A., & Heinly, M. T. (2006). Sensitivity

and specificity of MMPI-2 validity scales and indicators to malingered neurocognitivedysfunction in traumatic brain injury. The Clinical Neuropsychologist, 20, 491–512.

Greve, K. W., Bianchini, K. B., Mathias, C. W., Houston, R. J., & Crouch, J. A. (2003).

Detecting malingered performance on the Wechsler adult intelligence scale: Validation ofMittenberg’s approach in traumatic brain injury. Archives of Clinical Neuropsychology,18, 245–260.

Greve, K. W., Curtis, K. L., Bianchini, K. J., & Ord, J. (2009). Are the original and secondedition of the California Verbal Learning Test equally accurate in detecting malingering?Assessment, 16, 237–248.

132 KEVIN W. GREVE ET AL.

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 27: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

Greve, K. W., Etherton, J. L., Ord, J., Bianchini, K. J., & Curtis, K. L. (2009). Detectingmalingered pain-related disability: Classification accuracy of the Test of Memory

Malingering. The Clinical Neuropsychologist, 23, 1250–1271.Greve, K. W., Ord, J., Curtis, K. L., Bianchini, K. J., & Brennan, A. (2008). Detecting

malingering in traumatic brain injury and chronic pain: A comparison of three forced-

choice symptom validity tests. The Clinical Neuropsychologist, 22, 896–918.Greve, K. W., Ord, J. S., Etherton, J. L., Bianchini, K. J., & Curtis, K. L. (November 2009).

Criterion groups validation of the Word Memory Test in chronic pain. Poster presented at

the annual meeting of the American Academy of Clinical Neuropsychology.Greve, K. W., Springer, S., Bianchini, K. J., Black, F. W., Heinly, M. T., Love, J. M., . . .

Ciota, M. A. (2007). Malingering in toxic exposure: Classification accuracy of reliable

digit span and WAIS-III digit span scaled scores. Assessment, 14, 12–21.Grimes, D. A., & Schulz, K. F. (2005). Refining clinical diagnosis with likelihood ratios.

Lancet, 365, 1500–1505.Grossman, L. S., & Craig, R. J. (1995). Comparison of MCMI–II and 16PF validity scales.

Journal of Personality Assessment, 64, 384–389.Grosz, H. J., & Zimmerman, J. (1965). Experimental analysis of hysterical blindness. Archives

of General Psychiatry, 13, 255–260.

Guo, H. R., Tanaka, S., Halperin, W. E., & Cameron, L. L. (1999). Back pain prevalence inUS industry and estimates of lost workdays. American Journal of Public Health, 89,1029–1035.

Heilbronner, R. L., Sweet, J. J., Morgan, J. E., Larrabee, G. J., Millis, S. R., & ConferenceParticipants (2009). American Academy of Clinical Neuropsychology ConsensusConference Statement on the Neuropsychological Assessment of Effort, ResponseBias, and Malingering. The Clinical Neuropsychologist, 23, 1093–1129.

Heinly, M. T., Greve, K. W., Bianchini, K. J., Love, J. L., & Brennan, A. (2005). WAIS digitspan-based indicators of malingered neurocognitive dysfunction: Classification accuracyin traumatic brain injury. Assessment, 12, 429–444.

Iverson, G. L., & Franzen, M. D. (1994). The Recognition Memory Test, Digit Span, andKnox Cube Test as markers of malingered memory impairment. Assessment, 1, 323–334.

Iverson, G. L., & Franzen, M. D. (1996). Using multiple objective memory procedures to

detect simulated malingering. Journal of Clinical and Experimental Neuropsychology, 18,38–51.

Iverson, G. L., & Tulsky, D. S. (2003). Detecting malingering on the WAIS-III unusual digit

span performance patterns in the normal population and in clinical groups. Archives ofClinical Neuropsychology, 18, 1–9.

Jasinski, L. J., Berry, D. T., Shandera, A. L., & Clark, J. A. (2011). Use of the Wechsler adultintelligence scale digit span subtest for malingering detection: A meta-analytic review.

Journal of Clinical and Experimental Neuropsychology, 33, 300–314.Keller, L. S., & Butcher, J. N. (1991). Assessment of chronic pain patients with the MMPI-2.

MMPI-2 Monographs. Minneapolis, MN: University of Minnesota Press.

Larrabee, G. J. (2003a). Detection of malingering using atypical performance patterns onstandard neuropsychological tests. The Clinical Neuropsychologist, 17, 410–425.

Larrabee, G. J. (2003b). Exaggerated pain report in litigants with malingered neurocognitive

dysfunction. The Clinical Neuropsychologist, 17, 395–401.Larrabee, G. J. (2003c). Detection of symptom exaggeration with the MMPI-2 in

litigants with malingered neurocognitive dysfunction. The Clinical Neuropsychologist,

17, 54–68.Larrabee, G. J. (2005). Assessment of malingering. In G. J. Larrabee (Ed.), Forensic

neuropsychology: A scientific approach (pp. 115–158). New York, NY: Oxford UniversityPress.

133VALIDITY ASSESSMENT IN CHRONIC PAIN

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 28: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

Larrabee, G. J. (2007). Assessment of malingered neuropsychological deficits. New York, NY:Oxford University Press.

Larrabee, G. J. (2008). Aggregation across multiple indicators improves the detection ofmalingering: Relationship to likelihood ratios. The Clinical Neuropsychologist, 22,666–679.

Larrabee, G. J., Greiffenstein, M. F., Greve, K. W., & Bianchini, K. J. (2007). Refiningdiagnostic criteria for malingering. In G. J. Larrabee (Ed.), Assessment of malingeredneuropsychological deficits (pp. 334–371). New York, NY: Oxford University Press.

Larrabee, G. J., Millis, S. R., & Meyers, J. E. (2008). Sensitivity to brain dysfunction of theHalstead-Reitan vs an ability-focused neuropsychological battery. The ClinicalNeuropsychologist, 22, 813–825.

Lawlis, G. F., Cuencas, R., Selby, D., & McCoy, C. E. (1989). The development of the Dallaspain questionnaire. An assessment of the impact of spinal pain on behavior. Spine, 14,511–516.

Lechner, D. E., Bradbury, S. F., & Bradley, L. A. (1998). Detecting sincerity of effort:

A summary of methods and approaches. Physical Therapy, 78, 867–888.Lees-Haley, P. R. (1992). Efficacy of MMPI-2 validity scales and MCMI-II modifier scales

for detecting spurious PTSD claims: F, F-K, fake bad scale, ego strength, subtle-obvious

subscales, DIS, and DEB. Journal of Clinical Psychology, 48, 681–689.Lees-Haley, P. R., English, L. T., & Glenn, W. J. (1991). A fake bad scale on the MMPI-2 for

personal injury claimants. Psychological Reports, 68, 203–210.

Lees-Haley, P. R., Iverson, G. L., Lange, R. T., Fox, D. D., & Allen, L. M. (2003).Malingering in forensic neuropsychology: Daubert and the MMPI-2. Journal of ForensicNeuropsychology, 3, 167–203.

Lezak, M. D., Howieson, D. B., & Loring, D. W. (2004). Neuropsychological assessment

(4th ed.). New York: Oxford University Press.Lubin, B., Larsen, R. M., & Matarazzo, J. D. (1984). Patterns of psychological test usage in

the United States. The American Psychologist, 39, 451–454.

Main, C. J. (1983). The Modified Somatic Perception Questionnaire (MSPQ). Journal ofPsychosomatic Research, 27, 503–514.

Marks, P. A., & Seeman, W. (1963). The actuarial description of abnormal personality.

Baltimore: Williams & Wilkins.Martens, M., Donders, J., & Millis, S. R. (2001). Evaluation of invalid response sets after

traumatic head injury. Journal of Forensic Neuropsychology, 2, 1–18.

Mathias, C. W., Greve, K. W., Bianchini, K. B., Houston, R. J., & Crouch, J. A. (2002).Detecting malingered neurocognitive dysfunction using the reliable digit span intraumatic brain injury. Assessment, 9, 301–308.

McCusker, P. J., Moran, M. J., Serfass, L., & Peterson, K. H. (2003). Comparability of the

MMPI-2 F(p) and F scales and the SIRS in clinical use with suspected malingerers.International Journal of Offender Therapy and Comparative Criminology, 47, 585–596.

Melzack, R. (1975). The McGill pain questionnaire: Major properties and scoring methods.

Pain, 1, 277–299.Meyers, J. E., & Diep, A. (2000). Assessment of malingering in chronic pain patients using

neuropsychological tests. Applied Neuropsychology, 7, 133–139.

Meyers, J. E., Millis, S. R., & Volkert, K. (2002). A validity index for the MMPI-2. Archivesof Clinical Neuropsychology, 17, 157–169.

Meyers, J. E., & Volbrecht, M. (1998). Validation of reliable digits for detection of

malingering. Assessment, 5, 303–307.Meyers, J. E., & Volbrecht, M. E. (2003). A validation of multiple malingering

detection methods in a large clinical sample. Archives of Clinical Neuropsychology, 18,261–276.

134 KEVIN W. GREVE ET AL.

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 29: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

Millis, S. R., & Putnam, S. H. (1997). The CVLT in the assessment of financially compensablemild head injury: Further development. Paper presented at the meeting of the

International Neuropsychology Society, Bergen, Norway.Millis, S. R., Putnam, S. H., Adams, K. M., & Ricker, J. H. (1995). The California Verbal

Learning Test in the detection of incomplete effort in neuropsychological evaluation.

Psychological Assessment, 7, 463–471.Millis, S., Ross, S., & Ricker, J. (1998). Detection of incomplete effort on the Wechsler Adult

intelligence scale-revised: A cross-validation. Journal of Clinical and Experimental

Neuropsychology, 20, 167–173.Millis, S. R., &Volinsky, C. T. (2001). Assessment of response bias in mild head injury: Beyond

malingering tests. Journal of Clinical and Experimental Neuropsychology, 23, 809–828.

Millon, T. (1994). Millon Clinical Multiaxial Inventory-III: Manual. Minneapolis, MN:National Computer Systems.

Millon, T., Davis, R., & Millon, C. (1997). Millon Clinical Multiaxial Inventory III: Manual.Minneapolis, MN: National Computer Systems.

Mittenberg, W., Theroux, S., Aguila-Puentes, G., Bianchini, K., Greve, K., & Rayls, K.(2001). Identification of malingered head injury on the Wechsler adult intelligence scale –3rd edition. The Clinical Neuropsychologist, 15, 440–445.

Mittenberg, W., Theroux-Fichera, S., Zielinski, R. E., & Heilbronner, R. L. (1995).Identification of malingered head injury on the Wechsler adult intelligence scale –Revised. Professional Psychology: Research and Practice, 26, 491–498.

Morgan, J., & Sweet, J. (2009). Neuropsychology of malingering casebook. New York, NY:Taylor & Francis.

Nelson, N. W., Sweet, J. J., & Demakis, G. J. (2006). Meta-analysis of the MMPI-2 fake badscale: Utility in forensic practice. The Clinical Neuropsychologist, 20, 39–58.

Nitch, S. R., & Glassmire, D. M. (2007). Non-forced-choice measures to detect noncrediblecognitive performance. In K. B. Boone (Ed.), Assessment of feigned cognitive impairment:A neuropsychological perspective (pp. 78–102). New York, NY: Guilford Press.

Pankratz, L. (1983). A new technique for the assessment and modification of feigned memorydeficit. Perceptual and Motor Skills, 57, 367–372.

Pankratz, L., Fausti, S. A., & Peed, S. (1975). Case study: A forced-choice technique to

evaluate deafness in the hysterical or malingering patient. Journal of Consulting andClinical Psychology, 43, 421–422.

Rabin, L. A., Barr, W. B., & Burton, L. A. (2005). Assessment practices of clinical

neuropsychologists in the United States and Canada: A survey of INS, NAN, and APADivision 40 members. Archives of Clinical Neuropsychology, 20, 33–65.

Reese, C. S., Suhr, J. A., & Riddle, T. L. (2012). Exploration of malingering indices in theWechsler Adult Intelligence Scale-Fourth Edition Digit Span subtest. Archives of Clinical

Neuropsychology, 27, 176–181.Retzlaff, P., Sheehan, E., & Fiel, A. (1991). MCMI–II report style and bias: Profile and

validity scale analyses. Journal of Personality Assessment, 56, 466–477.

Reynolds, C. R. (1998). Common sense, clinicians, and actuarialism. In C. R. Reynolds (Ed.),Detection of malingering during head injury litigation (pp. 261–286). New York, NY:Plenum.

Reznek, L. (2005). The Rey 15-item memory test for malingering: A meta-analysis. BrainInjury, 19, 539–543.

Robinson, M. E., Geisser, M. E., Hanson, C. S., & O’Connor, P. D. (1993). Detecting

submaximal efforts in grip strength testing with the coefficient of variation. Journal ofOccupational Rehabilitation, 3, 45–50.

Robinson, M. E., Swimmer, G. I., & Rallof, D. (1989). The P-A-I-N MMPI classificationsystem: A critical review. Pain, 37, 211–214.

. . 135VALIDITY ASSESSMENT IN CHRONIC PAIN

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 30: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

Rogers, R. (1997). Researching dissimulation. In R. Rogers (Ed.), Clinical assessment ofmalingering and deception (pp. 398–426). New York, NY: Guilford Press.

Rogers, R., Sewell, K. W., Martin, M. A., & Vitacco, M. J. (2003). Detection of feignedmental disorders: A meta-analysis of the MMPI-2 and malingering. Assessment, 10,160–177.

Schoenberg, M. R., Dorr, D., & Morgan, C. D. (2003). The ability of the Millon ClinicalMultiaxial Inventory – third edition to detect malingering. Psychological Assessment, 15,198–204.

Schroeder, R. W., Baade, L. E., Peck, C. P., VonDran, E. J., Brockman, C. J., Webster, B. K.,& Heinrichs, R. J. (2012). Validation of MMPI-2-RF validity scales in criterion groupneuropsychological samples. The Clinical Neuropsychologist, 26, 129–146.

Sellbom, M., Wygant, D., & Bagby, M. (2012). Utility of the MMPI-2-RF in detecting non-credible somatic complaints. Psychiatry Research, 197, 295–301.

Slesinger, D., Archer, R. P., & Duane, W. (2002). MMPI-2 characteristics in a chronic painpopulation. Assessment, 9, 406–414.

Slick, D. J., Sherman, E. M. S., & Iverson, G. L. (1999). Diagnostic criteria for malingeringneurocognitive dysfunction: Proposed standards for clinical practice and research. TheClinical Neuropsychologist, 13, 545–561.

Slick, D. J., Tan, J. E., Strauss, E. H., & Hultsch, D. F. (2004). Detecting malingering:A survey of experts’ practices. Archives of Clinical Neuropsychology, 19, 465–473.

Smith, G. A., Nelson, R. C., Sadoff, S. J., & Sadoff, A. M. (1989). Assessing sincerity of

effort in maximal grip strength tests. Archives of Physical Medicine and Rehabilitation,68, 73–80.

Suhr, J. A., & Barrash, J. (2007). Performance on standard attention, memory, andpsychomotor speed tasks as indicators of malingering. In G. J. Larrabee (Ed.),

Assessment of malingered neuropsychological deficits (pp. 131–170). New York, NY:Oxford University Press.

Sweet, J. J., Wolfe, P., Sattlberger, E., Numan, B., Rosenfeld, J. P., Clingerman, S., &

Nies, K. J. (2000). Further investigation of traumatic brain injury versus insufficienteffort with the California Verbal Learning Test. Archives of Clinical Neuropsychology,15, 105–113.

Tait, R. C., Chibnall, J. T., & Krause, S. (1990). The Pain Disability Index: Psychometricproperties. Pain, 40, 171–182.

Tombaugh, T. (1996). Test of Memory Malingering manual. New York, NY: MultiHealth

Systems.Verhaak, P. F., Kerssens, J. J., Dekker, J., Sorbi, M. J., & Bensing, J. M. (1998). Prevalence

of chronic benign pain disorder among adults: A review of the literature. Pain, 77,231–239.

Vickery, C. E., Berry, D. T. R., Dearth, C. S., Vagnini, V. L., Baser, R. E., Cragar, D. E., &Orey, S. A. (2004). Head injury and the ability to feign neuropsychological deficits.Archives of Clinical Neuropsychology, 19, 37–48.

Victor, T. L., Boone, K. B., Serpa, J. G., Buehler, J., & Ziegler, E. A. (2009). Interpreting themeaning of multiple symptom validity test failure. The Clinical Neuropsychologist, 23,297–313.

Von Korff, M., Crane, P., Lane, M., Miglioretti, D. L., Simon, G., Saunders, K., . . .

Kessler, R. (2005). Chronic spinal pain and physical-mental comorbidity in the UnitedStates: Results from the national comorbidity survey replication. Pain, 113, 331–339.

Wechsler, D. (1997). Wechsler adult intelligence scale – Third edition: Administration andscoring manual. San Antonio, TX: The Psychological Corporation.

Wechsler, D. (2008). Wechsler adult intelligence scale – Fourth edition: Technical andinterpretive manual. San Antonio, TX: Pearson Assessment.

136 KEVIN W. GREVE ET AL.

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014

Page 31: The Assessment of Performance and Self-report Validity in Persons Claiming Pain-related Disability

Wolfe, P. L., Millis, S. R., Hanks, R., Fichtenberg, N., Larrabee, G. J., & Sweet, J. J. (2010).Effort indicators within the California Verbal Learning Test-II (CVLT-II). Clinical

Neuropsychologist, 24, 153–168.Young, J. C., Sawyer, R. J., Roper, B. L., & Baughman, B. C. (2012). Expansion and

re-examination of digit span effort indices on the WAIS-IV. The Clinical

Neuropsychologist, 26, 147–159.

137VALIDITY ASSESSMENT IN CHRONIC PAIN

Dow

nloa

ded

by [

Dal

hous

ie U

nive

rsity

] at

02:

21 1

3 N

ovem

ber

2014