4 Study 1: Validating The Heart Manikin and The Rejection Manipulation

The critical outcome measure for my dissertation is a state measure of belonging—that captures how much participants feel accepted, connected, loved, and cared for at a given moment. My dissertation required a new scale since existing scales focus on measuring belonging in a group context or belonging as an individual difference. For example, the need-threat scale (Williams, 2009), measures how one felt rejected by a group in Cyberball (e.g., “I felt the other players interacted with me a lot,” “I felt I belonged to the group”). The UCLA loneliness scale (Russell, 1996) measures threatened belonging at the individual difference level, not at the state level (e.g., “How often do you feel that you lack companionship?”). In this study, I developed a new scale that is unconstrained in a group context and measures a state belonging.

In Studies 2 and 3, I planned to have all participants complete a social rejection induction essay from previous studies (Sunami et al., 2019; Twenge & Campbell, 2003), without a control or acceptance condition to reduce the number of participants and costs. Since various forms of this manipulation have been successfully used in many labs to induce rejection, I was initially confident that this manipulation would be effective. To further ensure that this particular social rejection induction was effective before running my primary studies, I examined the effectiveness of the rejection manipulations in Studies 1c and 1e.

4.1 The Heart Manikin

To provide a suitable measure for my dissertation, I proposed the Heart Self-Assessment Manikin scale, a new single-item measure of belonging (Figure 4.1). The Heart Self-Assessment Manikin is an adapted version of the Self-Assessment Manikin (Figure 4.2) that measures emotional valence, arousal, and dominance in a given moment (Bradley & Lang, 1994; P. J. Lang, 1980). The original Self-Assessment Manikin showed good convergent validity with existing verbal measures (Bradley & Lang, 1994). In the original Self-Assessment Manikin, participants see pictorial figures and choose a number corresponding to a figure that best describes their current feelings (valence, arousal, dominance). Likewise, the Heart Self-Assessment Manikin asks participants to indicate the number best reflects their current belonging. Similar to the original Self-Assessment Manikin, the Heart Self-Assessment Manikin is easy to administer, quick to complete, and easily understood by participants relative to a traditional text-based questionnaire.

Figure 4.1: The Heart Manikin

Note. Participants indicate how they feel at the moment on a 9-point scale. The body and the face of the figure is taken from the valence subscale of the original Self-Assessment Manikin (P. J. Lang, 1980).

Figure 4.2: The original Self-Assessment Manikin (Lang, 1980)

Note. From top to bottom, the items refer to valence, arousal, and dominance, respectively. Participants indicate how they feel at the moment on a 9-point scale. The scale has been validated as a state measure (P. J. Lang, 1980). The vector drawings of the valence and arousal items are adopted from an existing GitHub repository at https://github.com/hexa-/SAM-vectors (hexa-, 2020)

4.2 Current Study

In Study 1, I evaluated construct validity and test-retest reliability of the Heart Manikin, using five existing datasets (Studies 1a, 1b, 1c, 1d, and 1e). For construct validity, I focused on convergent validity, discriminant validity, and the scale’s sensitivity to a laboratory manipulation already known to affect belonging (Boateng et al., 2018). In addition, I tested the effectiveness of the rejection manipulation to be used in the subsequent dissertation studies (Study 1e).

4.2.1 Convergent Validity

If the Heart Manikin measures belonging, it should correlate with other measures of belonging. Thus, I expected that the Heart Manikin would converge measures of belonging [Studies 1c, 1d, and 1e; Williams (2009)] and social isolation (Study 1b). I also expected that the heart Manikin scores would also converge with measures of depression (Study 1a) since lonely people experience more depressive symptoms than non-lonely people (Cacioppo et al., 2006; Jaremka, Fagundes, Glaser, et al., 2013; Jaremka et al., 2014).

The belonging need is pervasive—people with lower belonging may also experience lower self-esteem, a lower sense of control, and lower sense of meaning (Hartgerink et al., 2015; Leary et al., 1995; Williams, 2009). Thus, I expected that the Heart Manikin scores converge with the measures of self-esteem, control, and a sense of meaning (Studies 1c, 1d, and 1e).

Socially accepted people experience positive emotions; socially rejected people experience negative emotions (Gerber & Wheeler, 2009; Richman & Leary, 2009; Williams, 2009). The valence scores of the Self-Assessment Manikin (Bradley & Lang, 1994) measures how positively or negatively a person is feeling at a moment. Thus, I expected that the Heart Manikin scores would positively correlate with the valence scores of the original Self-Assessment Manikin (Studies 1a, 1b, 1c, 1d, 1e).

People who expect and fear social rejection tend to report lower belonging than those who do not since they are prone to act in ways to elicit social rejection from others, akin to self-fulfilling prophecy (Downey, Freitas, et al., 1998). Thus, I expected that the Heart Manikin scores would negatively correlate with the measures of sensitivity of social rejection, including rejection sensitivity (Study 1e), fear of negative evaluation (Study 1e), and avoidant and anxious attachment styles (Study 1e).

People with a caring and nurturing relationship with their romantic partner should report higher belonging than those who do not. Thus, I expected that the Heart Manikin scores would positively correlate with the degree of support that they receive from their partner, the relationship quality, and closeness to their partner, and perceived responsiveness of their partner (Study 1b). Conversely, the heart manikin scores should negatively correlate with the measures of conflicts, ostracism, psychological and physical abuse in a romantic relationship (Studies 1b and 1c).

4.2.2 Discriminant validity

If the Heart Manikin scale measures state belonging, its scores should be discriminant against measures of other unrelated constructs. To examine discriminant validity, I explored correlations between the Heart Manikin scores with the following measures: arousal and dominance subscales of the Self-Assessment Manikin [Studies 1a, 1c, 1d, and 1e; Bradley & Lang (1994); P. J. Lang (1980)], beliefs about biological differences between Black and White people [Study 1a; Hoffman et al. (2016)], interpersonal reactivity [Study 1a; Davis (1980)], self-monitoring tendencies [Study 1a; Snyder (1974)], tendency to be excited by paradoxes [Study 1a; Miron-Spektor et al. (2018)], capacity to acknowledge and integrate competing opinions of others [Study 1a; Zhang et al. (2015)], membership to different social groups [Study 1a; Haslam et al. (2008 Oct-Dec)], subjective social status [Studies 1b, 1c, and 1e; Adler et al. (2000)], perpetration of abusive and controlling behaviors in a romantic relationship [Study 1b; Graham-Kevan & Archer (2003) and Postmus et al. (2015)], food cravings (Study 1b), dietary social support (Study 1b), body image (Study 1b), levels of physical activity [Study 1b; Godin (2011)], sleep quality [Study 1b; Cella et al. (2019)], narcissism [Study 1b; Konrath et al. (2014)], perceived psychological stress [Study 1b; S. Cohen et al. (1983)], the need for closure [Study 1d; Kruglanski (1990)], and adherence to traditional social values [Study 1d; Proulx & Heine (2008); Rosenblatt et al. (1989)].

4.2.4 Test-Retest Reliability

For studies that measured the Heart Manikin repeatedly (Studies 1b, 1c, 1d, and 1e), I evaluated the test-retest reliability of the scale by calculating intraclass correlations (Koo & Li, 2016; Rabe-Hesketh & Skrondal, 2012). I did not make any a priori prediction about the test-retest reliability of the Heart Manikin for two reasons. First, I could not make an a priori prediction about test-retest reliability since the measure was designed to be a state scale and thus, by definition, should fluctuate over time. Second, the primary purpose of validating the Heart Manikin was to use it as an outcome measure after an experimental manipulation for Studies 2 and 3. Since I was not relying on the temporal stability of the measure for these studies (e.g., comparing pre vs. post scores), the utility of the scale for my dissertation did not depend on the test-retest reliability of the scale. I calculated the test-retest reliability of the Heart Manikin to explore the psychometric property of the scale.

4.2.5 Validating the Rejection Manipulation

In Studies 2 and 3, I planned to induce feelings of social rejection using the rejection prompt in the social rejection paradigm used in the study. In this paradigm, participants would be asked to write about their time being rejected in the past (Sunami et al., 2019; Twenge & Campbell, 2003). I planned to use only the social rejection prompt, without an acceptance or neutral condition, to reduce the number of participants and thus the costs of the studies. The downside of this approach is that I was not able to test the effectiveness of the manipulation in Studies 2 and 3 since I only used the rejection condition without a non-rejection condition. Thus, it was crucial to ensure that the social rejection manipulation used in Studies 2 and 3 was effective before conducting the studies. Again, many other laboratories used various forms of this manipulation to induce rejection (Bernstein et al., 2010; Derrick et al., 2009; Troisi et al., 2015), adding the confidence to the effectiveness of the manipulation. To further ensure the effectiveness in our laboratory, I validated the rejection via a pilot study, consistent with an existing recommendation (Hauser et al., 2018). Study 1e included the essay rejection manipulation with the same rejection induction that I planned to use in Studies 2 and 3 and a control condition. Thus, I treated Study 1e as a pilot study and examine if the rejection manipulation affected belonging.

4.2.6 General Analytic Strategy

To examine convergent validity, I tested an association between the aforementioned measures used in the study and the Heart Manikin. I used an alpha of .05 as a cutoff point for statistical significance. To examine discriminant validity, I used an equivalence test (Lakens, 2017) since a non-significant relationship is not an absence of a relationship in a null-hypothesis testing. To do so, I set the smallest effect size of interest (SESOI) that is the minimal effect size that I consider theoretically meaningful. Any effect size that was lower than this effect size was considered theoretically negligible, and thus equivalent to zero. To determine the SESOI, I first used the average effect size (r = .21) derived from 474 meta-analytic effect sizes (with more than 25,000 studies) in social psychology (Richard et al., 2003). I first transformed this estimate (r = .21) to Fisher’s z (Fisher’s z = .21) for normality (Borenstein, 2019). To safeguard against the inflation of effect size, I consider the lower bound of the 60% confidence interval as the target effect size (Perugini et al., 2014). To calculate the confidence interval, I first calculated the standard error for the Fisher’s z using the sample size of 474, treating each meta-analytic effect size independently (Borenstein, 2019):

\[SE_{z} = \sqrt{\frac{1}{474-3}} = 0.046\]

Then, I calculated the confidence interval using the normal distribution. The lower bound of the 60% confidence interval was Fisher’s z = 0.17 (Fisher z = 0.21, 60%CI[0.17, 0.25]), which was equivalent to r = 0.17 and Cohen’s d = 0.35. Thus, I set the SESOI as r = .17. I compared any non-significant observed coefficient with the SESOI to see if the observed effect size was theoretically negligible. To examine the test-retest reliability, I calculated ICCs and interpreted them as poor (<.50), moderate (.50–.75), good (.75–.90), and excellent (>.90) based on existing guidelines (Koo & Li, 2016).

Studies 1b, 1c, 1d, and 1e include data where participants completed the Heart Manikin and other measures across multiple time points. To account for the dependency in data, I used a linear mixed model. I describe fixed predictors under each study section. I first included both random intercept and the random effect of Time. If the model did not converge, I removed the random effect of Time from the model. If the model converged, I retained the random Time effect. To determine the structure of the residual variance-covariance matrix (R matrix) and the random-effects variance-covariance structure (G matrix), I tested models with different structures and choose the one that fits the data best. For the R matrix, I tested diagonal, compound symmetry, and unstructured structures. For the G matrix, I tested identity, variance components, and unstructured structures.

4.3 Study 1a (RPR Data)

I used a cross-sectional dataset from an online mass testing session conducted for the psychology participant pool. See Table 4.1 for the measures included in this study.

Table 4.1: Summary of Measures for Study 1a
Measure	Time	Construct	Validity	Citation
Self-Assessment Manikin - Valence	Time 1	State valence	Con.	Bradley & Lang, 1994
Center for Epidemiologic Studies Depression Scale (CES-D)	Time 1	Depressive symptoms	Con. (R)	Radloff, 1977
PROMIS Social Isolation	Time 1	Social isolation	Con.	Cella et al., 2019; Hahn et al., 2014
Beliefs about Biological Differences between Blacks and Whites Scale	Time 1	False beliefs about biological differences between Black and White people	Dis.	Hoffman et al., 2016
Interpersonal Reactivity Scale
Perspective Taking	Time 1	Tendency to perspective-take	Dis.	Davis, 1980
Fantasy	Time 1	Tendency to transport into the feelings in characters in media	Dis.	Davis, 1980
Empathic Concern	Time 1	Tendency to feel sympathy and concerns for others’ misfortune	Dis.	Davis, 1980
Personal Distress	Time 1	Tendency to feel anxiety and unease in tense situations	Dis.	Davis, 1980
Self-Monitoring Scale	Time 1	Tendency to self-observe and control one’s behavior according to social appropriateness	Dis.	Snyder, 1974
Paradox Mindset Scale	Time 1	Tendency to accept and get excited by tensions	Dis.	Miron-Spektor et al., 2018
Integrative Complexity Scale	Time 1	Capacity to acknowledge the competing opinions	Dis.	Zhang et al., 2015
Multiple Identity Scale	Time 1	Membership to different social groups	Dis.	Haslam et al., 2008
Note. Con. = Convergent Validity. Dis. = Discriminant Validity. (R) = Reverse association.

4.3.1 Participants

All undergraduate students enrolled in an introductory psychology course were invited to complete a mass testing session for the psychology participant pool at the University of Delaware in 2018 Fall. Among those who accessed the survey website (1160 participants), 571 participants were randomly assigned to a questionnaire block that contained the Heart Manikin and thus included in this study.

4.3.2 Procedure and Materials

Participants answered an online questionnaire that included all the measures. Since the goal of Study 1 is to validate the Heart Manikin adapted from the Self-Assessment Manikin, I describe the Heart Manikin and the Self-Assessment Manikin in more detail below. For other measures, see Table 4.1 for the summary and Appendix A for the detailed descriptions.

Heart Manikin. I developed the Heart Manikin to measure a state belonging: how much a person feels cared for, accepted, loved, and connected at a given moment (Figure 1). The measure consisted of 5 mankins adopted from the valence item of the Self-Assessment Manikin (P. J. Lang, 1980). Each figure has a drawing of a heart. The size of the heart and the face of the manikin corresponds with belonging. The bigger the heart the manikin has, the more belonging. The scale had a horizontal bar below the manikin figures that presented 9 ticks, with ticks below and between the 5 figures. Participants were asked to indicate how they feel at the moment in this 9-point scale (“Please select the number that best corresponds to how you currently feel.”). I report the reliability and validity of the scale in the result sections.

Self-Assessment Manikin. The Self-Assessment Manikin is a 3-item measure of valence, arousal, and dominance (Bradley & Lang, 1994; P. J. Lang, 1980). Each scale had 5 manikin figures representing different levels of valence, arousal, and dominance. Participants responded how they currently feel on a 9-point scale: 1 = “unhappy, annoyed, unsatisfied, and bored” to 9 = “happy, pleased, satisfied, and hopeful” for valence, 1 = “relaxed, calm, sleepy, sluggish” to 9 = “excited, frenzied, wide-awake, and aroused” for arousal, and 1= “submissive, influenced, controlled by others” to 9 = “in control, important, dominant, autonomous” for dominance. The Self-Assessment Manikin has a good convergent validity with the existing verbal measures of valence, arousal, and dominance (Bradley & Lang, 1994). Study 1a only included the valence item.

4.3.3 Results

To test convergent and discriminant validities, I examined bivariate correlations between the Heart Manikin scores and the scores of the measures in Table 4.1. Results are presented in Figure 4.3 (see [Appendix] for the bivariate correlation table). Consistent with the prediction, the Heart Manikin scores correlated with the hypothesized measures for convergent validity: the Valence Manikin (r(569) = 0.71, p < .001, 95%CI [0.66, 0.75]), social isolation (r(564) = -0.60, p < .001, 95%CI [-0.65, -0.54]), CESD (r(569) = -0.58, p < .001, 95%CI [-0.63, -0.53]).

For the discriminant validity, I found mixed results. As predicted, the Heart Manikin scores did not correlate with the measures of overall interpersonal reactivity (r(567) = 0.01, p = .856, 95%CI [-0.07, 0.09]), perspective taking (r(568) = -0.00, p = .929, 95%CI [-0.09, 0.08]), fantasy (r(568) = -0.03, p = .494, 95%CI [-0.11, 0.05]), paradoxical mindset (r(567) = 0.03, p = .451, 95%CI [-0.05, 0.11]), or integrative complexity (r(566) = 0.02, p = .596, 95%CI [-0.06, 0.10]). However, the Heart Manikin scores correlated with the measures of empathy (r(567) = 0.16, p < .001, 95%CI [0.08, 0.24]), distress (r(567) = -0.12, p = .005, 95%CI [-0.20, -0.04]), multiple identity (r(566) = 0.19, p < .001, 95%CI [0.11, 0.27]), social monitoring (r(568) = -0.09, p = .040, 95%CI [-0.17, 0.00]), and beliefs in biological differences between Black and White people (r(568) = -0.11, p = .008, 95%CI [-0.19, -0.03]), contrary to the prediction.

For all correlation coefficients with a p-value larger than p = .05, I performed an equivalence test to examine if they were theoretically equivalent to zero. The 90% confidence intervals of the correlation coefficients for the interpersonal reactivity, paradoxical mindset, and integrative complexity all fell within the smallest effect size of interest (|r| = 0.17). Thus, I consider these coefficients as theoretically equivalent to zero.

Overall, these results suggest strong support for the convergent validity and moderate support for the discriminant validity of the Heart Manikin.

Figure 4.3: Study 1a - Forestplot Showing Correlation Coefficients with Heart Manikin Scores.

4.4 Study 1b (RAIv1)

This study was designed to test the relationship between interpersonal distress and immune function (Jaremka, unpublished). Table 4.2 shows a summary of the measures included in the study.

Table 4.2: Summary of Measures for Study 1b
Measure	Time	Construct	Validity	Citation
Self-Assessment Manikin - Valence	Time 1, 2, 3	State valence	Con.	Bradley & Lang, 1994
MacArthur Scale of Subjective Social Status	Time 1, 2, 3	Subjective social status	Dis.	Adler et al., 2000
PROMIS—Short Form 8a
Social Isolation	Time 1, 2, 3	Social isolation	Con. (R)	Cella et al., 2019; Hahn et al., 2014
Emotional Support	Time 1, 2, 3	Emotional support	Con.	Cella et al., 2019; Hahn et al., 2014
Informational Support	Time 1, 2, 3	Informational support	Con.	Cella et al., 2019; Hahn et al., 2014
Couples Satisfaction Index	Time 1, 2, 3	Romantic relationship quality	Con.	Funk & Rogge, 2007
Inclusion of Other in Self Scale	Time 1, 2, 3	Closeness between the self and the other person	Con.	Aron et al., 1992
Partner Responsiveness Scale	Time 1, 2, 3	Romantic partner responsiveness	Con.	Gable et al., 2012
Relationship Conflict Scale	Time 1, 2, 3	Conflicts in a romantic relationship	Con. (R)	Ad-hoc
Ostracism from Romantic Partner Scale	Time 1, 2, 3	Ostracism from a romantic partner	Con. (R)	Ad-hoc
Abusive Behavior Inventory—Revised
Psychological Abuse	Time 1, 2, 3	Perpetration of psychological abuse against a romantic partner	Dis.	Postmus et al., 2015
Physical Abuse	Time 1, 2, 3	Perpetration of physical abuse against a romantic partner	Dis.	Postmus et al., 2015
Controlling Behavior Scale—Modified
Economic Control	Time 1, 2, 3	Perpetration of economic control	Dis.	Graham-Kevan & Archer, 2003
Threats	Time 1, 2, 3	Perpetration of threats	Dis.	Graham-Kevan & Archer, 2003
Intimidation	Time 1, 2, 3	Perpetration of intimidation	Dis.	Graham-Kevan & Archer, 2003
Emotional Control	Time 1, 2, 3	Perpetration of emotonal control	Dis.	Graham-Kevan & Archer, 2003
Isolation	Time 1, 2, 3	Perpetration of isolation	Dis.	Graham-Kevan & Archer, 2003
Modified Food Craving Questionnaire—Trait Version	Time 1, 2, 3	Food craving	Dis.	Cepeda-Benito et al., 2000
Dietary Social Support Scale	Time 1, 2, 3	Support from one’s romantic partner about eating	Dis.	Ad-hoc
Body Image Questionnaire	Time 1, 2, 3	Body image	Dis.	Ad-hoc
Godin Leisure-Time Exercise Questionnaire	Time 1, 2, 3	Physical activity	Dis.	Godin, 2011; Godin & Shephard, 1985
PROMIS Sleep Disturbance—Short Form 4a	Time 1, 2, 3	Sleep disturbance	Dis.	Cella et al., 2019
Single-Item Narcissism Scale	Time 1, 2, 3	Narcissism	Dis.	Konrath et al., 2014
Perceived Stress Scale	Time 1, 2, 3	Perceived stress	Dis.	S. Cohen et al., 1983
Note. Con. = Convergent Validity. Dis. = Discriminant Validity. (R) = Reverse association.

4.4.1 Participants

One-hundred and seven participants participated in the study. Participants were eligible to participate if they were in a romantic relationship at the beginning of the study. Participants were recruited from the psychology participant pool at the University of Delaware. They received partial course credits as compensation.

The dataset contained data from 121 participants. Participants were eligible to participate if they were in a romantic relationship at the beginning of the study. Participants were recruited from the psychology participant pool at the University of Delaware. They received partial course credits as compensation. During the data inspection, I found that two participants had duplicate data points in the study One participant had 2 data points for the Visit 3 on the same date but no data for Visit 2. For this participant, I disregarded their data from later participation on that day. Another participant had participated twice for Visit 1 on different dates. I disregarded their data for the later participation for Visit 1.

4.4.2 Procedure and Materials

The study had three visits (Visits 1–3) with average intervals of 27.24 days between Visits 1 and 2, and 27.23 days between Visits 2 and 3, respectively. In each visit, participants came to a group testing room and answered all questionnaires. The Heart Manikin was identical to the ones used in Study 1a. See Table 4.2 for the summary of the measures and [Appendix] for detailed descriptions.

The following questionnaires included questions about their current romantic partner: the Couples Satisfaction Index (Funk & Rogge, 2007), the Inclusion of the Other in the Self Scale to one’s current romantic partner (Aron et al., 1992), the Partner Responsiveness Scale (Gable & Impett, 2012), the Relationship Conflict Scale, the Ostracism from Romantic Partner Scale, the Abusive Behavior Inventory-Revised (Postmus et al., 2015), the Controlling Behavior Scale (Graham-Kevan & Archer, 2003), and the Dietary Social Support Scale. In Visits 2 and 3, participants who are no longer in a relationship with a partner previously reported answered about both their relationship with a new romantic partner and their ex-partner.

4.4.3 Results

For testing convergent validity, I constructed a mixed model that predicted the Heart Manikin across time for each measure in Table 4.2. I included the fixed effects of Time (categorical; 1–3) and the scores of a given measure (centered). I first included the random intercept and the random effect of Time. However, the model failed to converge with the Time random effect, and thus I dropped the Time random effect. Figure 4.4 shows regression coefficients for each measure predicting the Heart Manikin scores after controlling for the fixed effect of Time.

Convergent Validity. Consistent with the predictions, all convergent measures showed evidence for convergent validity: valence manikin (B = 0.57, SE = 0.04, t = 12.78, p < .001), partner responsiveness (B = 0.39, SE = 0.05, t = 7.35, p < .001), inclusion of the other in the self (B = 0.29, SE = 0.06, t = 5.11, p < .001), informational support (B = 0.34, SE = 0.05, t = 6.20, p < .001), emotional support (B = 0.43, SE = 0.05, t = 8.77, p < .001), couples satisfaction (B = 0.44, SE = 0.05, t = 8.68, p < .001), social isolation (B = -0.44, SE = 0.05, t = -7.95, p < .001), partner ostracism (B = -0.38, SE = 0.05, t = -7.14, p < .001), relationship conflict (B = -0.25, SE = 0.05, t = -4.78, p < .001), and depression (B = -0.52, SE = 0.05, t = -10.07, p < .001). These results suggest a strong support for the convergent validity for the Heart Manikin.

Discriminant Validity. Out of the 13 measures for the discriminant validity, 9 measures did not correlate with the Heart Manikin scores, supporting the discriminant validity: sleep quality (B = 0.03, SE = 0.06, t = 0.56, p = .578), socioeconomic status (B = 0.13, SE = 0.06, t = 1.97, p = .050), psychological abuse perpetration (B = -0.17, SE = 0.06, t = -2.70, p = .007), physical abuse perpetration (B = -0.02, SE = 0.06, t = -0.34, p = .735), isolation control (B = -0.11, SE = 0.06, t = -1.86, p = .064), intimidation control (B = -0.04, SE = 0.05, t = -0.73, p = .464), emotional control (B = -0.09, SE = 0.06, t = -1.59, p = .113), economic control (B = -0.09, SE = 0.06, t = -1.61, p = .108), craving (B = -0.07, SE = 0.07, t = -1.02, p = .310), and body image (B = -0.02, SE = 0.07, t = -0.27, p = .786). Contrary to the prediction, 3 measures correlated with the Heart Manikin: threats control (B = -0.09, SE = 0.06, t = -1.59, p = .113), stress (B = -0.42, SE = 0.06, t = -7.62, p < .001), and narcissism (B = -0.14, SE = 0.06, t = -2.27, p = .024).

For all coefficients with a p-value greater than .05, I ran an equivalence test to test if they were theoretically equivalent to zero (Figure 4.4). Results showed that the correlation coefficients with the measures of sleep quality, physical abuse, intimidation, and body image were theoretically equivalent to zero (using |r| = 0.17). The correlation coefficients with the measure of economic control, emotional control, and craving were not equivalent to zero or different from zero, and thus I interpret the results for these measures as ambiguous. Overall, the current results suggest a moderate discriminant validity of the Heart Manikin scores, especially against measures of sleep quality, physical abuse, intimidation, and body image.

Figure 4.4: Study 1b - Forestplot Showing Correlation Coefficients with Heart Manikin Scores

Test-Retest Reliability. To explore the test-retest reliability, I constructed an unconditional mixed model predicting the Heart Manikin scores over Time, and interpreted its intraclass correlation (ICC) as a measure of reliability. The obtained ICC was 0.33 (average interval between visits = 36.3 days). The ICC indicates a poor reliability according to the guideline (Koo & Li, 2016). These results suggest that participants reported different levels of Heart Manikin scores across the visits. Note that the low reliability does not imply that the scale performed well or poorly, since the Heart Manikin scale was meant to measure fluctuations over time.

4.5 Study 1c (ARv1)

Study 1c was designed to test whether social rejection by a close other threatens belonging more than social rejection by a stranger (Nadzan et al., 2019). Table 4.3 summarizes the measures used in the study.

Table 4.3: Summary of Measures for Study 1c
Measure	Time	Construct	Validity	Citation
Self-Assessment Manikin
Valence	Times 1, 2, & 3	State valence	Con.	Bradley & Lang, 1994
Arousal	Time 2	State arousal	Dis.	Bradley & Lang, 1994
Dominance	Time 2	State dominance	Dis.	Bradley & Lang, 1994
Modified Need-Threat Scale—Essay Version
Belonging	Time 2	Belonging	Con.	Williams, 2009
Self-Esteem	Time 2	Self-esteem	Con.	Williams, 2009
Control	Time 2	Control	Con.	Williams, 2009
Meaningful Existence	Time 2	Meaning existence	Con.	Williams, 2009
MacArthur Scale of Subjective Social Status	Time 3	Subjective social status	Dis.	Adler et al., 2000
Note. Con. = Convergent Validity. Dis. = Discriminant Validity. (R) = Reverse association.

4.5.1 Participants

Two-hundred ninety-two participants were recruited from Amazon Mechanical Turk (MTurk). Participants received $1.50 for participation.

4.5.2 Procedure and Materials

The study was a 2 (Social Rejection: Rejection vs. Acceptance) x 2 (Essay Target: Stranger vs. Close Friend) design. Participants provided informed consent and completed the Heart Manikin (Time 1) and the Time 1 measures (see Table 4.3). Then, participants were randomly assigned to one of the five essay conditions. In the stranger rejection condition, participants wrote about a time when they felt rejected by a stranger. In the close friend rejection condition, participants wrote about a time when they felt rejected by a close friend. In the stranger acceptance condition, participants wrote about a time when they felt accepted by a stranger. In the close friend acceptance, participants wrote about a time when they felt accepted by a close friend. Participants wrote the essay for 5 minutes. After the essay task, participants answered the Heart Manikin, the original Self-Assessment Manikin, and the Need-Threat Scale at Time 2. Then, participants indicated the characteristics of the person that they described in the essay task, unrelated to the current scale validation. Next, participants answered the Heart Manikin and the valence Self-Assessment Manikin and further questions about the person in the essay task at Time 3. Then, participants again answered the valence Self-Assessment Manikin and the Heart Manikin, and the demographics at Time 4 (Bradley & Lang, 1994). See [Appendix] for the detailed descriptions of these measures.

4.5.3 Results

Convergent and Discriminant Validities. To test convergent discriminant validities, I first examined the bivariate correlations between the Heart Manikin and the included measures in Table 4.3. Detailed results are available in [Appendix]. Here, I report the results with the socioeconomic status, since I detail results for the other measures in the mixed models below. Contrary to the prediction, the socioeconomic status scores correlated with the Heart Manikin scores r(288) = 0.18, p = .002, 95%CI [0.06, 0.29], suggesting that the Heart Manikin scores did not discriminate against the subjective socioeconomic status.

To test convergent and discriminant validities after controlling for the manipulations, I examined an association between the Heart Manikin scores and Time 2 measures. To do so, I constructed regression models predicting the Time 2 Heart Manikin scores for each Time 2 measure. I included the following predictors in the model: the given Time 2 measure, Social Rejection (-0.5 = Rejection, 0.5 = Acceptance), Essay Target (-0.5 = Stranger, 0.5 = Close Friend), and Social Rejection x Essay Target. Results showed that all indicators for convergent validity all predicted the Heart Manikin Scores: the valence manikin (B = 0.86, SE = 0.03, t = 24.85, p < .001), self-esteem (B = 0.57, SE = 0.05, t = 11.60, p < .001), belonging (B = 0.79, SE = 0.06, t = 14.03, p < .001), control (B = 0.33, SE = 0.04, t = 7.38, p < .001), meaningful existence (B = 0.51, SE = 0.05, t = 10.08, p < .001), and overall need-threat (B = 0.78, SE = 0.05, t = 15.26, p < .001), supporting the convergent validity. Contrary to the prediction, the discriminant validity indicators also co-varied with the Heart Manikin as well: arousal (B = 0.18, SE = 0.04, t = 4.62, p < .001), dominance (B = 0.56, SE = 0.04, t = 15.64, p < .001). See Figure 4.5 for the forest plot.

Figure 4.5: Study 1c - Regression Coefficients at Time 2

Since the Heart Manikin and the Valence Manikin were measured over time, I created a linear mixed model to examine convergent and discriminant validities of the Heart Manikin with the valence Self-Assessment Manikin item across Times 1–3. I created a dummy variable (Grouping Dummy) representing the four experimental groups in the study to facilitate interpretation. The fixed predictors were Time, Valence Manikin, Grouping Dummy, and Grouping Dummy x Time. Results showed that the valence scores predicted the Heart Manikin scores (B = 0.70, SE = 0.02, t = 29.34, p < .001) after controlling for the effects of manipulations and Time. I interpret the results as a strong support for the convergent validity between the Valence Manikin and the Heart Manikin.

Sensitivity to Experimental Manipulation. To test the sensitivity of the Heart Manikin scores to the social rejection manipulation, I ran a Welch’s t-test comparing the rejection and acceptance conditions at Time 2. Results showed that the rejected participants reported lower Heart Manikin scores (M = 2.97, SD = 2.11) than the accepted participatns (M = 7.28, SD = 1.95) at Time 2, t(286.8) = -18.06, p < .001, d = -2.12, 95%CI [-2.41, -1.83] (see Figure 4.6). Also, see Figure B.3 in [Appendix] for the Heart Manikin scores over time across conditions.

Study 1c - Heart Manikin Scores across Rejection Conditions. Participants in the rejection condition reported lower Heart Manikin scores than those in the acceptance condition.

Figure 4.6: Study 1c - Heart Manikin Scores across Rejection Conditions. Participants in the rejection condition reported lower Heart Manikin scores than those in the acceptance condition.

Test-Retest Reliability. To test test-retest reliability after controlling for effects from experimental manipulations and time, I calculated the ICC using the same linear mixed model for testing convergent and discriminant validities without the valence Manikin term. The obtained ICC was 0.44, which indicates poor reliability according to the criteria (Koo & Li, 2016). These results suggest that the Heart Manikin scores changed across time points in the study even after controlling for the effects of manipulation and time. Again, poor reliability does not imply the poor quality of the measure since the Heart Manikin should be able to capture fluctuations of belonging as a state measure.

4.6 Study 1d (EVv1)

This study was designed to test whether people who expect social rejection show decreased cardiovascular threat response to social rejection (Sunami et al., unpublished). The preregistration of the original study is available at OSF (https://osf.io/4xn52) before data collection. Table 4.4 summarizes the measures used in the study.

Table 4.4: Summary of Measures for Study 1d
Measure	Time	Construct	Validity	Citation
Self-Assessment Manikin
Valence	Times 1, 2, 3, & 4	State valence	Con.	Bradley & Lang, 1994
Arousal	Times 1, 2, 3, & 4	State arousal	Dis.	Bradley & Lang, 1994
Dominance	Times 1, 2, 3, & 4	State dominance	Dis.	Bradley & Lang, 1994
Rosenberg Self-Esteem Scale	Time 1	Self-esteem	Con.	Rosenberg, 1965a
Need for Closure Scale	Time 1	Desire for an answer on any topic	Dis.	Roets & Van Hiel, 2011
Modified Need-Threat Scale
Belonging	Times 3 & 4	Belonging	Con.	Williams, 2009
Self-Esteem	Times 3 & 4	Self-esteem	Con.	Williams, 2009
Control	Times 3 & 4	Control	Con.	Williams, 2009
Meaningful Existence	Times 3 & 4	Meaning existence	Con.	Williams, 2009
Social Judgment Survey	Time 4	Adherence to the traditional cultural values	Dis.	Proulx & Heine, 2008; Rosenblatt et al., 1989
Note. Con. = Convergent Validity. Dis. = Discriminant Validity. (R) = Reverse association.

4.6.1 Participants

Two-hundred thirty-seven participants were recruited for the study. A debriefing coding procedure determined that 53 participants had either had suspicions or figured out the hypothesis of the study, and thus they were excluded. The final dataset consisted of 184 participants.

4.6.2 Procedure and Materials

Participants provided informed consent and wore electrocardiograph electrodes and a blood pressure cuff to the participant for cardiovascular recording, unrelated to the current scale validation. Then, participants completed demographics, the Heart Manikin (Time 1), and the Time 1 questionnaires in Table 4.4. Then, participants completed the participant desire manipulation similar to Study 1c and answered the Self-Assessment Manikin and the Heart Manikin (Time 2). Then, participants heard an audio recording ostensibly recorded by their confederate, serving as a rejection manipulation. In the rejection condition, the confederate said that the participant was not their type. In the acceptance condition, the confederate said that the participant was their type. After hearing the recording, participants completed the modified Need-Threat Scale (Williams, 2009), the Self-Assessment Manikin, and the Heart Manikin (Time 3). Then, participants completed a word-finding task with the confederate, unrelated to the current validation, the Heart Manikin (Time 4), and the Time 4 questionnaires in Table 4.4. See [Appendix] for the detailed descriptions of these measures.

4.6.3 Results

Convergent and Discriminant Validities. To test convergent and discriminant validities, I fist examined the bivariate correlations between the Heart Manikin and the included measures (Table 4.4). Detailed results are available in [Appendix]. I focus on the results of the non-repeated measures (self-esteem and social judgment survey) here since the results for the measures repeated throughout the study are reported in the mixed models below. Results showed that the self-esteem scores at Time 1 significantly correlated with the Heart Manikin scores at Time 1 (r(239) = 0.57, p < .001, 95%CI [0.48, 0.65]), supporting the hypothesized convergent validity. The social judgment survey scores at Time 4 did not correlate with the Heart Manikin scores at Time 4 (r(235) = -0.05, p = .466, 95%CI [-0.17, 0.08]), supporting the hypothesized discriminant validity.

For the Self-Assessment Manikin items measured across Times 1–4, I constructed a linear mixed model predicting the Heart Manikin. I included the fixed effect of a measured score (centered), Time (categorical), Confederate Desire (.5 = high, -.5 = low), Rejection (.5 = rejection, -.5 = acceptance), Time x Confederate Desire, Time x Rejection, Confederate Desire x Rejection, and Time x Confederate Desire x Rejection. I interpret the coefficient for the measured score as evidence for convergent or discriminant validity after controlling for the manipulations and timing of measurements. Results showed that valence scores predicted the Heart Manikin scores (B = 0.42, SE = 0.02, t = 18.49, p < .001, Figure 4.7), consistent with the hypothesized convergence. Contrary to the prediction, arousal (B = 0.18, SE = 0.03, t = 6.08, p < .001) and dominance (B = 0.41, SE = 0.03, t = 14.81, p < .001) also predicted the Heart manikin scores in these models.

Figure 4.7: Study 1d - Standardized Regression Coefficients Predicting Heart Manikin Scores after Controlling for Manipulations and Time

Sensitivity to Experimental Manipulation. To test the sensitivity of the Heart Manikin to experimental manipulation, I ran Welch’s t-test comparing the rejected and accepted participants at Time 3. Results showed that rejected participants (M = 5.77, SD = 2.01) reported lower Heart Manikin scores than accepted participants (M = 6.88, SD = 1.39, t(205.5) = 4.95, p < .001; see Figure 4.8). Also, see Figure B.7 in [Appendix] for the Heart Manikin scores over time across conditions.

Study 1d - Sensitivity of Heart Manikin to Rejection Manipulation. Rejected participants reported lower Heart Manikin scores than accepted participants.

Figure 4.8: Study 1d - Sensitivity of Heart Manikin to Rejection Manipulation. Rejected participants reported lower Heart Manikin scores than accepted participants.

Test-Retest Reliability. To test test-retest reliability, I created a linear mixed model predicting the Heart Manikin scores. The fixed predictor was the same as the model above testing convergent and discriminant validity except that the model did not include the measured score predictor. Results showed that the calculated ICC was 0.75, suggesting a moderate-to-good reliability of the Heart Manikin measure across Times 1 to 4.

Across Studies 1a, 1b, 1c, and 1d, I validated the Heart Manikin measure. In the next study, I continued the validation. In addition, I aimed to examine the effectiveness of the rejection manipulation that I planned to use for the subsequent studies.

4.7 Study 1e (NPSv2)

The original research question of this study was to test the reconnection hypothesis—whether the prospect of fulfilling belonging influences social responses to rejection (Sunami et al., 2019). The study was pre-registered before data collection (https://osf.io/xpr6b). Table 4.5 shows a summary of the measures included in this study.

Table 4.5: Summary of Measures for Study 1e
Measure	Time	Construct	Validity	Citation
Self-Assessment Manikin
Valence	Times 1, 2, 3, 4, 5, & 6	State valence	Con.	Bradley & Lang, 1994
Arousal	Times 1, 2, 3, 4, 5, & 6	State arousal	Dis.	Bradley & Lang, 1994
Dominance	Times 1, 2, 3, 4, 5, & 6	State dominance	Dis.	Bradley & Lang, 1994
Experiences in Close Relationships Scale—Short Form
Avoidance	Time 1	Attachment avoidance	Con. (R)	Wei et al., 2007
Anxiety	Time 1	Attachment anxiety	Con. (R)	Wei et al., 2007
Fear of Negative Evaluation Scale—Brief Version	Time 1	Apprehension in expecting negative judgment from others	Con. (R)	Leary, 1983
Rosenberg Self-Esteem Scale	Time 1	Self-esteem	Con.	Rosenberg, 1965a
MacArthur Scale of Subjective Social Status	Time 1	Subjective social status	Dis.	Adler et al., 2000
Rejection Sensitivity Questionnaire—Short Version	Time 1	Rejection sensitivity	Con. (R)	Downey & Feldman, 1996; Romero-Canyas et al., 2010
Modified Need-Threat Scale
Belonging	Time 3	Belonging	Con.	Williams, 2009
Self-Esteem	Time 3	Self-esteem	Con.	Williams, 2009
Control	Time 3	Control	Con.	Williams, 2009
Meaningful Existence	Time 3	Meaning existence	Con.	Williams, 2009
Note. Con. = Convergent Validity. Dis. = Discriminant Validity. (R) = Reverse association.

4.7.1 Procedure and Materials

The study was a 2 (Participant Desire; low vs. high) x 2 (Confederate Desire, low vs. high) x 2 (Social Rejection, rejection vs. control) design. I only describe the procedure and measures relevant to the current validation in detail here. More detailed descriptions are available in a published study (Sunami et al., 2019). On Day 1, participants answered the Heart Manikin and the Time 1 questionnaires (see Table 4.5). Then, participants completed the manipulation for the participants’ desire to affiliate with the confederate (Participant Desire) and answered the Heart Manikin and the Self-Assessment Manikin (Time 2). Participants then completed the manipulation of the confederate’s desire to affiliate with the participant (Confederate Desire) and answered the Modified Need-Threat Scale (Nadzan et al., 2019; Williams, 2009), the Self-Assessment Manikin, the Heart Manikin (Time 3). The manipulations for Participant Desire and Confederate Desire are unrelated to the current validation, and details are available in a published study (Sunami et al., 2019).

4.7.2 Participants

This study was a two-day study (separated by 6.22 days on average), with 674 participants on Day 1 and 605 participants on Day 2. A debriefing coding procedure determined that 67 participants had either had suspicions or figured out the hypothesis of the study, and thus they were excluded. The final analytic sample consisted of 538 participants.

Rejection Essay Manipulation. On Day 2, participants completed the Self-Assessment Manikin and the Heart Manikin (Time 4). Then, participants completed the social rejection manipulation essay where they were randomly assigned to either a rejection condition or a control condition (adapted from Twenge & Campbell, 2003). All participants spent 5 minutes writing the essay. In the rejection condition, participants wrote about a time when they felt rejected by a person or a group of their own age (excluding romantic rejection) for 5 minutes:

We’d like you to write about a time when you felt rejected or excluded by a person or a group about your own age. By “felt rejected” we mean that you felt like a person or persons did not value you or your relationship. That is, describe an episode in which you wanted to spend time with or do something with someone, and that person or persons did not let you do so. Make sure to be as detailed as possible and describe not only what happened, but also how you felt. If the rejection is by an organized group of people, make sure it is of people about your same age. For example, being rejected from a college or job is NOT what we are asking about. Please do NOT describe a romantic rejection, if possible.

In the control condition, participants wrote about their yesterday morning:

We’d like you to write about your morning yesterday. Please describe what you did yesterday morning. Make sure to be as detailed as possible and describe not only what happened, but also how you felt.

After naming their social surrogate and non-social surrogate video games, participants completed the social rejection essay task that I validate in Study 1e (Sunami et al., 2019). All participants wrote about a time when they felt rejected by a person or a group of their own age (excluding romantic rejection) for 5 minutes:

We’d like you to write about a time when you felt rejected or excluded by a person or a group about your own age. By “felt rejected” we mean that you felt like a person or persons did not value you or your relationship. That is, describe an episode in which you wanted to spend time with or do something with someone, and that person or persons did not let you do so. Make sure to be as detailed as possible and describe not only what happened, but also how you felt. If the rejection is by an organized group of people, make sure it is of people about your same age. For example, being rejected from a college or job is NOT what we are asking about. Please do NOT describe a romantic rejection, if possible.

After writing the essay, participants answered the Self-Assessment Manikin and the Heart Manikin, and the Need-Threat Scale (Time 5), completed experimental tasks unrelated to the current study (Sunami et al., 2019), and again answered the Self-Assessment Manikin and the Heart Manikin (Time 6).

4.7.3 Results

Convergent and Discriminant Validities. To test convergent and discriminant validities, I first examined bivariate correlations between the Heart Manikin scores and scores of the measures (Table 4.5). I focus on the results for the subjective socioeconomic status here since the results for the other repeated measures are reported in more detail in the mixed model analyses below. Results showed that the subjective socioeconomic status did not correlate with the Heart Manikin (r(536) = 0.05, p = .289, 95%CI [-0.04, 0.13]). An equivalence test showed that the 90% confidence interval of this correlation coefficient fell within the smallest effect size of interest (|r| = 0.17), suggesting that the observed coefficient was theoretically equivalent to zero (r(536) = 0.05, p = .289, 90%CI [-0.03, 0.12]). These results support the hypothesized discriminant validity of the Heart Manikin against subjective socioeconomic status

For the Self-Assessment Manikin scores measured over Times 1–6 and the Need-Threat scores over Times 3 and 5, I constructed a linear mixed model predicting the Heart Manikin scores. I created a dummy categorical variable (Grouping Dummy) representing the four experimental conditions for Participant Desire and Confederate Desire to reduce the number of interactions in the model (coded as 0-3). I included the fixed effects of the measured scores (centered), Time (categorical), Dummy for the Participant Desire and Confederate Desire conditions (categorical), social rejection (rejected = -.5, control = .5), Grouping Dummy, Grouping Dummy x Rejection, Rejection x Time, Rejection x Grouping Dummy, Rejection x Time x Grouping Dummy. Results showed that the Valence Manikin, belonging, self-esteem, control, and meaningful existence scores predicted the Heart Manikin scores after controlling for the manipulations and time, supporting the hypothesized convergent validity as expected (Valence: B = 0.35, SE = 0.01, t = 24.02, p < .001; Belonging: B = 0.57, SE = 0.03, t = 21.00, p < .001; Control: B = 0.30, SE = 0.03, t = 10.72, p < .001; Meaningful Existence: B = 0.46, SE = 0.03, t = 17.62, p < .001; Overall Need-Threat: B = 0.57, SE = 0.03, t = 21.64, p < .001). However, the Arousal and Dominance Manikin scores also predicted the Heart Manikin scores, contrary to the expectation (Arousal: B = 0.11, SE = 0.02, t = 7.04, p < .001; Dominance: B = 0.34, SE = 0.02, t = 21.07, p < .001). Overall, the mixed model analyses showed a strong support for the convergent validity of the Heart Manikin, but no support for the discriminant validity with the Arousal and Dominance Manikins.

Figure 4.9: Study 1e - Forestplot Showing Regression with Heart Manikin Scores from the Mixed Model

Sensitivity to Experimental Manipulation. To test the sensitivity of the Heart Manikin scores to a social rejection manipulation, I ran a Welch’s t-test comparing the rejected and control groups following the social rejection manipulation at Time 5 (right after the social rejection manipulation). The rejected participants (M = 6.47, SD = 1.88) reported lower Heart Manikin scores than the control participants (M = 6.78, SD = 1.55, t(518.8) = 2.11, p = .036 , d = 0.18, 95%CI [0.01, 0.35], see Figure 4.10).

Figure 4.10: Study 1e - Heart Manikin Scores.

Test-Retest Reliability. To test test-retest reliability, I created a linear mixed model predicting the Heart Manikin scores across measurements. I included the same fixed effects as the model above testing the convergent and discriminant validity except that the model did not include a predictor for a measured score. The calculated ICC was 0.72, indicating a moderate test-retest reliability after controlling for the effects of the experimental manipulations.

Effectiveness of Rejection Manipulation on Belonging. To test the effectiveness of the rejection manipulation on belonging, I performed a series of Welch’s t-tests on the belonging subscale of the Need-Threat Scale at Time 5 (Figure 4.11). Results showed that participants in the rejection condition reported lower belonging (M = 79.13, SD = 16.68) than those in the control condition (M = 82.04, SD = 14.18, t(521.6) = 2.18, p = .030, d = 0.19, 95%CI [0.02, 0.36]), indicating that the manipulation was effective. See Figure B.7 in [Appendix] for the Heart Manikin scores over time across conditions.

Figure 4.11: Study 1e - Need-Threat Scores Across Rejection and Control Conditions

I also ran Welch’s t-tests on other subscales of the need-threat scale (self-esteem, control, and meaningful existence). Results showed that rejected participants and control participants did not report different levels of self-esteem, control, meaningful existence, or the overall need-threat (self-esteem: t(524.4) = 1.71, p = .088, d = 0.15, 95%CI [-0.02, 0.32], control: t(536.0) = -0.63, p = .531, d = -0.05, 95%CI [-0.22, 0.12], meaning: t(527.8) = 1.78, p = .075, d = 0.15, 95%CI [-0.02, 0.32], overall need-threat: t(524.9) = 1.46, p = .145, d = 0.13, 95%CI [-0.04, 0.30]). These results suggest that although the rejection manipulation was effective in inducing lowered belonging, it may not have been effective in lowering self-esteem, control, meaningful existence, or overall fundamental need.

4.8 Deciding a Rejection Manipulation for Subsequent Studies

The effect size for the rejection manipulation of Study 1e was small (d = 0.18, 95%CI [0.01, 0.35]). Also, the manipulation did not lower the fundamental needs that usually track with the rejection manipulations in the other studies (Williams et al., 2005). These results raised a concern about the effectiveness of the rejection manipulation. Since I plan to use the rejection manipulation without a control or acceptance condition, I wanted to ensure the effectiveness of the manipulation. These results implied that the manipulation used in Study 1e may not be optimal to use for the purpose of this dissertation.

Study 1c included a different version of the essay rejection manipulation, which showed a large effect size when compared with an acceptance condition (d = -2.12, 95%CI [-2.41, -1.83]). However, I cannot directly compare this effect size with the effect size obtained in Study 1e since Study 1c contrasted rejection with acceptance condition. To make the effect size comparable as possible, I ran a pared t-test comparing the the Heart Manikin scores before and after the rejection manipulation (Times 1 vs. 2) only among the rejected participants. Results showed that rejected participants reported lower belonging at Time 2 than Time 1 (t(145.0) = 17.49, p < .001, d = 1.45, 95%CI [1.22, 1.68]). The obtained effect size in Study 1c’s manipulation was nearly 8 times larger than the effect size of Study 1e’s manipulation. Although the two effect sizes may not be directly comparable due to the difference in the study designs (within-subject for Study 1c and between for Study 1e), the magnitude of the difference was concerning given that both studies shared the same outcome measure (Heart Manikin). Overall, these results strongly indicated that Study 1c’s manipulation was more effective in manipulating belonging than Study 1e. To ensure the effectiveness of the rejection manipulation used in Studies 2 and 3, I have decided to use the rejection procedure in Study 1c instead of Study 1e’s procedure.

4.9 Discussion

Across 5 studies (1200+ participants), I examined the convergent validity, discriminant validity, test-retest reliability, and sensitivity to social rejection manipulation the Heart Manikin. Overall, I found a strong support for the convergent validity and sensitivity to social rejection manipulation. On the other hand, I found mixed evidence for discriminant validity.

Convergent Validity. The Heart Manikin scores correlated with the hypothesized convergent measures: belonging, self-esteem, control, and meaningful existence needs (Studies 1c, 1d, and 1e), social isolation (Study 1a), interpersonal relationship quality and conflict (Study 1b), depression (Studies 1a and 1b), and valence (Studies 1a, 1b, 1c, 1d, and 1e). The current results strongly support that the Heart Manikin scores converge with belonging and its associated measures.

Discriminant Validity. I found mixed results for the discriminant validity of the Heart Manikin scores. On one hand, the Heart Manikin scores showed evidence for discrimant validity against interpersonal reactivity, paradoxical mindset, self-monitoring, integrative complexity, sleep quality, physical abuse perpetration, intimidation perpetartion, emotional control, economic control, food craving, and body image. On the other hand, the Heart Manikin scores correlated with the measures of multiple identity (Study 1a), social monitoring (Study 1a), beliefs in biological differences between Black and White people (Study 1a), perpetration of threats against one’s partner (Study 1b), stress (Study 1b), narcissism (Study 1b), arousal (Studies 1c and 1e), and dominance (Studies 1c and 1e). The discriminant validity against the socioeconomic status was particularly mixed. In Studies 1b and 1e, I found that the Heart Manikin discriminated against socioeconomic status. In Study 1c, the Heart Manikin scores correlated with socioeconomic status, contrary to the prediction.

Although this is result-dependent reasoning, I realize that some of these discriminant measures can converge with belonging. For example, people with multiple identities could report more belonging since they belong to multiple groups, people with higher social monitoring can cultivate social connections easily, and people who do not threaten their partner experience more loving interactions. I do not have post-hoc explanations for why people with higher Heart Manikin scores reported lower narcissism, less biological beliefs in differences between Black and White people, and higher socioeconomic status.

Note that some of the observed associations can be attributed to Type I error. For example, I found the association between the socioeconomic status and the Heart Manikin in Study 1c, but not Studies 1b and 1e, adding to the possibility of Type I error. In contrast, this possibility of Type I error is less likely for arousal and dominance since I observed the associations for these measures across two studies (Studies 1c and 1e).

Test-Retest Reliability. I observed test-retest reliability of 0.33 in Study 1b (measured 3 times separated by 36.3days on average), 0.44 in Study 1c (measured 3 times in a 15-minute study), 0.75 in Study 1d (EVv1) (measured 4 times in an 1-hour study) , and 0.72 in Study 1e (measured 6 times in two 30-minute experimental studies separated by 6.22 days on average). The reliability ranged from poor to moderate, which suggests that the Heart Manikin scores vary relatively considerably across time, and thus may be suitable to be used as a state measure, rather than a trait measure.

Sensitivity to Social Rejection Manipulation. In Studies 1c, 1d, and 1e, I tested the sensitivity to social rejection of the Heart Manikin scores. The results showed that rejected participants reported lower Heart Manikin scores across the studies. I conclude that the Heart Manikin scores are sensitive to social rejection manipulation.

Overall, I suggest that Study 1 supported the convergent validity of the Heart Manikin. The results for the discriminant validity was mixed. I suggest that the Heart Manikin scores track belonging well but do not necessarily distinguish belonging from other constructs, including general stress, arousal, and dominance. Given the promising results of the convergent validity, I decided to use the Heart Manikin sores as key outcome variable in my subsequent studies.

The current study has constraints on generality. First, I used existing data for the current validation. As a result, the sample sizes of the studies were not based on a proper power analysis, making the results susceptible to Type I and Type II errors. In addition, the measures in the studies were not a priori selected to validate the Heart Manikin. Second, the sample demographics were limited to undergraduate students from an introductory psychology course at the University of Delaware. Most participants were young and predominantly White, and thus I do not know if the current results generalize to other populations with different characteristics. Despite the shortcomings, the current study included 65 measures from 5 studies, and thus maximizing the generality across measures and samples.