Smaller Is Better: Associations Between Personality and Demographics Smaller Is Better: Associations Between Personality and Demographics Are Improved by Examining Narrower Traits and Regions Are Improved by Examining Narrower Traits and Regions

The personality of individuals is clustered by geographic regions; a resident of a region is more similar to another resident than to a random non-resident. Research in geographical psychology often has focused on this clustering effect in broad regions, such as countries and states, using broad domains of personality, such as the Big Five. We examined the extent to which (a) a narrower geographic unit, the U.S. ZIP Code, accounted for more variance explained in aggregating personality than a broader region, the U.S. state; and (b) progressively narrower personality traits (domains, facets, and nuances, respectively) provided more specificity in describing personality-demographic relationships. Results from this study (n participants = 39,886, n zipcodes = 2,074) indicated that the variance explained by aggregating personality was multiple times as large for U.S. ZIP Codes than for states (median = 4.4). At the level of personality domains, ZIP Code population density and income disparity were positively correlated with Openness and negatively correlated with Conscientiousness and Agreeableness. Facets within each domain were differentially correlated with each demographic, which demonstrated that facets added specificity to the personality-demographic relationships beyond that of domains. Item-level analysis revealed the most specific finding: higher population density and income disparity were associated with politically liberal attitudes and beliefs of self-exceptionalism, while lower density

The personality of individuals is clustered by geographic regions; a resident of a region is more similar to another resident than to a random non-resident. Research in geographical psychology often has focused on this clustering effect in broad regions, such as countries and states, using broad domains of personality, such as the Big Five. We examined the extent to which (a) a narrower geographic unit, the U.S. ZIP Code, accounted for more variance explained in aggregating personality than a broader region, the U.S. state; and (b) progressively narrower personality traits (domains, facets, and nuances, respectively) provided more specificity in describing personality-demographic relationships. Results from this study (n participants = 39,886, n zipcodes = 2,074) indicated that the variance explained by aggregating personality was multiple times as large for U.S. ZIP Codes than for states (median = 4.4). At the level of personality domains, ZIP Code population density and income disparity were positively correlated with Openness and negatively correlated with Conscientiousness and Agreeableness. Facets within each domain were differentially correlated with each demographic, which demonstrated that facets added specificity to the personality-demographic relationships beyond that of domains. Item-level analysis revealed the most specific finding: higher population density and income disparity were associated with politically liberal attitudes and beliefs of self-exceptionalism, while lower density and income disparity were associated with authoritarian attitudes and concern for abiding by rules and laws. Findings suggest that future studies in geographical and personality psychology could benefit from using the narrowest feasible unit of analysis.
Over the last two decades, findings in geographical psychology have suggested that the personalities of individuals are clustered by geographic regions (Rentfrow & Jokela, 2016). That is, a resident of a region typically has a more similar personality to fellow residents than non-residents. Additionally, the aggregated personalities of regions are associated with important region-level demographics, such as rates of chronic disease (Pesta et al., 2012) and crime (Rentfrow et al., 2008). These results have been found for regions of varying sizes: countries (McCrae & Terracciano, 2008), large regions within a country (e.g., New England and the Midwest; Rentfrow et al., 2013), smaller regions within a country (e.g., U.S. states; Rentfrow et al., 2008), cities (Park & Peterson, 2010), and even neighborhoods within a city (Jokela et al., 2015).
There are at least three theories regarding why personality is aggregated by geographic regions and why regions' personalities are correlated with their demographics (for a full review, see Rentfrow et al., 2008). First, environmental factors may shape residents' thoughts, feelings, behaviors, and beliefs in consistent ways. For example, a neighborhood's income inequality may influence its residents' risktaking behavior, with greater inequality causing riskier behavior (Kawachi et al., 1997;Payne, 2017;Wilson & Daly, 1997). Second, similar individuals may migrate to the same region due to opportunities there. For instance, because cities, compared to small towns, create more opportunities for hedonistic and faster-paced lifestyles, they may also attract individuals who exhibit more antisocial personality traits (Jonason, 2018). A third theory posits that social influence may also play a role; social interactions with personality-typical residents may persuade new residents toward the prevailing beliefs and attitudes of the region (Rentfrow, 2010;Rentfrow et al., 2008).
The typical method of aggregating individual personality scores into regional scores is to calculate a mean personality score for each region. Total personality variance is composed of within-region variance (how much individual personalities vary within a region) and between-region variance (how much aggregated regional personalities vary). The amount of total variance explained by regional residence is expressed as a ratio of between-region variance over total variance (Bliese, 2000). Most geographical psychology studies have not reported this ratio, but a recent replication study found that across seven samples, U.S. state residence accounted for 0.3% of individual variance of Big Five personality, a small but reliable effect (Elleman et al., 2018). U.S. ZIP Codes may better aggregate personality than U.S. ZIP Codes may better aggregate personality than states states Regions as large as U.S. states may account for such a small percentage of personality variance because each region does not contain an especially homogeneous population. A state can encompass millions of people who are spread out over tens of thousands of square miles. Residents of the same state can live in dissimilar environments, such as a quiet, sparsely inhabited countryside and a bustling, crowded metropolis. These environmental differences are associated with individual differences of residents. For example, despite states often being represented as either "red" or "blue," residents of a state vary in their political affiliations (Seyle & Newman, 2006), and voting patterns are clustered by urban density (with some exceptions; Morrill et al., 2007). Additionally, compared to residents of smaller cities, residents of bigger cities in the U.S. tend to have higher levels of education and income (Glaeser & Maré, 2001), and different occupational skills (Bacolod et al., 2009). Even within a city, the population density of a neighborhood is associated with differences in neighborhood personality (Jokela et al., 2015).
Compared to state-sized regions, U.S. ZIP Codes may better cluster individuals' personalities because the environment of a ZIP Code is smaller and more uniform than an entire state. Individuals with similar personalities may cluster together in a ZIP Code because they face similar economic constraints or have similar desires for the amenities of the ZIP Code. There is evidence of this clustering for neighborhoods, geographic areas that are typically smaller than U.S. ZIP Codes. People prefer to live in neighborhoods where other residents are of similar political affiliations (Gimpel & Hui, 2015) and ideologies (Motyl et al., 2014). Additionally, liberals and conservatives differentially prefer certain aspects of neighborhoods, such as nearness to museums, ethnic diversity, and religious homogeneity (Pew Research Center, 2014). Besides political affiliation, a person may be more likely to migrate to a neighborhood in which the person's self-image closely matches the perceived typical resident (Sirgy et al., 2005). The benefits of this type of person-environment fit have been found at the city level: a better fit between the aggregated personality of a city and a city's resident is associated with higher selfesteem of the resident (Bleidorn et al., 2016). Additionally, characteristics of a neighborhood, such as its affluence or ethnic diversity, impact the individual differences of resident children and adolescents, such as their cognitive ability and behavioral problems (for a review, see Leventhal & Brooks-Gunn, 2000). Narrow personality traits may be more informative Narrow personality traits may be more informative than broad traits than broad traits The dominant trait model of personality psychology is the "Big Five" taxonomy, which is composed of five broad domains (Openness, Conscientiousness, Extraversion, Agreeableness, and Neuroticism; Goldberg, 1999). An abundance of studies correlating Big Five domains with life outcomes has helped the Big Five to become widely accepted personality constructs (for reviews, see Ozer &Benet-Martínez, 2006 andRoberts et al., 2007).
A limitation of the Big Five is their broad nature. Facets, narrower traits nested within domains, often account for more outcome variance than Big Five domains and provide greater specificity to personality-outcome relationships (Mershon & Gorsuch, 1988;Paunonen & Ashton, 2001).
Across multiple cultures, narrower traits have predicted more replicated behavioral outcomes than the Big Five (Paunonen et al., 2003). Additionally, personality-outcome relationships that appear to exist at the domain level may in fact be driven by facets. For example, Mõttus and colleagues (2012) found that the likelihood of sexually transmitted disease (STD) diagnosis was related to two of the Big Five domains (Neuroticism and Agreeableness) but only three of thirty personality facets (Impulsivity, Hostility, and Deliberation). At the aggregated level of U.S. states, facets of Extraversion (Assertiveness and Activity) have been found to be differentially correlated with state-level demographics (Rentfrow, 2014).
Personality researchers have recently delved into even narrower traits, or "nuances," which are the breadth of individual items (McCrae, 2015). Concerns about the low reliability of items have been answered with large-sample studies indicating that nuances are reliable and valid constructs; the unique variance of individual items has crossrater agreement, is rank-order stable, and is partially heritable (Mõttus et al., 2014(Mõttus et al., , 2017(Mõttus et al., , 2019. Growing evidence suggests that nuance-level models provide more specificity and predictive power than models that utilize facets or domains (Elleman et al., in press;Mõttus et al., 2015Mõttus et al., , 2017Revelle et al., 2020;Seeboth & Mõttus, 2018). This additional specificity can alter how findings are interpreted. For example, body mass index has been found to be positively correlated with the facet Impulsivity across several studies, but this relationship appears to be more accurately attributed to just two eating-related Impulsivity items (Terracciano et al., 2009;Vainik et al., 2015). Another study found that, across 40 outcomes, the specific variance of nuances accounted for a substantial portion of the predictive power of domain-level models, suggesting that the idiosyncrasies of individual items are more responsible for domain-level relationships than is commonly understood (Seeboth & Mõttus, 2018). Lastly, one recent study found that a participant's country of origin was better predicted with personality items than facets or domains (Achaa-Amankwaa et al., 2020). Taken together, this body of research indicates that examining personality-outcome relationships at a level higher than nuances "entails a substantial loss of information" (Mõttus et al., 2015, p. 25) and risks misattributing nuance-level personality-outcome relationships to broader traits.
Overview of the current study Overview of the current study We had two primary aims. The first was to estimate the variance explained by aggregating personality for U.S. ZIP Codes and states. Because the environment of a ZIP Code should be smaller and more homogeneous than a state, we predicted that the aggregation effect for U.S. ZIP Code residence would be larger than for U.S. state residence. The study's second aim was to determine whether greater specificity (i.e., a more specific relationship that could not be attributable to a broader trait) could be gleaned from geographical psychology findings with the use of progressively narrower personality traits (domains, facets, and nuances, respectively). First, we found the zero-order correlations between ZIP Code demographics and aggregated Big Five domains. We then investigated whether facets within a domain were differentially correlated with the demographic. To determine personality-demographic relationships at the nuance level, we first found the personality items that were most highly correlated with each demographic. For each demographic, we created a scale composed of these best items Smaller Is Better: Associations Between Personality and Demographics Are Improved by Examining Narrower Traits and Regions Collabra: Psychology (a "best-items scale"). This type of scale has also been referred to as a "polyitem score" (Mõttus et al., 2017) and is similar to the dustbowl empiricism of the Minnesota Multiphasic Personality Inventory (Hathaway & McKinley, 1942). Lastly, we compared the effect sizes of regression models using domains, facets, and nuances in order to determine if greater specificity was a benefit or detriment to prediction.

Participants Participants
Screening procedures. Participant data were collected from the Synthetic Aperture Personality Assessment (SAPA) project, an international online personality assessment (https://SAPA-project.org; Revelle et al., 2016). As compensation for taking the survey, participants received automated feedback concerning their personality. Participant data for this sample were collected from January 2013 to February 2017 and are publicly available (Condon et al., 2017b(Condon et al., , 2017aCondon & Revelle, 2015a. 1 To limit the sample to those living in the U.S. who, more likely than not, had a choice concerning where they resided, we included participants only if they provided a valid U.S. ZIP Code of their residence and indicated they were 18 years of age or older. Participants were excluded if they were 18 years old and indicated they had not finished high school, due to our assumption that they would still be residing in their parents' home (i.e., they did not yet have the choice concerning where to reside). As was standard lab practice, we removed participants who reported having previously taken the assessment and duplicate entries taken in the same internet browser session. Based on these initial criteria, there were 75,716 participants in 14,000 unique ZIP Codes, with a mean of 5.4 participants per ZIP Code (median = 3). The distribution of participants per ZIP Code was positively skewed, with 6,612 ZIP Codes containing one or two participants. To lower the error associated with aggregated personality estimates of ZIP Codes, ZIP Codes were excluded if they contained less than 10 participants, as this cutoff improved the reliabilities of ZIP Code measures while retaining a large number of participants and ZIP Codes. 2 This requirement reduced the sample to 39,886 participants in 2,074 ZIP Codes ( Figure 1) and increased the mean number of participants per ZIP Code to 19.2 (median = 14). This final sample had a younger median age, 24 years, compared to the initial sample, 25 years. Other descriptive information regarding differences in the initial and final samples are available in Table 1 of the supplemental materials. Participant demographic information. For the final sample that required at least 10 participants per ZIP Code, median self-reported age was 24 years (median absolute deviation = 7), with a range from 18 to 89 years. Female participants accounted for 65% of the sample. Of the 80% of participants who reported their ethnicity, 62% identified themselves as White, 13% as Hispanic American, 12% as African American, 5% as Asian American, 1% as Native Alaskan/ Hawaiian/American, and 8% as multi-racial or "other." Percent of minority participants in each ZIP Code was calculated by dividing the number of participants in a ZIP Code who indicated any ethnicity besides "White" by the number of participants in a ZIP Code who self-reported their ethnicity. The percent of minority participants in ZIP Codes had a large correlation ( , ) with the Census estimate of percent of minority residents in each ZIP Code, which indicated minority representativeness in ZIP Codes for this study's sample.
Participants self-reported their educational attainment and their parents' educational attainment, using a scale with seven options, ranging from "Less than 12 years" to "Graduate or professional school degree." Almost all participants provided their own education (97%) and at least one parent's education (91%).
3 College students accounted for 57% of the sample that reported their education, while 30% reported having at least a bachelor's degree. Of the participants who responded, 52% reported that at least one of their parents had attained at least a bachelor's degree.
Personality measures. Unlike most personality assessments, the SAPA project does not require that a participant answer every item from every scale. The SAPA project utilizes a massively-missing-completely-at-random (MMCAR) procedure in which participants receive randomly selected items from each inventory of interest (Revelle et al., 2010(Revelle et al., , 2016. Items from each inventory may have a different sampling rate. Due to high rates of missingness, missing scores were not imputed; aggregated personality scores for ZIP Codes and all analyses were trained only on non-missing item scores. Although imputation can allow for unbiased estimates of missing data (Little et al., 2013), a recent study of SAPA data with 90% data missingness indicated that models based on imputed data were less predictive than models trained only on the 10% actual data (Elleman et al., in press).
For all personality items, participants rated themselves using a six-point Likert-like scale (1 = "very inaccurate," 6 = "very accurate"). Big Five domains and facets were measured with items sampled from the 300-item IPIP-NEO inventory from the International Personality Item Pool (IPIP; Participant ZIP Code data have not been publicly shared but are available to researchers upon request. Additionally, the cited data sets are from 2010 to 2017, but SAPA began collecting ZIP Code information in 2013.
See Tables 1, 2, and 3 of the supplemental materials for analyses related to different minimums of participants per ZIP Code.
In instances of both parents' educational attainment being reported, an average of the two was taken so that one variable could represent mean parents' education. Although education categories were not technically intervals, each more advanced education category was approximately equivalent to two additional years of schooling.  http://ipip.ori.org/), an online repository for public domain items and scales (Goldberg, 1999;Goldberg et al., 2006). Each of the IPIP-NEO Big Five domains consisted of six facets and 60 items; each of the 30 IPIP-NEO facets consisted of ten items. The IPIP-NEO Big Five domains and facets are based on the NEO-PI-R conception of the Big Five (Costa & McCrae, 1992). On average, each participant was given 30 Big Five items (10% of the inventory). For item-level analyses, 696 items from 92 public-domain personality scales were used (Condon et al., 2017c;Condon & Revelle, 2015b), which included the 300 IPIP-NEO items.

ZIP Code demographic measures ZIP Code demographic measures
Four demographics of interest were selected: population density, income disparity, 4 median income, and ethnic diversity. Three of the four demographics (population density, income disparity, and ethnic diversity) were chosen due to previous evidence in the literature that indicated these demographics were correlated with personality at geographic levels smaller than countries (Glaeser & Maré, 2001;Jokela, 2014;Jokela et al., 2015;Pew Research Center, 2014;Tausanovitch & Warshaw, 2014). Median income was chosen as an exploratory variable and a contrast to income disparity. All four ZIP Code-level demographics were chosen in part because they were variables of convenience; they were publicly available from the U.S. Census Bureau.
Population density and ethnic household data were taken from the 2010 Census (Alabama State Data Center, 2012). Income data were taken from the 2013 American Community Survey (U.S. Census Bureau, 2017). Ethnic diversity was calculated using an "index of fractionalization" formula, which is the probability that two randomly selected residents of a ZIP Code would be of different ethnicities (Easterly & Levine, 1997). For the final sample, the average ZIP Code ethnic diversity was .36 (SD = .17). Income disparity was measured with the Gini index, as reported in the 2013 American Community Survey. The Gini index is a coefficient with a range from 0 to 1 that measures the distribution of incomes among a group of people (Ceriani & Verme, 2011). In the context of this study, a ZIP Code with a Gini of 0 would indicate that income was distributed equally among residents of the ZIP Code, while a Gini of 1 would indicate that one resident accounted for all the income of the ZIP Code. For the final sample, the average ZIP Code Gini was .45 (SD = .06); for comparison, the World Bank's (2016) estimate of U.S. national Gini is .41.
Of the four demographics, readers may be the least familiar with income disparity. Income disparity refers to the extent to which income is distributed unevenly among residents in a ZIP Code (see Methods for more detail on the calculation). Income disparity is not synonymous with median income; the median income of more income disparate ZIP Codes may be lower than the national median (e.g., Pittsburgh, PA, 15213) or higher (e.g., New York, NY, 10013). Conversely, there may be less income disparity in a ZIP Code when residents tend to earn less than the national median (e.g., Killeen, TX, 76544; near Fort Hood) or more (e.g., Ashburn, VA, 20147; a suburb of Washington, DC). Compared to equal-income areas, income disparate areas tend to have higher rates of crime (Daly et al., 2001;Rufrancos et al., 2013;Wilson & Daly, 1997), worse economic outcomes for individuals (Frank, 2013) and lower life expectancy (Pickett & Wilkinson, 2015). 4 Smaller Is Better: Associations Between Personality and Demographics Are Improved by Examining Narrower Traits and Regions Collabra: Psychology

Statistical analyses Statistical analyses
All analyses were performed in RStudio (RStudio Team, 2020), an integrated development environment for the programming language R (R Core Team, 2018), using, most notably, the packages "psych" (Revelle, 2020) and "ggplot2" (Wickham, 2016). All analyses were at the aggregated ZIP Code level unless otherwise noted. ZIP Code-level scores of individual-level scales and items were calculated by taking the mean of individual scores for each ZIP Code that contained at least 10 participants. Significance testing of ZIP Code-level analyses was based upon the number of ZIP Codes included in the final sample ( ) unless otherwise noted. Median income and population density were natural-log-transformed for correlation and regression analyses due to each variable's positive skewness.

Results Results
The magnitude and reliability of personality The magnitude and reliability of personality differences at the U.S. ZIP Code and state levels differences at the U.S. ZIP Code and state levels To estimate the personality variance explained by ZIP Code and U.S. state residence, we calculated intraclass correlations (ICCs). ICCs partition total variance into withingroup and between-group variance and reflect various ratio terms. Specifically, ICC1 indicates how much total variation is explained by aggregation (i.e., what percent of total variance is accounted for by ZIP Code residence; Shrout & Fleiss, 1979). For all 35 personality variables of interest, the ZIP Code aggregation effect had a larger point estimate than the state aggregation effect ( Figure 2). 5 A paired t-test indicated that there was a significant difference in the percent of personality variance explained by ZIP Codes (M = 2.6, SD = 3.9) and states (M = 0.6, SD = 1.0); t(34)= , p .001. Based on the median ratio, the ZIP Code aggregation effect for personality was 4.4 times as large as the effect for U.S. states. The personality variance explained by ZIP Code residence had a range from 1.0-1.4% for the Big Five domains and 0.2-7.0% for the 30 personality facets. 6 ICC2 measures group mean reliability and estimates the extent to which one would expect aggregated differences (i.e., aggregated ZIP Code personality differences) to replicate in a new sample of a similar size (Bliese, 2000). ICC2 values have a range from 0 to 1. Similar to the Spearman-Brown prophecy formula, as the average number of participants per group (ZIP Code) increases, even a small aggregated effect (ICC1) can have a reliable group mean difference (ICC2). Because ICC2 values are determined by the number of participants in a group and the magnitude of ICC1 values, ICC2 reliabilities were not large for Big Five domains (range: -) or facets (range: -; Table 3 of the supplemental materials). 7 Zero-order correlations between ZIP Code Zero-order correlations between ZIP Code demographics and aggregated participant Big Five demographics and aggregated participant Big Five personality personality Zero-order correlations at the aggregated ZIP Code level were found for the four ZIP Code demographics and participants' Big Five domain scores. Because scores from ZIP Codes with more participants were more accurate estimates than ZIP Codes with fewer participants, correlations were weighted by the number of participants in a ZIP Code. To adjust for multiple comparisons, a Holm adjustment (Holm, 1979) was used to account for all 36 correlations. With the adjustment, the required minimum absolute correlation for typical statistical significance ( ) was still very small ( ). In order to focus on effects of a larger magnitude, the effect size threshold for a notable correlation was set to ( ). Based on this criterion, there were six notable correlations: population density was positively correlated with Openness ( ), and negatively correlated with Conscientiousness ( ) and Agreeableness ( ); and income disparity was positively correlated with Openness ( ), and negatively correlated with Conscientiousness ( ) and Agreeableness ( ; Table  1; see Table 4 of the supplemental materials for a correlation matrix that includes participant demographics).
The robustness of the basic correlational structure. To ensure these zero-order correlational results were not an artifact of the requirement of 10 participants per ZIP Code, the robustness of the pattern of correlations was tested using a minimum of 5, 10, 15, and 20 participants per ZIP Code. For aggregated participant variables, a similar pattern of correlations emerged for all four minimums, although the absolute value of correlations typically increased as the minimum number of participants increased (Figures 1-4 of the supplemental materials). That is, the structure of the correlation matrix was largely the same, regardless of minimum participants per ZIP Code, but improved ICC2 reliabilities resulted in more accurate estimates of aggregated ZIP Code traits and thus larger absolute correlations between participant variables. Correlations between ZIP Code-level variables from the Census Bureau were mostly unchanged, regardless of the number of participants per ZIP Code.
Zero-order correlations between ZIP Code Zero-order correlations between ZIP Code demographics and aggregated personality facets demographics and aggregated personality facets Because population density and income disparity were found to be correlated with three of the Big Five (Openness, Conscientiousness, and Agreeableness), we examined whether facets within each of these domains were differentially correlated with population density (Figure 3) and income disparity (Figure 4). Facets of traits visually appeared to be differentially correlated with population density and To determine if the personality variance explained by ZIP Code residence would still be consistently larger than that of states, even if the data were random, a simulation (1,000 iterations) was run such that participants were randomly assigned to ZIP Codes and states. In the simulation, the personality variance explained was less than 0.01% for ZIP Codes and states, and ZIP Code ICC1s had a larger point estimate than state ICC1s for 20 of the 35 variables ( ).
To ensure that ICC1 values were not due to the minimum of 10 participants per ZIP Code, we compared ICC1 values across four minimums (5, 10, 15, and 20 participants per ZIP Code). ICC1 values for personality variables were stable across the four minimums (Table 2 of the supplemental materials).
ICC2 reliabilities were consistently improved by increasing the required minimum number of participants per ZIP Code (Table 3 of the supplemental materials).   income disparity, which prompted the statistical analyses of these potential differences.
In order to determine which facets within a domain were most highly correlated with each demographic, the following approach was taken: First, for each domain, the facet with the largest absolute correlation with a demographic was identified (e.g., for Openness and population density, the Liberalism facet had the largest absolute correlation). Then, five tests determining the differences in non-independent correlations (Steiger, 1980) were performed to compare that demographic-facet correlation with each other same-domain demographic-facet correlation (e.g., the population density-Liberalism correlation was compared against the correlations of the other five facets of Openness and population density). P-values were Holm-adjusted for the 30 comparisons. For the correlations with population density and the three relevant Big Five traits, six of eighteen facets were most responsible: Liberalism and Intellect (Openness); Self-discipline and Dutifulness (a facet of Conscientiousness); and Modesty and Morality (Agreeableness; Table 5 of the supplemental materials). For the correlations with income disparity and the three relevant Big Five traits, an almost identical list of seven facets was found, adding Orderliness (Conscientiousness; Table 5 of the supplemental materials).
Because facets within a domain could have been differentially correlated with a demographic such that the opposing effects would have cancelled each other out at the level of the domain, we also found facet-level correlations for all domains that were not meaningfully correlated with a ZIP Smaller Is Better: Associations Between Personality and Demographics Are Improved by Examining Narrower Traits and Regions Collabra: Psychology

Zero-order correlations between ZIP Code
Zero-order correlations between ZIP Code demographics and aggregated personality nuances demographics and aggregated personality nuances In order to gain even greater specificity in the relation-Smaller Is Better: Associations Between Personality and Demographics Are Improved by Examining Narrower Traits and Regions Collabra: Psychology Table 2  Table 2. The 21 personality items most highly correlated with ZIP Code . The 21 personality items most highly correlated with ZIP Code population density population density. The . The column "Corr." is a mean of 1,000 zero-order correlations, each using a 90% sample. Lower and upper column "Corr." is a mean of 1,000 zero-order correlations, each using a 90% sample. Lower and upper confidence intervals are ± 2 SDs based upon the 1,000 iterations. The column "Key" indicates whether confidence intervals are ± 2 SDs based upon the 1,000 iterations. The column "Key" indicates whether the item was positively or negatively keyed on the listed domain/facet. Some items may appear in more the item was positively or negatively keyed on the listed domain/facet. Some items may appear in more than one inventory. than one inventory. ship between the four ZIP Code demographics and aggregated personality, we correlated each demographic with all 696 personality items. To lower the likelihood of false-positive findings, we took an iterative approach that was similar to k-fold cross-validation (Chapman et al., 2016) and bootstrap aggregation ("bagging"; Breiman, 1996). In each iteration, we found correlations for a random 90% training sample and 10% validation sample. For the training sample, we selected up to 34 personality items (5% of the item pool) with the largest correlations with each demographic. We discarded correlations smaller than | | = .11, which reflected the Bonferroni-adjusted (Dunn, 1961) minimum absolute correlation for standard statistical significance ( ) based upon 696 comparisons and the minimum number of ZIP Code scores across the 696 items ( ). We found a multiple correlation for each demographic and the selected items for the training and validation samples. This process was iterated 1,000 times. We found and compared averages for training and validation multiple correlations. Then, for each demographic, we selected a final set of personality items, which was based upon having been in the top 34 items in at least 99% of the iterations. We combined each final set of items into a best-items scale and found the correlation between the scale and the corresponding demographic for the full sample. Lastly, we qualitatively assessed the item content in each best-items scale (which assumed the face validity of each item's content) to determine if there were consistent themes across the items.
For population density, the average training multiple correlation was and the average validation correlation was . There were 21 personality items selected. In the full sample, a best-items scale made up of these items had a moderate correlation with population density ( ). The items in the best-items scale for population density were conceptually consistent and indicated that the personality of residents in densely populated ZIP Codes included: politically liberal, areligious, and anti-authoritarian attitudes; low concern for abiding by rules, laws, and routines; and beliefs of self-exceptionalism ( Table 2). Six of ten Liberalism items were in this best-items scale, suggesting that the relationship between Liberalism and population density was not due to just a few items. However, no other facet Smaller Is Better: Associations Between Personality and Demographics Are Improved by Examining Narrower Traits and Regions Collabra: Psychology Table 3  Table 3. The 18 personality items most highly correlated with ZIP Code . The 18 personality items most highly correlated with ZIP Code income disparity income disparity. The column . The column "Corr." is a mean of 1,000 zero-order correlations, each using a random 90% sample. Lower and upper "Corr." is a mean of 1,000 zero-order correlations, each using a random 90% sample. Lower and upper confidence intervals are ± 2 SDs based upon the 1,000 iterations. The column "Key" indicates whether confidence intervals are ± 2 SDs based upon the 1,000 iterations. The column "Key" indicates whether the item was positively or negatively keyed on the listed domain/facet. Some items may appear in more the item was positively or negatively keyed on the listed domain/facet. Some items may appear in more than one inventory. than one inventory. was as well-represented. Additionally, there was one instance where items with similar content were represented by seemingly different traits. A theme of aversion to rules was represented by items from five different scales: Dutifulness ("Break rules"), Morality ("Stick to the rules"), Orderliness ("See that rules are observed"), Honesty-Propriety ("Cannot imagine lying or cheating"), and Psychoticism ("People should always respect the law"). For income disparity, the average training multiple correlation was and the average validation correlation was . There were 18 personality items selected. In the full sample, a best-items scale made up of these items had a moderate correlation with income disparity ( ). The items in the best-items scale for income disparity were conceptually consistent and indicated that the personality of residents in income disparate ZIP Codes included: political-ly liberal, areligious, and anti-authoritarian attitudes; intellectual interests; low concern for abiding by rules and laws; and beliefs of self-exceptionalism (Table 3). Six of ten Liberalism items were in this best-items scale, suggesting that the relationship between Liberalism and income disparity was not due to just a few items. Although no other facet was as well represented, there was a common theme of intellectual interests among three similar scales: the Intellect domain ("Try to avoid complex people"), the Intellect facet ("Avoid difficult reading material"), and the Inquisitiveness facet ("Find political discussions interesting"). Twelve of the items in the empirical scale for income disparity were also in the best-items personality scale for population density, and differences in the mismatching items indicated that residents in densely populated ZIP Codes were unique in having low concern for routines, whereas residents in in-Smaller Is Better: Associations Between Personality and Demographics Are Improved by Examining Narrower Traits and Regions Collabra: Psychology Table 4  Table 4. Regression models accounting for variance in ZIP Code . Regression models accounting for variance in ZIP Code population density population density: : β coefficients for coefficients for personality variables. The column "Corr." lists zero-order correlations with population density. The personality variables. The column "Corr." lists zero-order correlations with population density. The number of personality items used in a given model is listed. Standard errors are small for all number of personality items used in a given model is listed. Standard errors are small for all β coefficients in all models (.02 coefficients in all models (.02 ≤ SE SE ≤ .03); the unadjusted minimum significant | .03); the unadjusted minimum significant |β| = .04 (p < .05). | = .04 (p < .05). come disparate ZIP Codes were unique in their intellectual interests. For median income, the average training multiple correlation was and the average validation correlation was . Only one personality item, "Believe that I am better than others," was selected. Because one item was not sufficient to create a personality scale, best-items scale analysis was terminated for median income. For ethnic diversity, the average training multiple correlation was and the average validation correlation was . Only two personality items, "Am an extraordinary person" and "Need the approval of others," were selected. Because two items were not sufficient to create a personality scale, best-items scale analysis was terminated for ethnic diversity.

Multiple regression model comparison with
Multiple regression model comparison with personality nuances, facets, and domains personality nuances, facets, and domains To compare how strongly ZIP Code population density and income disparity were related to personality at the nuance, facet, and domain levels, we created three linear regression models for each criterion, with each model using a different level of personality. 8 The models used only ZIP Code-level data (i.e., in multilevel modeling terminology, only level 2 data). We did not include median income or ethnic diversity in these analyses because the previous null results indicated that a best-items scale could not be generated for these two demographics. For ZIP Code population density, regression models built with nuances (Model 1) and facets (Model 2) accounted for approximately the same amount of variance ( .14), while the Big Five (Model 3) accounted for slightly less ( .12), despite differences in the number of personality items used in each model ( 21, 60, and 180, respectively; Table 4). For ZIP Code income disparity, variance accounted for was largest for the regression model built with nuances (Model 1; .18), was slightly less for facets (Model 2; .17), and was even less for the Big Five (Model 3; .14), despite differences in the number of personality items used in each ( 18, 70, and 180, respectively; Table 5).

Discussion Discussion
ZIP Codes were better than states as units of aggregated ZIP Codes were better than states as units of aggregated personality. personality. Participants were found to be clustered in ZIP Codes by personality. Consistent with our hypothesis, this clustering effect was 4.4 times as large for U.S. ZIP Code residence than for U.S. state residence. In terms of aggregating individual personality in this sample, ZIP Codes were a better geographic unit than states. This finding suggests that in addition to being a relevant unit of geographical psychology, ZIP Codes may better cluster individual personality than larger geographic regions within a country.
ZIP Code demographics were systematically related to ZIP Code demographics were systematically related to aggregated personality. aggregated personality. Population density and income disparity were found to be positively correlated with Openness and negatively correlated with Conscientiousness and Agreeableness, which was consistent with two previous studies (de Vries et al., 2011;Jokela et al., 2015). Neither Openness nor Conscientiousness was notably correlated with ethnic diversity, and Extraversion was not correlated with population density, which was inconsistent with a previous study (Jokela et al., 2015). This inconsistency across studies may be explained by the fact that the previous study examined the postal districts of London, whereas the current study examined ZIP Codes from all across the United States. Certain ZIP Code demographics, such as ethnic di-ZIP Codes varied by participant demographics: educational attainment, parents' education, average age, gender ratio, and percentage with minority status. For regression models that include those variables as covariates, see Tables 6 and 7 of the supplemental materials. 8 Smaller Is Better: Associations Between Personality and Demographics Are Improved by Examining Narrower Traits and Regions Collabra: Psychology Table 5  Table 5. Regression models accounting for variance in ZIP Code . Regression models accounting for variance in ZIP Code income disparity income disparity: : β coefficients for coefficients for personality variables. The column "Corr." lists zero-order correlations with income disparity. The personality variables. The column "Corr." lists zero-order correlations with income disparity. The number of personality items used in a given model is listed. Standard errors are small for all number of personality items used in a given model is listed. Standard errors are small for all β coefficients in all models (.02 coefficients in all models (.02 ≤ SE SE ≤ .03); the unadjusted minimum significant | .03); the unadjusted minimum significant |β| = .04 (p < .05). | = .04 (p < .05). versity, may exhibit systematic relationships with aggregated personality within a city but not across cities, per the Yule-Simpson paradox (Kievit et al., 2013;Simpson, 1951;Yule, 1903). Compared to domains, facets provided more specificity Compared to domains, facets provided more specificity concerning the relationships between demographics and concerning the relationships between demographics and personality. personality. Facets within the Big Five domains of Openness, Conscientiousness, and Agreeableness were differentially correlated with population density and income disparity. Specifically, six or seven facets were most responsible for these relationships between personality and the two demographics: Intellect, Liberalism, Dutifulness, Selfdiscipline, Morality, Modesty, and (only for income disparity) Orderliness. Liberalism's positive correlation with population density was consistent with a previous study (Tausanovitch & Warshaw, 2014), while Liberalism's lack of positive correlation with ethnic diversity (Figure 8 of the supplemental materials) was inconsistent with a previous study (Pew Research Center, 2014). This inconsistency across studies may be due to differences in studies; the previous study measured self-identified liberals' preference for a neighborhood's ethnic diversity, whereas the current study measured the extent to which ZIP Codes higher in Liberalism were also ethnically diverse.
Compared to facets, nuances provided more specificity Compared to facets, nuances provided more specificity concerning the relationships between demographics and concerning the relationships between demographics and personality. personality. The personality items that correlated most strongly with population density or income disparity provided a precise description of the personality-demographic relationships. Although the two sets of items were not completely identical, descriptive summaries of each set of items were similar: compared to residents of sparse and equal-income ZIP Codes, residents of densely populated and income disparate ZIP Codes tended to have more politically liberal, areligious, and anti-authoritarian attitudes; lower concern for abiding by rules and laws; and greater beliefs of selfexceptionalism. Residents of densely populated ZIP Codes also showed lower concern for routines, while residents of income disparate neighborhoods reported more intellectual interests. The best-items scale for population density had a moderate correlation with population density ( ), as did the best-items scale for income disparity with income disparity ( ). The accuracy of models using a small subset of nuances The accuracy of models using a small subset of nuances was as good or slightly better than that of models that used was as good or slightly better than that of models that used facets or domains. facets or domains. We compared the variance explained for population density and income disparity across the three levels of personality (nuances, facets, and domains), using multiple regression models. In the case of population density, nuances and facets explained the same amount of variance ( = .14), while domains explained slightly less ( = .12). In the case of income disparity, nuances ( = .18) explained slightly more than facets ( = .17), which explained more than domains ( = .14). Previous research has found that models utilizing nuances are substantially more predictive of outcomes than domains or facets, but these nuance-level models have utilized all of the items in an item pool (Mõttus et al., , 2017Seeboth & Mõttus, 2018). The predictive utility of nuances may have been underestimated in this study due to the approach of retaining a small subset of the items that most highly correlated with a demographic.
Nuance-level specificity may help generate better-Nuance-level specificity may help generate betterinformed personality theory informed personality theory Facet-level analysis may help guide psychological research away from overgeneralizations that would have been difficult to avoid with domain-level analysis. If the current study were to have examined the relationship between Openness and population density only at the domain level, the theory generated to explain the existence of these results might have relied on the domain of Openness as a whole. We even might have conjured an image of residents that would have relied more on a few facets that weren't actually as important, such as free-spirited bohemians or imaginative artists. Instead, the current study narrowed the personality traits of interest from 18 facets (contained within three domains) to just six or seven facets. Nuance-level analysis described an even more precise Smaller Is Better: Associations Between Personality and Demographics Are Improved by Examining Narrower Traits and Regions Collabra: Psychology personality pattern, indicating that residents of densely populated and income disparate ZIP Codes tended to have more politically liberal beliefs, more anti-authoritarian attitudes, a stronger aversion to rules, and greater beliefs of self-exceptionalism. Conversely, residents of more sparsely populated and equal-income ZIP Codes typically were more politically conservative, authoritarian, and believing in the rule of law. The broader facet-level analysis identified Dutifulness, Morality, and Orderliness as being associated with population density, whereas nuance-level descriptive analysis indicated that nuances from five different traits (Dutifulness, Morality, Orderliness, Honesty-Propriety, and Psychoticism) were correlated with population density, and these nuances all clustered around the theme of aversion to rules. A qualitative interpretation of the nuance-level results is that the three identified facets were not uniquely important, but instead only appeared to be related to population density because all three contained items related to aversion to rules. Thus, a more precise description of the relationship between ZIP Code population density and personality would ignore the broader facets of Dutifulness, Morality, and Orderliness and instead focus on nuances related to aversion to rules, which themselves may indicate a novel facet-sized trait. By contrast, there was strong evidence that the whole facet of Liberalism was positively associated with population density and income disparity; six of ten Liberalism items were present in each demographic's best-items scale. A similar pattern of nuances was found in the best items for ZIP Code population density and income disparity, which could indicate a common underlying personality pattern (political orientation, views on authoritarianism, beliefs concerning the importance of rules and how rules apply to oneself) that is related to both demographic variables. This pattern of nuances is consistent with prior research that has linked population density with antisocial personality traits (Jonason, 2018), and income inequality with crime (Daly et al., 2001;Rufrancos et al., 2013;Wilson & Daly, 1997), disinvestment in social capital (Kawachi et al., 1997), risk-taking , and support for liberal policies (Brown-Iannuzzi et al., 2014).
Plausible mechanisms for the personality-Plausible mechanisms for the personalitydemographic relationships in this study demographic relationships in this study Each of the three broad hypotheses concerning how relationships between aggregated personality and demographics form (environmental factors, migration choices, and social influence) is plausible for explaining why personality aggregated by ZIP Code was related to population density and income disparity in this study. For example, ZIP Code population density and income disparity may act like environmental factors such that they affect residents' beliefs concerning politics, authority, and rules. Living in close proximity to others who are vastly wealthier may lead residents to become more averse to laws, rules, and authority figures that appear to reinforce that inequality. Alternatively, individuals who migrate to densely populated or income disparate ZIP Codes may be more willing to live in actual or perceived higher-crime areas because of their particular beliefs and behaviors that deemphasize the importance of rules, punishment, and authority figures. Conversely, residents of less dense or more equal-income ZIP Codes may seek the safety of lower crime ZIP Codes because they prefer a stronger authoritarian presence and the enforcement of rules and laws through punishment. Additionally, personality-typical residents of a ZIP Code may socially influence new residents toward the prevailing thoughts, feelings, behaviors, and beliefs of the ZIP Code. Lastly, all three of these explanations may be partially responsible for the personality-demographic relationships.

Limitations Limitations
One limitation of this study was the relatively low number of participants per ZIP Code (mean = , median = ), which led to low reliabilities at the ZIP Code-level of analysis. However, we found that the correlational structure of the data was robust to increases in the number of participants per ZIP Code. For a future study to reach high ZIP Code reliability of personality variables, a very large sample would be needed. For example, if one assumed that ZIP Code residence would explain 2% of personality variance and they desired an ICC2 reliability of .80, an average of 196 participants per ZIP Code would be needed, requiring a sample roughly ten times as large as the current study.
A second limitation of this study was that ZIP Codes were perhaps less ideal units of geographical psychology than neighborhoods. ZIP Codes were created by the U.S. Post Office for efficient mail delivery, not for demarcating small communities or neighborhoods (U.S. Postal Service Office of Inspector General, 2013). A ZIP Code can cover an area larger than a neighborhood, sometimes an entire town (e.g., Sand Springs, Oklahoma, 74063). Ideally, a geographical psychology study would utilize even smaller standardized geographic units that are closer in size to neighborhoods, such as census tracts. However, there are at least two disadvantages of studying census tracts. First, U.S. residents know the ZIP Code in which they live, while few are aware of their census tract. Thus it would be necessary to ask participants to share their home addresses. Some individuals may not be willing to share this information due to privacy concerns. Second, compared to the ZIP Code, a geographic unit as small as a census tract would require even more participants to achieve an acceptable number of participants-per-unit. Thus, millions of participants could be required.

Constraints on generality Constraints on generality
In accordance with recommendations by Simons et al. (2017), we acknowledge the following constraints on generality. One constraint concerns sample representativeness. Although the sample was ethnically diverse, participants tended to be more educated than the U.S. population. Thus, it is likely that this study undersampled individuals of lower socioeconomic status. In general, online samples are not representative of the U.S. population, but tend to be more diverse than traditional samples from psychological research (Gosling et al., 2004). We expect our findings to replicate in other large samples collected by personality psychologists, but a truly representative sample of the U.S. could show different results regarding ZIP Code personality-demographic relationships. We expect for personality to cluster by U.S. ZIP Codes, and for this effect to be stronger than for U.S. states, so long as the new sample is at least as diverse as the current one. Both clustering effects may be stronger in a more representative sample due to the current sample having a somewhat restricted range (specifically in terms of educational attainment). We expect for our results to generalize to a measure of neighborhoods (e.g., U.S. census tracts), but perhaps not geographic units as large as U.S. states, since a state-wide measure of population density would not accurately capture the density of cities or rur-Smaller Is Better: Associations Between Personality and Demographics Are Improved by Examining Narrower Traits and Regions Collabra: Psychology al areas within the state. As our data were self-reported, we expect our results to generalize to self-reported measures of personality. Although we performed sensitivity analyses to determine the extent to which results were robust against including more or fewer ZIP Codes, substantially diverging from the number of ZIP Codes in this study (either more or fewer) could impact the extent to which these results will replicate in a new study. In particular, the results of this study may not generalize to an examination of ZIP Code personality-demographic relationships within a single metropolitan area. A replication of our nuance-level results would require a large pool of items from multiple inventories. We have no reason to believe that the results depend on other characteristics of the participants, materials, or context.

Future directions Future directions
Future studies in all branches of personality psychology could benefit from examining the extent to which facetand nuance-level analyses provide greater specificity to domain-level findings, as long as researchers have the resources to collect a large sample of participants and randomly sample from an item pool of hundreds of items. Narrowing the unit of personality analysis has the potential to help guide new theories away from overgeneralizations that are easily committed when only using domain-level data. The field of geographical psychology may benefit from determining whether smaller units of analysis always provide better clustering of personality or whether there is an optimal unit of analysis, such as ZIP Codes or census tracts. It would also be valuable to determine the extent to which postal codes or neighborhoods are comparable across cities, and whether the comparison of personality aggregated by postal codes is more appropriate within a city than across cities.

Conclusion Conclusion
This study presented evidence that smaller units of analysis were better, both for geographical and personality psychology. For geographical psychology, the personality variance explained was four times as large for U.S. ZIP Codes than for U.S. states. For personality psychology, facets provided more specificity than domains concerning personality-demographic relationships, and nuances provided even more specificity than facets. This study found that ZIP Code population density and income disparity were positively correlated with a pattern of nuances that included an aversion to rules, political liberalism, anti-authoritarianism, and self-exceptionalism. While this finding is informative in itself, it also illustrates the utility of facet-and nuance-level analyses in large samples. Domains like the Big Five describe, by definition, broad generalizations concerning personality. Facet-and nuance-level approaches reveal more informative patterns of how personality is systematically related to our world.

Contributions Contributions
Contributed to conception and design: LGE, DMC, NSH, WR.
Contributed to analysis and interpretation of data: LGE, WR.
Drafted and/or revised the article: LGE. Approved the submitted version for publication: LGE, DMC, NSH, WR.

Funding information Funding information
Preparation of this manuscript was funded in part by grant SMA-1419324 from the National Science Foundation.