As scholars regularly document,1 states have frequently changed their systems of judicial selection and retention. What remains unknown is whether these systems actually address the kinds of qualities citizens value in their state judges.
Since 1946, the most frequent change has been to adopt some kind of system that proponents describe as “merit selection,” which constrains the governor or other selecting authority to choose from a list of candidates nominated or approved by a screening body. The most common form of merit selection is a version of Missouri’s Nonpartisan Court Plan, often referred to as the “Missouri Plan.” In this system, the governor fills all vacancies by making an appointment for an initial term from a list forwarded by a nominating commission, and incumbents stand in yes-no “retention elections” for subsequent terms. Thirteen states have adopted this system.2 But, interestingly, the Missouri Plan was last approved by voters more than three and a half decades ago, in Utah in 1985.3 Since then, voters in several states have rejected this system for some or all courts.
The other version of merit selection, which I label more generally as “constrained appointment,” similarly limits selections to persons nominated or otherwise screened by a legally required nominating or selection body, but it does not involve retention elections.4 Six states have adopted this system for some or all major courts.5
The debate over judicial selection is often framed around judicial independence versus accountability. Those favoring independence over accountability tend to promote systems of selection that deemphasize a direct role for voters; those favoring accountability over independence tend to prefer popular elections. In a recent book, Charles Geyh, who has long favored appointment over election,6 acknowledged that there are some good reasons for preferring elective systems, especially for state supreme courts.7 Geyh goes on to note, however, that much of the debate has focused on appellate judiciaries, and argues that contested elections for trial court judges may be even more problematic. The logic is that most of the work of trial courts “consists of routine matters in which the law is clear and the policy implications of the court’s legal rulings are limited,” and that the appeals process “keeps the excesses of trial courts in check.”8 He points to research that has found perverse effects of the election cycle on criminal sentencing in states where trial judges stand for retention in potentially contested partisan or nonpartisan elections.9 In contrast, state appellate courts, particularly courts of last resort, have a significant lawmaking function, particularly regarding common law issues; moreover, the decisions of state supreme courts are rarely subject to review by the U.S. Supreme Court.10 Given the greater policy role of appellate courts, electing the judges of those courts seems more justifiable.
Geyh’s argument that it is more difficult to justify electing trial court judges than appellate judges is counter to actual practice, assuming one counts the Missouri Plan as an appointive system. Ignoring the issue of appointments to fill interim judicial vacancies, 17 states have abandoned popular elections for the selection of some or all appellate court judges over the last 100 years.11 Sixteen of those states now use retention elections for subsequent or full terms for appellate judges; the exception is New York, where appellate judges are subject to reappointment. There are no states that elect appellate judges but appoint judges of major trial courts.12
In three states (Florida, Oklahoma, and South Dakota), voters approved the adoption of a Missouri Plan system for the states’ appellate courts but later rejected proposals that would have extended the system to trial courts. In several other states that adopted the Missouri Plan for appellate courts, the legislature also considered the system for trial courts, but there was never sufficient support to put it before the voters.
These divergent patterns of change suggest at least two questions: Why are voters in some states apparently unwilling to give up elections for trial courts even though they are willing to do so for appellate courts? And why are legislatures in additional states not even willing to put before voters the question of whether to forgo elections for trial courts? One possible explanation is that voters and legislators see different characteristics as desirable for judges at the appellate and trial levels.13 How might one describe possible characteristics that voters view as important in selecting judges?14
Surprisingly, the question of what citizens view as important in selecting judges has not been extensively explored. James Gibson, in a study of the 2006 nonpartisan election for the Kentucky Supreme Court, asked survey respondents to rate the importance of ten characteristics of a “good Kentucky Supreme Court judge.”15 Gibson omitted some obvious things such as “being fair and impartial” because he expected there would be little or no variation on those items. The two top characteristics were “protect people without power” and “strictly follow the law,” with 72.9 percent and 71.8 percent respectively rating them as “very important.” Next was “state how they stand on important legal and political issues as part of their campaigns,” with 64.2 percent rating this as “very important.” The two lowest-rated characteristics were “decide the way the majority wants” (30.1 percent, very important) and “base decisions on party affiliations” (18.5 percent, very important). To the extent these are expectations of what a judge should do, they may be more relevant when deciding whether to retain a judge (rather than elect her for the first time), since at least some of these characteristics would be difficult to assess without a history of judicial decisions made by the candidate.
Gibson also provides some data from a 2001 national survey conducted by Justice at Stake (JaS), a now-defunct organization that advocated the reform of judicial selection.16 That survey asked respondents to rate the importance of ten responsibilities of courts and judges using a 0-to-10 scale. The mean responses ranged from 8.19 (51.8 percent rating at 10) for “defending constitutional rights and freedoms” to 6.23 (18.1 percent rating at 10) for “advancing social and economic justice.” The others rated near the top were “ensuring fairness under the law,” “protecting civil liberties,” and “protecting individual rights.” Toward the bottom one finds “resisting political pressure” and “being an independent check on other branches of government.” As with Gibson’s own survey, the JaS survey focused more on expectations than on qualifications.
What, then, are the characteristics Americans want in their state judges, and do these characteristics differ depending on the type of court a judge will serve on?
For answers to these questions, I conducted a short survey, distinguishing between what I label “political characteristics” and “professional characteristics.” I identified six characteristics that I believe are essentially political in nature:
And another six that are more professional in nature:18
The respondents rated each of the 12 characteristics twice, once for the state supreme court and once for local trial courts, using a four-point scale with the points labeled “essential (4),” “very important (3),” “somewhat important (2),” and “not important (1).” One potential point of confusion is that in New York what is called the “supreme court” is not what most people think of as the “state supreme court”; the highest court in New York is called the Court of Appeals.19 The survey instructions alerted respondents to this issue in order to make it clear that they were to think about the state’s highest court.20 The survey also asked for the respondent’s political party affiliation, self-described ideology, gender, level of education, age, and the first three digits of the respondent’s zip code (used to determine the respondent’s state).
Due to limited time and resources, I obtained a sample using Amazon Mechanical Turk (MTurk).21 MTurk respondents are self-selected rather than randomly sampled. Consequently, one must be careful in interpreting results based on MTurk samples, because there are some known biases, including overrepresentation of males, political liberals, persons under 45, and those with at least a college education. To ameliorate the overrepresentation of liberals and of males, I did the survey in three stages.
In the first stage, the initial sample of 500 respondents overrepresented persons describing themselves as liberal as compared to what was shown in recent random sample surveys.22 To correct for this, I collected in phase two an additional 100 responses to balance the survey using a feature of MTurk that allowed me to restrict respondents to self-identified “conservatives.” Analysis of the combined sample of 600 showed both that women were underrepresented and that the women on average rated all of the characteristics as more important than did men. Consequently, to balance on gender, I collected in phase three a second supplemental sample of 65 women, producing a final sample of 329 men and 336 women, including one transgender female (plus three respondents selecting “Gender Variant/Nonconforming” and one who preferred not to answer the gender question). The two biases I did not adjust for were age and education;23 preliminary analysis showed that there was little correlation between either age or education and respondents’ ratings of desirable judicial characteristics.24 My final sample had usable responses from 669 respondents.
A first question is whether the 12 characteristics diverged along the lines of professional and political as I hypothesized. To assess this, I applied a statistical method called factor analysis, which can be used to assess whether a set of questions groups along one or more dimensions.25 I applied the method (both combining the responses regarding the two levels of courts and separately for local trial courts and state supreme courts) and found that the 12 characteristics did indeed group along these two dimensions.26 With one possible exception, the specific characteristics aligned nicely with my expectations, including experience as a criminal prosecutor, reflecting a professional dimension and a political dimension. The one exception is “respected by leaders of the legal community,” which for local trial judges split across the two dimensions, but which was slightly stronger on the professional dimension.
Turning to how the characteristics were rated, Figure 1 (above) shows the distribution of responses for each characteristic, with the professional characteristics at the top and the political characteristics at the bottom. The percentages rating a characteristic as “essential” appear in orange. The figure clearly shows the greater importance assigned to professional characteristics, with the percentage of respondents rating those characteristics as “essential” higher than any of the political characteristics, with the one exception of “understands community preferences” as regards judges of local trial courts.
Table 1 (above) shows two statistics for each court. The first column in each pair (“Mean”) is the mean rating and the second column (“% Essential”) is the percentage of respondents rating the characteristic as “essential.” The characteristics are ordered based on the percentage rating a characteristic as essential for state supreme court justices. As indicated in the table, for 10 of the 12 characteristics, respondents, on average, differentiated between the two courts when rating the characteristics.27
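The two statistics reported in Table 1 can be illustrated with a minimal Python sketch. It assumes only the four-point coding described in the survey design; the function name `summarize` and the example responses are hypothetical and are not drawn from the survey data.

```python
from statistics import mean

# Numeric codes for the four-point scale described in the survey design.
SCALE = {
    "essential": 4,
    "very important": 3,
    "somewhat important": 2,
    "not important": 1,
}

def summarize(responses):
    """Return (mean rating, percent rating 'essential') for one characteristic."""
    scores = [SCALE[r] for r in responses]
    pct_essential = 100.0 * sum(1 for s in scores if s == 4) / len(scores)
    return mean(scores), pct_essential

# Hypothetical responses, invented for illustration only (not survey data).
example = ["essential", "essential", "very important", "somewhat important"]
mean_rating, pct_essential = summarize(example)
print(f"Mean = {mean_rating:.2f}, % Essential = {pct_essential:.1f}")
```

Reporting both numbers is useful because two characteristics can have similar means while differing sharply in the share of respondents who treat them as indispensable.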
Regarding desired characteristics for state supreme court justices, the mean ratings for the professional characteristics all exceeded 3.4 on the 4-point scale, ranging up to 3.8, while the mean ratings for the political characteristics were all less than 3.0, ranging from 2.3 to 2.9. Over 80 percent of respondents rated two of the professional characteristics (“deep legal knowledge” and “reputation for integrity/high ethical standards”) as essential for supreme court justices. Three additional characteristics in the professional category (“excelled in law school,” “substantial experience practicing law in the courtroom,” and “reputation as a good listener”) were deemed essential for state supreme court justices by 70 to 77 percent of respondents. The highest ranked political characteristic for state supreme court justices was “respected by elected political officials,” but only 40.4 percent rated it as essential. In fact, for state supreme court justices, all the professional characteristics were rated higher than any of the political characteristics. However, it is noteworthy that three of the six political characteristics were also rated higher for state supreme courts than for local trial courts. Overall, this suggests that the public may have higher expectations for state supreme court justices than for local trial judges, particularly with regard to professional qualifications.
Although the professional characteristics also tended to be deemed the most important for trial court judges, there were notable differences in ratings compared to those for supreme court justices. The means for the professional characteristics ranged from 3.1 to 3.7, very close to but never exceeding the corresponding means for the state supreme court. The range for the political characteristics was 2.1 to 3.3, several exceeding the corresponding rating for the state supreme court. No characteristic was deemed essential for local trial judges by more than 80 percent of respondents, with the highest, “reputation for integrity/high ethical standards,” deemed essential by 76.1 percent of respondents. The top-ranked characteristic for the state supreme court, “deep legal knowledge,” was deemed essential for local trial courts by only 66.1 percent of respondents compared to 88.0 percent for the state supreme court. And although 77.0 percent rated “excelled in law school” essential for state supreme court justices, only 34.4 percent thought it was essential for local trial court judges. Interestingly, only 49.3 percent rated “substantial experience practicing law in the courtroom” as essential for trial court judges compared to 72.2 percent saying it was essential for a state supreme court justice.
While three of the professional characteristics stood at the top of the rankings for trial court judges, two of the political characteristics were rated substantially higher for trial court judges than for state supreme court justices. “Understands community preferences” was deemed essential for trial court judges by 54.4 percent of respondents; only 21.2 percent rated this as essential for state supreme court justices (means 3.3 and 2.3). Similarly, 31.4 percent rated “active in community organizations” as essential for trial court judges compared to 15.7 percent for supreme court justices (means 2.8 and 2.3). Thus, there was measurably less emphasis on professional characteristics and more emphasis on characteristics reflecting local knowledge and connections for trial court judges than for state supreme court justices.
Given the degree of political polarization in the United States as this is written, it is interesting that “strong support from the leaders of my preferred political party” was close to the bottom for both courts. Also, one might ask whether there were systematic differences between Republicans and Democrats in preferred characteristics. The answer is largely no. Comparing Democrats and Republicans,28 only 2 of 24 comparisons met the criterion for statistically significant differences, and those differences were modest. Republicans rated “experience as a criminal prosecutor” more important for state supreme court justices than did Democrats (33.1 percent essential versus 27.2 percent). Democrats rated “active in community organizations” higher for local trial court judges than did Republicans (34.1 percent essential versus 29.8 percent). Although both met the criterion to be statistically significant, the differences are of minimal substantive significance.
Based on the two-factor analyses, I combined the responses to obtain four scales, two for each court.29 One pair of scales was associated with political characteristics and the other with professional characteristics. I adjusted all scales to have an average of five and a standard deviation of one, with higher scores indicating a higher level of importance assigned to characteristics associated with that dimension. Table 2 (below) shows the scale averages broken down for each of the political and demographic variables included in the survey.30 The table also shows the probability (the “p-value”) that the variation across the categories of a variable could be attributed to chance, which when very low is referred to as “statistical significance.”31
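The adjustment to an average of five and a standard deviation of one can be sketched as a simple linear rescaling. The Python below is illustrative only: the article does not state how the raw scale scores were computed or whether a population or sample standard deviation was used, so the function `rescale`, the use of the population standard deviation, and the example values are all assumptions.

```python
from statistics import mean, pstdev

def rescale(raw_scores, target_mean=5.0, target_sd=1.0):
    """Linearly transform scores to have the given mean and standard deviation.

    Assumption: the population standard deviation (pstdev) is used; the
    article does not say which variant was applied.
    """
    m = mean(raw_scores)
    sd = pstdev(raw_scores)
    return [target_mean + target_sd * (x - m) / sd for x in raw_scores]

# Hypothetical raw scale scores, invented for illustration (not survey data).
raw = [2.0, 2.5, 3.0, 3.5, 4.0]
scaled = rescale(raw)
print([round(s, 2) for s in scaled])
```

Under this kind of rescaling, a difference of 0.25 between two group averages corresponds to one-quarter of a standard deviation, which is how the group differences discussed in connection with Table 2 can be read.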
The only differences that meet the criteria for statistical significance (i.e., a probability of occurring by chance of .05 or less) for self-identified ideology were the professional scale for local trial courts (with conservatives rating professional qualities lower, on average, than either liberals or those labeling themselves middle-of-the-road) and the political scale for the state supreme court (with liberals rating political qualities lower, on average, than the other two groups).
In addition to asking the respondents their self-identified ideology, I asked them which political party they identified with (Democrat, Republican, Independent, or other), what political scientists refer to as “party identification.” The pattern with party identification is interesting. Those identifying as Democrats or Republicans had higher average scores on the political scale than those identifying as Independents, even Independents leaning toward one of the parties; these differences meet the criterion for statistical significance. The pattern of the relationship between the professional scale and party identification is muddled, although it tends toward the opposite direction (i.e., higher average scores for Independents than for Democrats or Republicans). Thus, the differences here reflect “partisanship,” that is, identifying with a political party rather than which political party a respondent identified with.
The respondents’ states (based on the zip code information) were recoded into the states’ initial selection system: contested election (ignoring appointments to fill interim vacancies), Missouri Plan, or appointment.32 Separate variables were created for appellate selection and for trial selection; states in which trial selection varied by county or judicial district were coded “missing” for trial selection. As Table 2 shows, there were no statistically significant differences based on the formal system of initial selection in the respondent’s state of residence, although variations in the political characteristics scale for trial courts approached statistical significance. It is unclear what to make of the fact that respondents where the Missouri Plan is used for judicial selection in trial courts rate political characteristics higher than do respondents in states using contested elections or other appointment systems.
Turning to the demographic variables, gender stands out, with women rating both scales for both courts higher than men by about one-quarter standard deviation, rising to one-third standard deviation for the professional scale for the trial court. Moreover, ratings for the professional scale appear to rise with age, while ratings for the political scale appear to decline with age. The maximum differences related to age for the political scale are greater than a full standard deviation; the maximums for the professional scale slightly exceed one-third standard deviation. Regarding education, the differences for the trial court do not reach the standard of statistical significance for either the political or professional scale. Regarding the state supreme court, both scales tend to decline as education increases, but these findings meet the criterion for statistical significance only for the professional scale.
What happens if the five predictor variables (ideology, partisanship,33 age, education, and gender) are taken together? To answer this question, I combined those predictors in a regression model, details of which are provided in the online appendix.34 The patterns shown in the regression results were generally consistent with the bivariate results in Table 2.
There are some particularly interesting observations based on my analysis. First, women tended to rate both professional and political qualities higher than did men. Second, there appears to be a kind of “partisan impact,” with those who clearly identify with a party rating the political scale higher and the professional scale lower than the two categories of Independents. Finally, the patterns for age and education raise intriguing questions: Why do younger respondents tend to view political qualities in a judge as more important, while older respondents view professional qualities as more important? And why do more highly educated respondents value professional qualities less than those with a lower level of education?
The major limitation of this study is the sampling source. Ideally, the survey would have been done using a true national random sample rather than a self-selected MTurk sample. Hopefully the results presented here will inspire a replication using a better sample and include additional questions that measure variables such as political knowledge and political interest. In the interim, the results presented help account for why voters (and legislatures) in some states were willing to adopt variants of the Missouri Plan for state supreme courts but rejected that system for trial courts: Members of the public view certain professional characteristics (the very characteristics that Missouri Plan advocates argue nominating commissions will emphasize) as more important for state supreme courts than for trial courts. The public apparently views an understanding of the local political situation as more important for trial court judges than for state supreme court justices. This suggests that there is at least some understanding of the differing roles played by local trial courts and the top state appellate courts.
This also partially explains a greater willingness to adopt variants of the Missouri Plan for appellate courts than for local trial courts. Specifically, popular elections are more likely to keep judges tied to the local community than is selection through appointment. However, if voters understood that most trial judges in most “election” states initially obtain their positions by appointment to fill interim vacancies, the preference for election over appointment might decrease.35 Nonetheless, using contested elections for retention arguably allows voters in a community to reject appointees of governors not from the locally dominant political party, and there is some evidence that this does sometimes happen.36
Even with the differences between the two types of courts, professional characteristics tend to be deemed more important than political characteristics for both levels of courts. This in turn raises the question as to why there has been a lack of success in recent years in adopting versions of the Missouri Plan, which tends to emphasize professional characteristics. Essentially, there is an irony here: Voters appear to want judges with strong professional characteristics but seem increasingly inclined to distrust and reject mechanisms for judicial selection designed to focus on those very characteristics.
I argue elsewhere that this partly reflects that business interests that once supported systems such as the Missouri Plan now prefer contested elections.37 Those interests have learned that, in such elections, they can get their desired candidates elected and defeat judges perceived to be hostile to interests of business. The broader conservative movement, epitomized by the Federalist Society38 and the Heritage Foundation,39 has come to see elections as preferable to systems using nominating commissions (like those central to the Missouri Plan) based on the belief that commission lawyers tend to produce liberal judges. The argument is that lawyers tend to be more liberal than the general electorate, which leads to those commissions nominating lawyers who are also more liberal than the electorate.40 Conservatives also argue, relatedly, that domination of lawyers in the nominating process of the Missouri Plan makes the judicial selection process overly elitist and lacking in democratic legitimacy.41 Opponents of the Missouri Plan have learned that they can successfully argue the plan turns selection over to lawyers and deprives the public of its right to vote on who should be selected as a judge. Moreover, the argument goes, given the very small number of judges defeated in the Missouri Plan retention elections, judges would effectively be selected to serve until they die, reach mandatory retirement, or choose to depart the bench voluntarily. In several Missouri Plan states, conservative opponents of the plan have undertaken campaigns to end the requirement that the governor appoint from a list forwarded by a nominating commission in filling vacancies on appellate courts. These efforts have been successful or partially successful in two states: Tennessee for all appellate courts42 and Kansas for the state’s intermediate appellate courts.43
The results of the survey reported here raise the interesting question of whether proponents of systems that involve formalized nominating commissions, either via a full Missouri Plan system or a system of constrained appointment for all major judicial vacancies, could educate the public about at least four things:
To be effective, such a public education program would not need to be tied to any specific event but could include a range of activities over a period of years.
A major part of this challenge is the American love affair with elections. The number of state and local offices elected, even omitting judges, is probably unique to the United States. There are even places where the dog catcher (or “animal control officer” in modern parlance) is elected.44 This love affair, baffling to people in other countries,45 is one of the major roadblocks to Americans accepting nonelective systems of state judicial selection. It is debatable whether such a public education campaign could change voters’ views on how judges should be selected, but this research does suggest how such a campaign might be framed.
However, even if the public could be convinced that the intent of systems employing a nominating commission is to produce judges with the kind of qualifications citizens view as important in judges, it is not clear that the kinds of systems now in use actually will achieve that goal. Extensive research has sought to assess whether the different selection systems used by the American states do in fact differ in the qualifications possessed by the resulting judges. Although there may be some differences in the prior background of the judges (e.g., systems of legislative election put more former legislators on the bench than do other systems), the general conclusion is that there is little or no difference in the qualifications of the judges selected.46 Perhaps if there were a way to professionalize the screening process and design that process to go beyond the kind of reputational assessment now used, nominees would have better qualifications than those produced under the current system.47
Finally, one can ask whether more should be done to further specify the nature of the political characteristics that voters are concerned about regarding who should be selected as judges. Two of the characteristics that showed statistically significant differences between the two levels of courts (“active in community organizations” and “understands community preferences”) dealt specifically with the potential judges’ connections to the local community and presumably their understandings of the local community. It is unclear whether this translates to a belief that judges will act in accordance with community preferences when confronted with difficult cases. It is important to think about the difficulties in getting at this question. Recall that the two lowest-rated characteristics in Gibson’s study of Kentuckians’ views of state supreme court candidates were “decide the way the majority wants” and “base decisions on party affiliations.” Voters may be reluctant to state a preference for judicial decisions to be based on majority preferences or party characteristics, but may still have in the back of their minds a hope or expectation that this will actually be the case.
***This paper is an extended analysis of a survey reported in my recent book on judicial selection reform, Judicial Selection in the States: Politics and the Struggle for Reform (2020). Thanks to Lawrence Baum, Charles Gardner Geyh, James Gibson, and Melinda Gann Hall for helpful thoughts in the design of the survey reported below and/or on a draft of this paper. The funding for the survey was provided by the University of Minnesota Law School’s Steen Fund.***