Results 1 - 10
of
600
Analyzing Incomplete Political Science Data: An Alternative Algorithm for Multiple Imputation
- American Political Science Review
, 2000
"... We propose a remedy for the discrepancy between the way political scientists analyze data with missing values and the recommendations of the statistics community. Methodologists and statisticians agree that "multiple imputation" is a superior approach to the problem of missing data scatter ..."
Abstract
-
Cited by 419 (50 self)
- Add to MetaCart
We propose a remedy for the discrepancy between the way political scientists analyze data with missing values and the recommendations of the statistics community. Methodologists and statisticians agree that "multiple imputation" is a superior approach to the problem of missing data scattered through one's explanatory and dependent variables than the methods currently used in applied data analysis. The reason for this discrepancy lies with the fact that the computational algorithms used to apply the best multiple imputation models have been slow, difficult to implement, impossible to run with existing commercial statistical packages, and demanding of considerable expertise. In this paper, we adapt an existing algorithm, and use it to implement a generalpurpose, multiple imputation model for missing data. This algorithm is considerably faster and easier to use than the leading method recommended in the statistics literature. We also quantify the risks of current missing data practices, ...
2001): “Clarify: Software for Interpreting and Presenting Statistical Results
- Journal of Statistical Software
"... and distribute this program provided that no charge is made and the copy is identical to the original. To request an exception, please contact Michael Tomz. Contents 1 ..."
Abstract
-
Cited by 319 (3 self)
- Add to MetaCart
and distribute this program provided that no charge is made and the copy is identical to the original. To request an exception, please contact Michael Tomz. Contents 1
Logistic Regression in Rare Events Data
, 1999
"... We study rare events data, binary dependent variables with dozens to thousands of times fewer ones (events, such as wars, vetoes, cases of political activism, or epidemiological infections) than zeros (“nonevents”). In many literatures, these variables have proven difficult to explain and predict, a ..."
Abstract
-
Cited by 169 (4 self)
- Add to MetaCart
We study rare events data, binary dependent variables with dozens to thousands of times fewer ones (events, such as wars, vetoes, cases of political activism, or epidemiological infections) than zeros (“nonevents”). In many literatures, these variables have proven difficult to explain and predict, a problem that seems to have at least two sources. First, popular statistical procedures, such as logistic regression, can sharply underestimate the probability of rare events. We recommend corrections that outperform existing methods and change the estimates of absolute and relative risks by as much as some estimated effects reported in the literature. Second, commonly used data collection strategies are grossly inefficient for rare events data. The fear of collecting data with too few events has led to data collections with huge numbers of observations but relatively few, and poorly measured, explanatory variables, such as in international conflict data with more than a quarter-million dyads, only a few of which are at war. As it turns out, more efficient sampling designs exist for making valid inferences, such as sampling all available events (e.g., wars) and a tiny fraction of nonevents (peace). This enables scholars to save as much as 99 % of their (nonfixed) data collection costs or to collect much more meaningful explanatory
Labor-market competition and individual preferences over immigration policy
, 2000
"... This paper uses three years of individual-level data to analyze the determinants of individual preferences over immigration policy in the United States. We have two main empirical results. First, less-skilled workers are significantly more likely to prefer limiting immigrant inflows into the United ..."
Abstract
-
Cited by 133 (4 self)
- Add to MetaCart
This paper uses three years of individual-level data to analyze the determinants of individual preferences over immigration policy in the United States. We have two main empirical results. First, less-skilled workers are significantly more likely to prefer limiting immigrant inflows into the United States. Our finding suggests that, over the time horizons that are relevant to individuals when evaluating immigration policy, individuals think that the U.S. economy absorbs immigrant inflows at least partly by changing wages. Second, we find no evidence that the relationship between skills and immigration opinions is stronger in high-immigration communities.
Does Conflict Beget Conflict? Explaining Recurring Civil War.”
- Journal of Peace Research
, 2004
"... ..."
A Statistical Model for Multiparty Electoral Data
- American Political Science Review
, 1999
"... e propose a comprehensive statistical model for analyzing multiparty, district-level elections. This model, which provides a tool for comparative politics research analogous to that which regression analysis provides in the American two-party context, can be used to explain or predict how geographic ..."
Abstract
-
Cited by 64 (12 self)
- Add to MetaCart
e propose a comprehensive statistical model for analyzing multiparty, district-level elections. This model, which provides a tool for comparative politics research analogous to that which regression analysis provides in the American two-party context, can be used to explain or predict how geographic distributions of electoral results depend upon economic conditions, neighborhood ethnic compositions, campaign spending, and other features of the election campaign or aggregate areas. We also provide new graphical representations for data exploration, model evaluation, and substantive interpretation. We illustrate the use of this model by attempting to resolve a controversy over the size of and trend in the electoral advantage of incumbency in Britain. Contraiy to previous analyses, all based on measures now known to be biased, we demonstrate that the advantage is small but meaningfkl, varies substantially across the parties, and is not growing. Finally, we show how to estimate the party from which each party's advantage is predominantly drawn. w e propose the first internally consistent statistical model for analyzing multiparty, districtlevel aggregate election data. Our model can
What to do about missing values in time series cross-section data
, 2009
"... Applications of modern methods for analyzing data with missing values, based primarily on multiple imputation, have in the last half-decade become common in American politics and political behavior. Scholars in this subset of political science have thus increasingly avoided the biases and inefficien ..."
Abstract
-
Cited by 58 (8 self)
- Add to MetaCart
Applications of modern methods for analyzing data with missing values, based primarily on multiple imputation, have in the last half-decade become common in American politics and political behavior. Scholars in this subset of political science have thus increasingly avoided the biases and inefficiencies caused by ad hoc methods like listwise deletion and best guess imputation. However, researchers in much of comparative politics and international relations, and others with similar data, have been unable to do the same because the best available imputation methods work poorly with the time-series cross section data structures common in these fields. Weattempttorectify this situation with three related developments. First, we build a multiple imputation model that allows smooth time trends, shifts across cross-sectional units, and correlations over time and space, resulting in far more accurate imputations. Second, we enable analysts to incorporate knowledge from area studies experts via priors on individual missing cell values, rather than on difficult-to-interpret model parameters. Third, because these tasks could not be accomplished within existing imputation algorithms, in that they cannot handle as many variables as needed even in the simpler cross-sectional data for which they were designed, we also develop a new algorithm that substantially expands the range of computationally feasible data types and sizes for which multiple imputation can be used. These developments also make it possible to implement the methods introduced here in freely available open source software that is considerably more reliable than existing algorithms. We develop an approach to analyzing data with
Cultural cognition of scientific consensus.
- Journal of Risk Research, September,
, 2010
"... Abstract Why do members of the public disagree-sharply and persistently-about facts on which expert scientists largely agree? We designed a study to test a distinctive explanation: the cultural cognition of scientific consensus. The "cultural cognition of risk" refers to the tendency of i ..."
Abstract
-
Cited by 57 (6 self)
- Add to MetaCart
Abstract Why do members of the public disagree-sharply and persistently-about facts on which expert scientists largely agree? We designed a study to test a distinctive explanation: the cultural cognition of scientific consensus. The "cultural cognition of risk" refers to the tendency of individuals to form risk perceptions that are congenial to their values. The study presents both correlational and experimental evidence confirming that cultural cognition shapes individuals' beliefs about the existence of scientific consensus, and the process by which they form such beliefs, relating to climate change, the disposal of nuclear wastes, and the effect of permitting concealed possession of handguns. The implications of this dynamic for science communication and public policy-making are discussed.
Unpacking the Black Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies
- American Political Science Review
, 2011
"... Identifying causal mechanisms is a fundamental goal of social science. Researchers seek to study not only whether one variable affects another but also how such a causal relationship arises. Yet commonly used statistical methods for identifying causal mechanisms rely upon untestable assumptions and ..."
Abstract
-
Cited by 52 (8 self)
- Add to MetaCart
Identifying causal mechanisms is a fundamental goal of social science. Researchers seek to study not only whether one variable affects another but also how such a causal relationship arises. Yet commonly used statistical methods for identifying causal mechanisms rely upon untestable assumptions and are often inappropriate even under those assumptions. Randomizing treatment and intermediate variables is also insufficient. Despite these difficulties, the study of causal mechanisms is too important to abandon. We make three contributions to improve research on causal mechanisms. First, we present a minimum set of assumptions required under standard designs of experimental and observational studies and develop a general algorithm for estimating causal mediation effects. Second, we provide a method for assessing the sensitivity of conclusions to potential violations of a key assumption. Third, we offer alternative research designs for identifying causal mechanisms under weaker assumptions. The proposed approach is illustrated using media framing experiments and incumbency advantage studies. Over the last couple of decades, social scientists have given greater attention to methodological issues related to causation. This trend has led to a growing number of laboratory, field, and survey