Is the left-right ideology similar on average for those who did and not answer the item NatFrEst?

Final Coursework
Quantitative Data Analysis (POLS0083)

Section 1
Public perceptions of benefit fraud
Trust in social welfare institutions relies on how the public perceives of the deservingness of recipients of such
benefits. In particular, notions of so-called benefit cheats erodes public confidence in social welfare.
Past surveys have shown that the public tend to greatly over-estimate the financial scale of benefit fraud. For
example, a 2013 Ipsos MORI study found that on average, respondents guessed that £24 out of every £100
in benefit claims is done fraudulently, whereas the official estimate is around £0.70 of every £100.
In this section, we explore public perceptions about fraudulently claiming benefits, and in particular, about
how widespread false benefits claims are. More specifically, we will examine whether certain socio-demographic
characteristics are associated with public perceptions about how widely false benefit claims are made in the
UK.
We will use part of the British Social Attitudes (BSA) data set on poverty and wealth, which you can download
as bsa-poverty.csv from the POLS0083 Moodle page. The data set contains the following variables:
Variable name Description
NatFrEst Answer to the question Out of every 100 people receiving benefits in Britain,
how many have broken the law by giving false information to support their
claim?
leftrigh Five-point left-right ideological scale, with 0 to the left and 4 to the right
RSex Sex of respondent, 1 if male or 0 if female
HEdQual3 Completed university degree, 1 if the respondent completed degree or 0 if not
You can load the data set by using the following command:
bsa <- read.csv("data/bsa-poverty.csv") Questions (42 Marks) 1. How many individuals are included in the data set? 2. Is the left-right ideology similar on average for those who did and not answer the item NatFrEst? 3. Calculate the median of the variable NatFrEst. What does this tell us about the distribution of perceptions about fraudulent benefits claims? 4. Create a histogram for NatFrEst and interpret it. What does this tell us about public perceptions about fraudulent benefits claims? 5. We are interested in seeing whether there is a relationship between a persons left-right orientation and how widespread they think fraudulent benefit claims are. Fit the relevant simple linear regression model and interpret the substantive significance of the estimated slope coefficient. You do not need to discuss statistical significance. 6. State a null and an alternative hypothesis for the estimated slope coefficient, decide whether to reject the null hypothesis, and provide a conclusion. 7. How is your conclusion in Question 6 related to Type I and Type II error? 8. We now add whether the respondent completed a university degree and respondent sex to our analysis. Interpret the estimated coefficient for the left-right orientation. Does your answer to the question about the relationship between a persons left-right orientation and NatFrEst change? If so, how? 9. Calculate and interpret the 99% confidence interval for the estimated coefficient for completing a university degree from the model you fitted in Question 8. What does the estimated standard error tell 2 us? 10. Finally, we are interested in whether the relationship between left-right orientation and perceptions of how widespread benefit fraud is also depends on an individuals education level. a. Add the relevant interaction term to your regression model and display your results. b. Assess the model fit for this regression model. c. Interpret the estimated interaction term and discuss its substantive and statistical significance. d. Interpret the intercept and its statistical significance. Is the intercept meaningful in this regression model? e. Visually represent the results from the regression model using four lines (one each for male respondents with a university degree, female respondents with a university degree, male respondents without a university degree and female respondents without a university degree) and describe what your graph shows. 3 Section 2 Dystopian fiction and willingness to justify radical political action We now look at part of the recent study by Jones and Paris (2018), who conducted a series of survey experiments to examine whether exposure to dystopian fiction affects an individuals political beliefs. They define dystopian fiction as portray[ing] a dark and disturbing world dominated by an overwhelmingly powerful. . . controlling entity that acts to undermine core values. In particular, the authors hypothesised that exposure to dystopian fiction would lead to higher support for more radical political action, especially for more violent action. In the first study, the authors randomly assigned the US-based respondents into two groups. Respondents assigned to the treatment group first read an excerpt from the first book in Hunger Games, and then watched a 17-minute video with various violent scenes from the Hunger Games films. Respondents assigned to the control group were not exposed to any media. The premise of Hunger Games is that an overwhelmingly powerful government forces individuals to take part in a contest where the participants are forced to kill each other until only one survivor remains. The outcome variables are a series of attitudinal items about willingness to justify different disruptive activities: civil disobedience, damaging government property, cyberattacks on government websites, armed rebellion, and violent protest. The data file you will use, which can be downloaded on the POLS0083 Moodle page, is titled dystopia.csv. The data includes the following variables: Variable name Description hgindic Treatment group, with 1 for those exposed to Hunger Games and 0 for the control group j_disobed How much civil diobedience can be justified, on a 0-1 scale j_damage How much damaging government property can be justified, on a 0-1 scale j_cyber How much cyberattacks on government websites can be justified, on a 0-1 scale j_rebel How much armed rebellion can be justified, on a 0-1 scale j_violent How much violent protest can be justified, on a 0-1 scale female Respondent sex, with 1 as female and 0 as male ideo Respondent left-right ideological orientation, with higher scores indicating the right, on a 1-6 scale You can load the data set by using the following command: dystopia <- read.csv("data/dystopia.csv") Questions (33 Marks) 1. What are the proportions of female respondents within the treatment and control groups? 2. We are interested in the outcome variables for civil disobedience and armed rebellion. For each of these outcome variables: a. Create a boxplot for each treatment group and interpret the results. b. Calculate difference in means by treatment group and interpret the results. You do not need to discuss statistical significance. c. Conduct the appropriate hypothesis test and interpret the results for the difference in means. 4 3. In the current study, in which two ways can we understand the population from which the sample is drawn? 4. With what assumptions can we interpret the results from Question 2 as causal? 5. Fit a regression model for each outcome variable with the treatment group, sex, and ideology. Does the estimated difference in means change? What does this tell us about the randomisation of treatment in this study? 6. Do your results support the authors hypothesis about the link between dystopian fiction and willingness to support radical political action? State your conclusions in terms of substantial and statistical significance. 5 Section 3 Direct democracy and citizens local support Does direct democracy (such as local referendums) increase satisfaction with local policies and governmental institutions? Marien and Kern (2018) explore this question in their article The Winner Takes It All: Revisiting the Effect of Direct Democracy on Citizens Political Support. Drawing on the existing literature, the authors argue that direct democratic instruments have short-term positive effects on satisfaction with local policies and institutions, since they increase the perceived fairness of decision-making and ability to influence political outcomes among the electorate. To test their hypothesis, the authors conducted multiple surveys during the spring of 2015, when the Belgian city Mechelen conducted a referendum on traffic circulation, after citizens had objected to the local governments initial plans. The researchers collected survey data before and after the referendum took place both from a sample of local residents in Mechelen who lived in neighbourhoods affected by the traffic diversion plans (treatment group) and from a sample of residents of a similar neighbourhood in Mechelen not affected by the traffic circulation plans (control group). The data file you will use, which can be downloaded on the POLS0083 Moodle page, is titled referendum.csv. The data includes the following variables: Variable name Description idresp Respondent ID neighb Neighbourhood, 1 as the treatment group and 0 as the control w1_byear Respondent birth year primary Highest level of education completed - primary - with 1 Yes and 0 No secondary Highest level of education completed - secondary - with 1 Yes and 0 No tertiary Highest level of education completed - tertiary - with 1 Yes and 0 No w1_trust_general General level of trust on a 0-10 scale, with higher scores meaning more trust, before the referendum w1_pol_interest Political interest on a 0-10 scale, with higher scores meaning more interest, before the referendum w1_democracy_satis Democratic satisfaction on a 0-10 scale, before the referendum, with higher scores meaning higher satisfaction w2_democracy_satis Democratic satisfaction on a 0-10 scale, after the referendum, with higher scores meaning higher satisfaction You can load the data set by using the following command: referendum <- read.csv("data/referendum.csv") Questions (25 Marks) 1. Is the average level of democratic satisfaction before the referendum significantly different (statistically and substantively) between respondents in the treatment and control groups? 2. Provide some evidence that the treatment and control neighbourhoods are similar in terms of respondent characteristics before the referendum took place. 3. Focusing on the respondents living in a neighbourhood affected by the governments plan on traffic circulation, use a before-after design to estimate the average treatment effect. Is this difference significant both statistically and substatively? 4. With what assumptions can we consider the results of Question 3 as causal? 6 5. Calculate and interpret the difference-in-differences for democratic satisfaction for the two neighbourhoods and interpret your results. 6. Do the results in Question 5 support the authors hypothesis about the link between direct democracy and citizens perceptions about democracy? Why or why not? 7. With what assumptions can we consider the results of Question 6 as causal? 7