MP abuse analysis Q1–3 2019: Supplementary materials

Sunburst diagram showing the amount and proportion of abuse in tweets to MPs.

The heatmap in the paper gives percentages of abuse for the topics that attracted the most abuse. In the version shown here, we see volumes for the topics that attracted significantly more abuse than typical across the entire period:

Topics heatmap – Brexit attracted the most abuse, especially from March to September

The paper gives one two-month time period, as an illustration.

The paper gives a timeline of discussion of religious hatred topics.

Topic detection

The words that appear most frequently for each of the 44 topics, or that are most representative of it, are given below to assist in understanding them.

arts_and_culture: mainly the word "cultural", followed by "library", "arts", "museum".

borders_and_immigration: mainly "racist" and "racism", followed by "immigration", "migrants", "asylum".

brexit: mainly "brexit".

business_and_enterprise: "companies", "businesses", "industry", "jobs", "manufacturing".

children_and_young_people: mainly the word "youth" followed by "students".

climate_change/environment: mainly "climate" and "climate change" followed by "environment", "energy".

community_and_society: mainly "council", "community", "communities" and "Muslim".

consumer_rights_and_issues: mainly "consumer" and "consumer rights".

crime_and_policing: mainly "police" and "crime", followed by "prison".

defence_and_armed_forces: mainly "army" and "military".

democracy: mainly "parliament", also "election".

uk_economy: mainly "economy", "austerity", "poverty", "economic".

employment: mainly "jobs".

equality_rights_and_citizenship: mainly "equality" and "citizenship".

europe: mainly "EU", followed by "European", "Europe".

financial_services: mainly "bank", followed by "debt".

food_and_farming: mostly the word "animal", followed by "farmers" and "farming".

foreign_affairs: mainly "USA", also "sanctions".

further_education: "FE", "education", "schools".

government_efficiency_transparency_and_accountability: "government transparency", "government waste".

government_spending: "government spending", "subsidy", "rebate".

higher_education: "university", "students", "tuition fees".

housing: "rent", "social housing", "landlords", "mortgage".

international_aid: "foreign aid", "ngo".

law_and_the_justice_system: mainly "law" and "laws".

local_government: "local government", also "council tax".

national_security: mostly "terrorism", also "ISIS", "national security", "bombing".

northern_ireland: "Northern Ireland", also "Belfast".

pensions_and_ageing_society: "DWP", followed by "pension", "pensions", "retirement".

planning_and_building: mainly "construction", followed by "architecture".

public_health: mainly "NHS", followed by "health", "mental health", "nurses", "patients".

public_safety_and_emergencies: mainly "surveillance", also "public safety".

rural_and_countryside: mainly "fishing", also "hunting", "countryside".

schools: mainly "school" and "schools". Also "education", "teachers".

science_innovation: "technology", "science", also "internet".

scotland: mainly "Scotland", also "Glasgow", "Edinburgh".

social_care: mainly "social care".

sports: mainly "sports".

tax_and_revenue: mainly "tax", also "taxes", "national insurance", "tax cuts".

transport: mainly "transport", "rail", also "public transport", "trains", "railways".

wales: mainly "Wales", also "Plaid Cymru".

welfare: mainly "welfare", also "social care", "food bank".

wildlife: "wildlife".

workers_rights: mainly "workers rights", "minimum wage".
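To make the term lists above more concrete, here is a minimal sketch of how term frequencies per topic could be counted in a tweet. It assumes, purely for illustration, that topic terms are found by case-insensitive whole-word matching; the paper's actual topic detection may work differently. The dictionary `TOPIC_TERMS` and the function `count_topic_terms` are hypothetical names, and only a small subset of the terms listed above is included.

```python
import re
from collections import Counter

# Hypothetical subset of the term lists above (illustration only).
TOPIC_TERMS = {
    "brexit": ["brexit"],
    "public_health": ["nhs", "health", "mental health", "nurses", "patients"],
    "transport": ["transport", "rail", "public transport", "trains", "railways"],
    "crime_and_policing": ["police", "crime", "prison"],
}


def count_topic_terms(text, topic_terms=TOPIC_TERMS):
    """Return {topic: Counter(term -> occurrences)} for terms found in text.

    Assumption: simple case-insensitive whole-word (or whole-phrase) matching.
    """
    lowered = text.lower()
    found = {}
    for topic, terms in topic_terms.items():
        counts = Counter()
        for term in terms:
            # Word boundaries stop "rail" from matching inside "trail".
            pattern = r"\b" + re.escape(term) + r"\b"
            occurrences = len(re.findall(pattern, lowered))
            if occurrences:
                counts[term] = occurrences
        if counts:
            found[topic] = counts
    return found


if __name__ == "__main__":
    tweet = "The NHS needs more nurses, and Brexit will not help the NHS."
    for topic, counts in count_topic_terms(tweet).items():
        print(topic, dict(counts))
```

Summing such counts over all tweets assigned to a topic would give a ranking of terms comparable to the "mainly ... followed by ..." descriptions above. Note that in this sketch a phrase such as "public transport" also counts towards the single word "transport".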

Abuse detection

Abuse detection makes use of lexica of slurs, offensive words and sensitive markers. These are combined using rules to decide whether abuse is present and whether it is aimed at the recipient of the tweet. The word lists are given below. First are the potentially sensitive identity words, which are not especially offensive. Following those are some very offensive words – WARNING!
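As a rough illustration of how lexica and rules might be combined, the sketch below flags a tweet as abusive and then checks whether the abuse appears to be aimed at the recipient. Everything in it is an assumption made for the example: the tiny word sets are mild stand-ins rather than the lists published here, and the three rules (an abusive word on its own; a sensitive word directly preceded by an offensive modifier; a second-person word within a few tokens before the hit) are hypothetical, not the rules used in the paper.

```python
import re

# Hypothetical stand-in lexica (illustration only, not the published lists).
ABUSIVE = {"idiot", "moron"}
OFFENSIVE_MODIFIERS = {"stupid", "disgusting"}
SENSITIVE = {"woman", "foreigner"}
SECOND_PERSON = {"you", "your", "youre", "u"}
WINDOW = 3  # assumed number of preceding tokens searched for a second-person word


def tokenize(text):
    """Lowercase, drop apostrophes ("you're" -> "youre"), keep runs of letters."""
    cleaned = text.lower().replace("'", "").replace("\u2019", "")
    return re.findall(r"[a-z]+", cleaned)


def detect_abuse(text):
    """Return {"abusive": bool, "directed": bool} under the assumed rules."""
    tokens = tokenize(text)
    hits = []
    for i, token in enumerate(tokens):
        # Rule 1 (assumed): an abusive word counts on its own.
        is_abusive_word = token in ABUSIVE
        # Rule 2 (assumed): a sensitive word counts only when an offensive
        # modifier comes directly before it.
        is_sensitive_combo = (
            token in SENSITIVE and i > 0 and tokens[i - 1] in OFFENSIVE_MODIFIERS
        )
        if is_abusive_word or is_sensitive_combo:
            hits.append(i)
    # Rule 3 (assumed): the abuse is aimed at the recipient when a
    # second-person word occurs within WINDOW tokens before a hit.
    directed = any(
        token in SECOND_PERSON
        for i in hits
        for token in tokens[max(0, i - WINDOW):i]
    )
    return {"abusive": bool(hits), "directed": directed}


if __name__ == "__main__":
    for example in ["You're an idiot", "He is an idiot", "A woman spoke today"]:
        print(example, "->", detect_abuse(example))
```

Keeping the sensitive words separate from the abusive ones means that a neutral mention of an identity term is not flagged by itself, which matches the distinction drawn above between sensitive and offensive words.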

Sensitive words

