Academic Research - NYU’s Center for Social Media, AI, and Politics

CSMAP faculty, postdoctoral fellows, and students publish rigorous, peer-reviewed research in top academic journals and post working papers sharing ongoing work.

Journal Article
Quantifying Narrative Similarity Across Languages
Hannah Waight,

Sol Messing,

Anton Shirikov,

Margaret E. Roberts,

Jonathan Nagler,

Jason Greenfield,

Megan A. Brown,

Kevin Aslett,

Joshua A. Tucker
Sociological Methods & Research, 2025

How can one understand the spread of ideas across text data? This is a key measurement problem in sociological inquiry, from the study of how interest groups shape media discourse, to the spread of policy across institutions, to the diffusion of organizational structures and institutions themselves. To study how ideas and narratives diffuse across text, we must first develop a method to identify whether texts share the same information and narratives, rather than the same broad themes or exact features. We propose a novel approach to measure this quantity of interest, which we call “narrative similarity,” by using large language models to distill texts to their core ideas and then compare the similarity of claims rather than of words, phrases, or sentences. The result is an estimand much closer to narrative similarity than what is possible with past relevant alternatives, including exact text reuse, which returns lexically similar documents; topic modeling, which returns topically similar documents; or an array of alternative approaches. We devise an approach to providing out-of-sample measures of performance (precision, recall, F1) and show that our approach outperforms relevant alternatives by a large margin. We apply our approach to an important case study: The spread of Russian claims about the development of a Ukrainian bioweapons program in U.S. mainstream and fringe news websites. While we focus on news in this application, our approach can be applied more broadly to the study of propaganda, misinformation, diffusion of policy and cultural objects, among other topics.
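For readers unfamiliar with the performance measures named in the abstract, the following is a minimal sketch of how precision, recall, and F1 could be computed for a set of document pairs flagged as sharing a narrative. It is not the authors' evaluation code: the function name `precision_recall_f1`, the representation of matches as sets of document-ID pairs, and the example IDs are all assumptions made for illustration.

```python
def precision_recall_f1(predicted, actual):
    """Return (precision, recall, F1) for predicted vs. hand-labeled matches.

    Both arguments are collections of hashable items; here each item is a
    hypothetical (doc_id, doc_id) pair judged to share a narrative.
    """
    predicted, actual = set(predicted), set(actual)
    true_positives = len(predicted & actual)
    # Guard against empty sets so the sketch never divides by zero.
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(actual) if actual else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical example: pairs flagged by a method vs. pairs labeled by hand.
flagged = {("d1", "d2"), ("d1", "d3"), ("d2", "d4")}
labeled = {("d1", "d2"), ("d2", "d4"), ("d3", "d4")}
precision, recall, f1 = precision_recall_f1(flagged, labeled)
print(precision, recall, f1)
```

In this toy case two of the three flagged pairs are correct and two of the three labeled pairs are recovered, so all three measures equal two thirds.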
Area of Study

Data Science Methodology

Foreign Influence Campaigns
Date Posted

Jul 14, 2025
Tags

Methods,

Text and Content Analysis,

Ukraine,

Russia,

United States,

Large Language Models
Journal Article
Labeling Social Media Posts: Does Showing Coders Multimodal Content Produce Better Human Annotation, and a Better Machine Classifier?
Haohan Chen,

James Bisbee,

Joshua A. Tucker,

Jonathan Nagler
Political Science Research and Methods, 2025

The increasing multimodality (e.g., images, videos, links) of social media data presents opportunities and challenges. But text-as-data methods continue to dominate as modes of classification, as multimodal social media data are costly to collect and label. Researchers who face a budget constraint may need to make informed decisions regarding whether to collect and label only the textual content of social media data or their full multimodal content. In this article, we develop five measures and an experimental framework to assist with these decisions. We propose five performance metrics to measure the costs and benefits of multimodal labeling: average time per post, average time per valid response, valid response rate, intercoder agreement, and classifier’s predictive power. To estimate these measures, we introduce an experimental framework to evaluate coders’ performance under text-only and multimodal labeling conditions. We illustrate the method with a tweet labeling experiment.
Area of Study

Data Science Methodology

Elite & Mass Political Behavior
Date Posted

Jul 13, 2025
Tags

Text and Content Analysis,

Methods,

Twitter/X,

United States
Working Paper
The Effect of Deactivating Facebook and Instagram on Users’ Emotional State
Hunt Allcott,

Matthew Gentzkow,

Benjamin Wittenbrink,

Juan Carlos Cisneros,

Adriana Crespo-Tenorio,

Drew Dimmery,

Deen Freelon,

Sandra González-Bailón,

Andrew M. Guess,

Young Mie Kim,

David Lazer,

Neil Malhotra,

Devra Moehler,

Sameer Nair-Desai,

Brendan Nyhan,

Jennifer Pan,

Jaime Settle,

Emily Thorson,

Rebekah Tromble,

Carlos Velasco Rivera,

Arjun Wilkins,

Magdalena Wojcieszak,

Annie Franco,

Chad Kiewiet de Jonge,

Winter Mason,

Natalie Jomini Stroud,

Joshua A. Tucker
Working Paper, April 2025

We estimate the effect of social media deactivation on users’ emotional state in two large randomized experiments before the 2020 U.S. election. People who deactivated Facebook for the six weeks before the election reported a 0.060 standard deviation improvement in an index of happiness, depression, and anxiety, relative to controls who deactivated for just the first of those six weeks. People who deactivated Instagram for those six weeks reported a 0.041 standard deviation improvement relative to controls. Exploratory analysis suggests the Facebook effect is driven by people over 35, while the Instagram effect is driven by women under 25.
Area of Study

Elite & Mass Political Behavior
Date Posted

Jul 12, 2025
Tags

2020 Election,

Elections,

Facebook,

Instagram,

United States,

US 2020 Election Study
Working Paper
The Effects of Political Advertising on Facebook and Instagram Before the 2020 US Election
Hunt Allcott,

Matthew Gentzkow,

Ro’ee Levy,

Adriana Crespo-Tenorio,

Natasha Dumas,

Winter Mason,

Devra Moehler,

Pablo Barberá,

Taylor Brown,

Juan Carlos Cisneros,

Drew Dimmery,

Deen Freelon,

Sandra González-Bailón,

Andrew M. Guess,

Young Mie Kim,

David Lazer,

Neil Malhotra,

Sameer Nair-Desai,

Brendan Nyhan,

Ana Carolina Paixao de Queiroz,

Jennifer Pan,

Jaime Settle,

Emily Thorson,

Rebekah Tromble,

Carlos Velasco Rivera,

Benjamin Wittenbrink,

Magdalena Wojcieszak,

Shiqi Yang,

Saam Zahedian,

Annie Franco,

Chad Kiewiet de Jonge,

Natalie Jomini Stroud,

Joshua A. Tucker
Working Paper, May 2025

We study the effects of social media political advertising by randomizing subsets of 36,906 Facebook users and 25,925 Instagram users to have political ads removed from their news feeds for six weeks before the 2020 US presidential election. We show that most presidential ads were targeted toward parties’ own supporters and that fundraising ads were most common. On both Facebook and Instagram, we found no detectable effects of removing political ads on political knowledge, polarization, perceived legitimacy of the election, political participation (including campaign contributions), candidate favorability, and turnout. This was true overall and for both Democrats and Republicans separately.
Area of Study

Elite & Mass Political Behavior
Date Posted

Jul 12, 2025
Tags

2020 Election,

Elections,

Facebook,

Instagram,

United States,

US 2020 Election Study
Journal Article
Bottom Up? Top Down? Determinants of Issue-Attention in State Politics
Andreu Casas,

Oscar Stuhler,

Julia Payson,

Joshua A. Tucker,

Richard Bonneau,

Jonathan Nagler
The Journal of Politics, 2025

Who shapes the issue-attention cycle of state legislators? Although state governments make critical policy decisions, data and methodological constraints have limited researchers’ ability to study state-level agenda setting. For this paper, we collect more than 122 million Twitter messages sent by state and national actors in 2018 and 2021. We then employ supervised machine learning and time series techniques to study how the issue-attention of state lawmakers evolves vis-à-vis various local- and national-level actors. Our findings suggest that state legislators operate at the confluence of national and local influences. In line with arguments highlighting the nationalization of state politics, we find that state legislators are consistently responsive to policy debates among members of Congress. However, despite growing nationalization concerns, we also find strong evidence of issue responsiveness by legislators to members of the public in their states and moderate responsiveness to regional media sources.
Area of Study

Elite & Mass Political Behavior
Date Posted

Mar 28, 2025
Tags

2018 Election,

Twitter/X,

United States
Journal Article
To Moderate, or Not to Moderate: Strategic Domain Sharing by Congressional Campaigns
Maggie Macdonald,

Megan A. Brown,

Joshua A. Tucker,

Jonathan Nagler
Electoral Studies, 2025

We test whether candidates move to the extremes before a primary but then return to the center for the general election to appeal to the different preferences of each electorate. Incumbents are now more vulnerable to primary challenges than ever as social media offers a viable pathway for fundraising and messaging for challengers, while homogeneity of districts has reduced general election competitiveness. To assess candidates’ ideological trajectories, we estimate the messaging ideology of 2020 congressional campaigns before and after their primaries using a homophily-based measure of domains shared on Twitter. This method provides temporally granular data to observe changes in communication within a single election campaign cycle. We find suggestive evidence that incumbents in safe seats moved towards the extreme before their primaries and back towards the center for the general election, but only when threatened by a well-funded primary challenge.
Area of Study

Elite & Mass Political Behavior
Date Posted

Mar 17, 2025
Tags

2020 Election,

Twitter/X,

United States
Journal Article
Understanding Latino Political Engagement and Activity on Social Media
Marisa A. Abrajano,

Marianna Garcia,

Aaron Pope,

Edwin Kamau,

Robert Vidigal,

Joshua A. Tucker,

Jonathan Nagler
Political Research Quarterly, 2025

Social media is used by millions of Americans to access news and politics. Yet there are no studies, to date, examining whether these behaviors systematically vary for those whose political incorporation process is distinct from those in the majority. We fill this void by examining how Latino online political activity compares to that of white Americans and the role of language in Latinos’ online political engagement. We hypothesize that Latino online political activity is comparable to that of whites. Moreover, given media reports suggesting that greater quantities of political misinformation are circulating on Spanish versus English-language social media, we expect that reliance on Spanish-language social media for news predicts beliefs in inaccurate political narratives. Findings from our survey, which we believe to be the largest original survey of the online political activity of Latinos and whites, support these expectations. Latino social media political activity, as measured by sharing/viewing news, talking about politics, and following politicians, is comparable to that of whites, both in self-reported and digital trace data. Latinos also turned to social media for news about COVID-19 more often than did whites. Finally, Latinos’ reliance on Spanish-language social media for news predicts belief in election fraud in the 2020 U.S. presidential election.
Area of Study

Media Consumption

Online Information Environment

Elite & Mass Political Behavior
Date Posted

Feb 03, 2025
Tags

2022 Election,

Bilingual Election Monitor,

Covid-19,

Facebook,

Instagram,

Twitter/X,

United States,

WhatsApp,

YouTube
Working Paper
Web Scraping for Research: Legal, Ethical, Institutional, and Scientific Considerations
Megan A. Brown,

Andrew Gruen,

Gabe Maldoff,

Sol Messing,

Zeve Sanderson,

Michael Zimmer
Working Paper, December 2024

Scientists across disciplines often use data from the internet to conduct research, generating valuable insights about human behavior. However, as generative AI relying on massive text corpora becomes increasingly valuable, platforms have greatly restricted access to data through official channels. As a result, researchers will likely engage in more web scraping to collect data, introducing new challenges and concerns for researchers. This paper proposes a comprehensive framework for web scraping in social science research for U.S.-based researchers, examining the legal, ethical, institutional, and scientific factors that researchers should consider when scraping the web. We present an overview of the current regulatory environment impacting when and how researchers can access, collect, store, and share data via scraping. We then provide researchers with recommendations to conduct scraping in a scientifically legitimate and ethical manner. We aim to equip researchers with the relevant information to mitigate risks and maximize the impact of their research amidst this evolving data access landscape.
Area of Study

Data Science Methodology
Date Posted

Dec 19, 2024
Tags

Data Access,

United States
Journal Article
Concept-Guided Chain-of-Thought Prompting for Pairwise Comparison Scoring of Texts with Large Language Models
Patrick Y. Wu,

Jonathan Nagler,

Joshua A. Tucker,

Sol Messing
IEEE International Conference on Big Data, 2024

Existing text scoring methods require a large corpus, struggle with short texts, or require hand-labeled data. We develop a text scoring framework that leverages generative large language models (LLMs) to (1) set texts against the backdrop of information from the near-totality of the web and digitized media, and (2) effectively transform pairwise text comparisons from a reasoning problem to a pattern recognition task. Our approach, concept-guided chain-of-thought (CGCoT), utilizes a chain of researcher-designed prompts with an LLM to generate a concept-specific breakdown for each text, akin to guidance provided to human coders. We then pairwise compare breakdowns using an LLM and aggregate answers into a score using a probability model. We apply this approach to better understand speech reflecting aversion to specific political parties on Twitter, a topic that has commanded increasing interest because of its potential contributions to democratic backsliding. We achieve stronger correlations with human judgments than widely used unsupervised text scoring methods like Wordfish. In a supervised setting, besides a small pilot dataset to develop CGCoT prompts, our measures require no additional hand-labeled data and produce predictions on par with RoBERTa-Large fine-tuned on thousands of hand-labeled tweets. This project showcases the potential of combining human expertise and LLMs for scoring tasks.
Area of Study

Data Science Methodology

Political Polarization
Date Posted

Dec 15, 2024
Tags

Generative AI,

Twitter/X,

United States
Journal Article
The Diffusion and Reach of (Mis)Information on Facebook During the U.S. 2020 Election
Sandra González-Bailón,

David Lazer,

Pablo Barberá,

William Godel,

Hunt Allcott,

Taylor Brown,

Adriana Crespo-Tenorio,

Deen Freelon,

Matthew Gentzkow,

Andrew M. Guess,

Shanto Iyengar,

Young Mie Kim,

Neil Malhotra,

Devra Moehler,

Brendan Nyhan,

Jennifer Pan,

Carlos Velasco Rivera,

Jaime Settle,

Emily Thorson,

Rebekah Tromble,

Arjun Wilkins,

Magdalena Wojcieszak,

Chad Kiewiet de Jonge,

Annie Franco,

Winter Mason,

Natalie Jomini Stroud,

Joshua A. Tucker
Sociological Science, 2024

Social media creates the possibility for rapid, viral spread of content, but how many posts actually reach millions? And is misinformation special in how it propagates? We answer these questions by analyzing the virality of and exposure to information on Facebook during the U.S. 2020 presidential election. We examine the diffusion trees of the approximately 1 billion posts that were re-shared at least once by U.S.-based adults from July 1, 2020, to February 1, 2021. We differentiate misinformation from non-misinformation posts to show that (1) misinformation diffused more slowly, relying on a small number of active users that spread misinformation via long chains of peer-to-peer diffusion that reached millions; non-misinformation spread primarily through one-to-many affordances (mainly, Pages); (2) the relative importance of peer-to-peer spread for misinformation was likely due to an enforcement gap in content moderation policies designed to target mostly Pages and Groups; and (3) periods of aggressive content moderation proximate to the election coincide with dramatic drops in the spread and reach of misinformation and (to a lesser extent) political content.
Area of Study

Media Consumption

Online Information Environment
Date Posted

Dec 11, 2024
Tags

2020 Election,

Facebook,

United States,

US 2020 Election Study
Journal Article
How Reliance on Spanish-Language Social Media Predicts Beliefs in False Political Narratives Amongst Latinos
Marisa A. Abrajano,

Marianna Garcia,

Aaron Pope,

Robert Vidigal,

Joshua A. Tucker,

Jonathan Nagler
PNAS Nexus, 2024

False political narratives are nearly inescapable on social media in the United States. They are a particularly acute problem for Latinos, and especially for those who rely on Spanish-language social media for news and information. Studies have shown that Latinos are vulnerable to misinformation because they rely more heavily on social media and messaging platforms than non-Hispanic whites. Moreover, fact-checking algorithms are not as robust in Spanish as they are in English, and social media platforms put far more effort into combating misinformation on English-language media than Spanish-language media, which compounds the likelihood of being exposed to misinformation. As a result, we expect Latinos who use Spanish-language social media to be more likely to believe in false political narratives when compared with Latinos who primarily rely on English-language social media for news. To test this expectation, we fielded the largest online survey to date of social media usage and belief in political misinformation among Latinos. Our study, fielded in the months leading up to and following the 2022 midterm elections, examines a variety of false political narratives that were circulating in both Spanish and English on social media. We find that social media reliance for news predicts one’s belief in false political stories, and that Latinos who use Spanish-language social media have a higher probability of believing in false political narratives, compared with Latinos using English-language social media.
Area of Study

Online Information Environment

Media Consumption

Elite & Mass Political Behavior
Date Posted

Nov 19, 2024
Tags

2022 Election,

Bilingual Election Monitor,

United States,

Covid-19
Journal Article
News Sharing on Social Media: Mapping the Ideology of News Media, Politicians, and the Mass Public
Gregory Eady,

Richard Bonneau,

Joshua A. Tucker,

Jonathan Nagler
Political Analysis, 2024

This article examines the information sharing behavior of U.S. politicians and the mass public by mapping the ideological sharing space of political news on social media. As data, we use the near-universal currency of online information exchange: web links. We introduce a methodological approach and software to unify the measurement of ideology across social media platforms by using sharing data to jointly estimate the ideology of news media organizations, politicians, and the mass public. Empirically, we show that (1) politicians who share ideologically polarized content share, by far, the most political news and commentary and (2) that the less competitive elections are, the more likely politicians are to share polarized information. These results demonstrate that news and commentary shared by politicians come from a highly unrepresentative set of ideologically extreme legislators and that decreases in election pressures (e.g., by gerrymandering) may encourage polarized sharing behavior.
Area of Study

Elite & Mass Political Behavior

Media Consumption

Political Polarization

Data Science Methodology
Date Posted

Nov 19, 2024
Tags

Twitter/X,

United States
Journal Article
The Trump Advantage in Policy Recall Among Voters
Jan Zilinsky,

Joshua A. Tucker,

Jonathan Nagler
American Politics Research, 2024

Research in political science suggests campaigns have a minimal effect on voters’ attitudes and vote choice. We evaluate the effectiveness of the 2016 Trump and Clinton campaigns at informing voters by giving respondents an opportunity to name policy positions of candidates that they felt would make them better off. The relatively high rates of respondents’ ability to name a Trump policy that would make them better off suggest that the success of his campaign can be partly attributed to its ability to communicate memorable information. Our evidence also suggests that cable television informed voters: respondents exposed to higher levels of liberal news were more likely to be able to name Clinton policies, and voters exposed to higher levels of conservative news were more likely to name Trump policies; these effects hold even conditioning on respondents’ ideology and exposure to mainstream media. Our results demonstrate the advantages of using novel survey questions and provide additional insights into the 2016 campaign that challenge one part of the conventional narrative about the presumed non-importance of operational ideology.
Area of Study

Elite & Mass Political Behavior

Public Opinion
Date Posted

Oct 30, 2024
Tags

2016 Election,

United States
Journal Article
Measuring Receptivity to Misinformation at Scale on a Social Media Platform
Christopher K. Tokita,

Kevin Aslett,

William Godel,

Zeve Sanderson,

Joshua A. Tucker,

Jonathan Nagler,

Nathaniel Persily,

Richard Bonneau
PNAS Nexus, 2024

Measuring the impact of online misinformation is challenging. Traditional measures, such as user views or shares on social media, are incomplete because not everyone who is exposed to misinformation is equally likely to believe it. To address this issue, we developed a method that combines survey data with observational Twitter data to probabilistically estimate the number of users both exposed to and likely to believe a specific news story. As a proof of concept, we applied this method to 139 viral news articles and find that although false news reaches an audience with diverse political views, users who are both exposed and receptive to believing false news tend to have more extreme ideologies. These receptive users are also more likely to encounter misinformation earlier than those who are unlikely to believe it. This mismatch between overall user exposure and receptive user exposure underscores the limitation of relying solely on exposure or interaction data to measure the impact of misinformation, as well as the challenge of implementing effective interventions. To demonstrate how our approach can address this challenge, we then conducted data-driven simulations of common interventions used by social media platforms. We find that these interventions are only modestly effective at reducing exposure among users likely to believe misinformation, and their effectiveness quickly diminishes unless implemented soon after misinformation’s initial spread. Our paper provides a more precise estimate of misinformation’s impact by focusing on the exposure of users likely to believe it, offering insights for effective mitigation strategies on social media.
Area of Study

Data Science Methodology

Media Consumption

Online Information Environment

Political Polarization
Date Posted

Oct 08, 2024
Tags

2024 Election,

2020 Election,

Twitter/X,

United States
Working Paper
Reaching Across the Political Aisle: Overcoming Challenges in Using Social Media for Recruiting Politically Diverse Respondents
Maggie Macdonald,

Megan A. Brown,

Nejla Ašimović,

Rajeshwari Majumdar,

Lena Song,

Laura Huber,

Sarah Graham,

Abby Budiman,

Joshua A. Tucker,

Jonathan Nagler
Working Paper, August 2024

A challenge for public opinion surveys is achieving representativeness of respondents across demographic groups. We test the extent to which ideological alignment with a survey’s sponsor shapes differential partisan response and users’ choice of whether to participate in a research study on Facebook. While the use of Facebook advertisements for recruitment has increased in recent years and offers potential benefits, it can yield difficulties in recruiting politically representative samples. We recruit respondents for a short survey through two otherwise identical advertisements associated with either New York University (from a liberal state) or the University of Mississippi (from a conservative state). Contrary to our expectations, we don’t find an asymmetry in completion rates between self-reported Democrats and Republicans based on the survey sponsor. Nor do we find statistically significant differences in attitudes of respondents across the two survey sponsors when we control for observables.
Area of Study

Public Opinion

Data Science Methodology
Date Posted

Aug 13, 2024
Tags

Facebook,

United States
Journal Article
Digital Town Square? Nextdoor's Offline Contexts and Online Discourse
Megan A. Brown,

Zeve Sanderson,

Sarah Graham,

Minjoo Kim,

Joshua A. Tucker,

Sol Messing
Journal of Quantitative Description: Digital Media, 2024

There is scant quantitative research describing Nextdoor, the world's largest and most important hyperlocal social media network. Due to its localized structure, Nextdoor data are notoriously difficult to collect and work with. We build multiple datasets that allow us to generate descriptive analyses of the platform's offline contexts and online content. We first create a comprehensive dataset of all Nextdoor neighborhoods joined with U.S. Census data, which we analyze at the community level (block group). Our findings suggest that Nextdoor is primarily used in communities where the populations are whiter, more educated, more likely to own a home, and with higher levels of average income, potentially impacting the platform's ability to create new opportunities for social capital formation and citizen engagement. At the same time, Nextdoor neighborhoods are more likely to have active government agency accounts (law enforcement agencies in particular) where offline communities are more urban, have larger nonwhite populations, greater income inequality, and higher average home values. We then build a convenience sample of 30 Nextdoor neighborhoods, for which we collect daily posts and comments appearing in the feed (115,716 posts and 163,903 comments), as well as associated metadata. Among the accounts for which we collected posts and comments, posts seeking or offering services were the most frequent, while those reporting potentially suspicious people or activities received the highest average number of comments. Taken together, our study describes the ecosystem of and discussion on Nextdoor, as well as introduces data for quantitatively studying the platform.
Area of Study

Elite & Mass Political Behavior

Public Opinion
Date Posted

May 29, 2024
Tags

Nextdoor,

United States
Book
Online Data and the Insurrection
Megan A. Brown
Media and January 6th, 2024

Online data is key to understanding the leadup to the January 6 insurrection, including how and why election fraud conspiracies spread online, how conspiracy groups organized online to participate in the insurrection, and other factors of online life that led to the insurrection. However, there are significant challenges in accessing data for this research. First, platforms restrict which researchers get access to data, as well as what researchers can do with the data they access. Second, this data is ephemeral; that is, once users or the platform remove the data, researchers can no longer access it. These factors affect what research questions can ever be asked and answered.
Area of Study

Elite & Mass Political Behavior

Online Information Environment

Political Polarization
Date Posted

Mar 11, 2024
Tags

2020 Election,

Data Access,

United States
Journal Article
Estimating the Ideology of Political YouTube Videos
Angela Lai,

Megan A. Brown,

James Bisbee,

Richard Bonneau,

Joshua A. Tucker,

Jonathan Nagler
Political Analysis, 2024

We present a method for estimating the ideology of political YouTube videos. As online media increasingly influence how people engage with politics, quantifying the ideology of such media becomes more important for research. The subfield of estimating ideology as a latent variable has often focused on traditional actors such as legislators, while more recent work has used social media data to estimate the ideology of ordinary users, political elites, and media sources. We build on this work by developing a method to estimate the ideologies of YouTube videos, an important subset of media, based on their accompanying text metadata. First, we take Reddit posts linking to YouTube videos and use correspondence analysis to place those videos in an ideological space. We then train a text-based model with those estimated ideologies as training labels, enabling us to estimate the ideologies of videos not posted on Reddit. These predicted ideologies are then validated against human labels. Finally, we demonstrate the utility of this method by applying it to the watch histories of survey respondents with self-identified ideologies to evaluate the prevalence of echo chambers on YouTube. Our approach gives video-level scores based only on supplied text metadata, is scalable, and can be easily adjusted to account for changes in the ideological climate. This method could also be generalized to estimate the ideology of other items referenced or posted on Reddit.
Area of Study

Data Science Methodology

Media Consumption
Date Posted

Feb 13, 2024
Tags

Reddit,

YouTube,

United States
Journal Article
Online Searches to Evaluate Misinformation Can Increase its Perceived Veracity
Kevin Aslett,

Zeve Sanderson,

William Godel,

Nathaniel Persily,

Jonathan Nagler,

Joshua A. Tucker
Nature, 2024

Considerable scholarly attention has been paid to understanding belief in online misinformation, with a particular focus on social networks. However, the dominant role of search engines in the information environment remains underexplored, even though the use of online search to evaluate the veracity of information is a central component of media literacy interventions. Although conventional wisdom suggests that searching online when evaluating misinformation would reduce belief in it, there is little empirical evidence to evaluate this claim. Here, across five experiments, we present consistent evidence that online search to evaluate the truthfulness of false news articles actually increases the probability of believing them. To shed light on this relationship, we combine survey data with digital trace data collected using a custom browser extension. We find that the search effect is concentrated among individuals for whom search engines return lower-quality information. Our results indicate that those who search online to evaluate misinformation risk falling into data voids, or informational spaces in which there is corroborating evidence from low-quality sources. We also find consistent evidence that searching online to evaluate news increases belief in true news from low-quality sources, but inconsistent evidence that it increases belief in true news from mainstream sources. Our findings highlight the need for media literacy programmes to ground their recommendations in empirically tested strategies and for search engines to invest in solutions to the challenges identified here.
Area of Study

Online Information Environment
Date Posted

Dec 20, 2023
Tags

Google,

Covid-19,

United States
Working Paper
Large Language Models Can Be Used to Estimate the Latent Positions of Politicians
Patrick Y. Wu,

Jonathan Nagler,

Joshua A. Tucker,

Sol Messing
Working Paper, September 2023
View Article View abstract

Existing approaches to estimating politicians' latent positions along specific dimensions often fail when relevant data is limited. We leverage the embedded knowledge in generative large language models (LLMs) to address this challenge and measure lawmakers' positions along specific political or policy dimensions. We prompt an instruction/dialogue-tuned LLM to pairwise compare lawmakers and then scale the resulting graph using the Bradley-Terry model. We estimate novel measures of U.S. senators' positions on liberal-conservative ideology, gun control, and abortion. Our liberal-conservative scale, used to validate LLM-driven scaling, strongly correlates with existing measures and offsets interpretive gaps, suggesting LLMs synthesize relevant data from internet and digitized media rather than memorizing existing measures. Our gun control and abortion measures -- the first of their kind -- differ from the liberal-conservative scale in face-valid ways and predict interest group ratings and legislator votes better than ideology alone. Our findings suggest LLMs hold promise for solving complex social science measurement problems.
Area of Study

Data Science Methodology

Elite & Mass Political Behavior
Date Posted

Sep 26, 2023
Tags

United States,

Generative AI
Journal Article
Like-Minded Sources On Facebook Are Prevalent But Not Polarizing
Brendan Nyhan,

Jaime Settle,

Emily Thorson,

Magdalena Wojcieszak,

Pablo Barberá,

Annie Y. Chen,

Hunt Allcott,

Taylor Brown,

Adriana Crespo-Tenorio,

Drew Dimmery,

Deen Freelon,

Matthew Gentzkow,

Sandra González-Bailón,

Andrew M. Guess,

Edward Kennedy,

Young Mie Kim,

David Lazer,

Neil Malhotra,

Devra Moehler,

Jennifer Pan,

Daniel Robert Thomas,

Rebekah Tromble,

Carlos Velasco Rivera,

Arjun Wilkins,

Beixian Xiong,

Chad Kiewiet De Jong,

Annie Franco,

Winter Mason,

Natalie Jomini Stroud,

Joshua A. Tucker
Nature, 2023
View Article View abstract

Many critics raise concerns about the prevalence of ‘echo chambers’ on social media and their potential role in increasing political polarization. However, the lack of available data and the challenges of conducting large-scale field experiments have made it difficult to assess the scope of the problem. Here we present data from 2020 for the entire population of active adult Facebook users in the USA showing that content from ‘like-minded’ sources constitutes the majority of what people see on the platform, although political information and news represent only a small fraction of these exposures. To evaluate a potential response to concerns about the effects of echo chambers, we conducted a multi-wave field experiment on Facebook among 23,377 users for whom we reduced exposure to content from like-minded sources during the 2020 US presidential election by about one-third. We found that the intervention increased their exposure to content from cross-cutting sources and decreased exposure to uncivil language, but had no measurable effects on eight preregistered attitudinal measures such as affective polarization, ideological extremity, candidate evaluations and belief in false claims. These precisely estimated results suggest that although exposure to content from like-minded sources on social media is common, reducing its prevalence during the 2020 US presidential election did not correspondingly reduce polarization in beliefs or attitudes.
Area of Study

Elite & Mass Political Behavior

Media Consumption

Political Polarization

Online Information Environment
Date Posted

Jul 27, 2023
Tags

2020 Election,

Data Access,

Facebook,

Instagram,

United States,

US 2020 Election Study
Journal Article
How Do Social Media Feed Algorithms Affect Attitudes and Behavior in an Election Campaign?
Andrew M. Guess,

Neil Malhotra,

Jennifer Pan,

Pablo Barberá,

Hunt Allcott,

Taylor Brown,

Adriana Crespo-Tenorio,

Drew Dimmery,

Deen Freelon,

Matthew Gentzkow,

Sandra González-Bailón,

Edward Kennedy,

Young Mie Kim,

David Lazer,

Devra Moehler,

Brendan Nyhan,

Jaime Settle,

Carlos Velasco Rivera,

Daniel Robert Thomas,

Emily Thorson,

Rebekah Tromble,

Beixian Xiong,

Chad Kiewiet De Jong,

Annie Franco,

Winter Mason,

Natalie Jomini Stroud,

Joshua A. Tucker
Science, 2023
View Article View abstract

We investigated the effects of Facebook’s and Instagram’s feed algorithms during the 2020 US election. We assigned a sample of consenting users to reverse-chronologically-ordered feeds instead of the default algorithms. Moving users out of algorithmic feeds substantially decreased the time they spent on the platforms and their activity. The chronological feed also affected exposure to content: The amount of political and untrustworthy content they saw increased on both platforms, the amount of content classified as uncivil or containing slur words they saw decreased on Facebook, and the amount of content from moderate friends and sources with ideologically mixed audiences they saw increased on Facebook. Despite these substantial changes in users’ on-platform experience, the chronological feed did not significantly alter levels of issue polarization, affective polarization, political knowledge, or other key attitudes during the 3-month study period.
Area of Study

Elite & Mass Political Behavior

Media Consumption

Online Information Environment

Political Polarization
Date Posted

Jul 27, 2023
Tags

2020 Election,

Data Access,

Instagram,

Facebook,

United States,

US 2020 Election Study
Journal Article
Reshares on Social Media Amplify Political News But Do Not Detectably Affect Beliefs or Opinions
Andrew M. Guess,

Neil Malhotra,

Jennifer Pan,

Pablo Barberá,

Hunt Allcott,

Taylor Brown,

Adriana Crespo-Tenorio,

Drew Dimmery,

Deen Freelon,

Matthew Gentzkow,

Sandra González-Bailón,

Edward Kennedy,

Young Mie Kim,

David Lazer,

Devra Moehler,

Brendan Nyhan,

Carlos Velasco Rivera,

Jaime Settle,

Daniel Robert Thomas,

Emily Thorson,

Rebekah Tromble,

Arjun Wilkins,

Magdalena Wojcieszak,

Beixian Xiong,

Chad Kiewiet De Jong,

Annie Franco,

Winter Mason,

Natalie Jomini Stroud,

Joshua A. Tucker
Science, 2023
View Article View abstract

We studied the effects of exposure to reshared content on Facebook during the 2020 US election by assigning a random set of consenting, US-based users to feeds that did not contain any reshares over a 3-month period. We find that removing reshared content substantially decreases the amount of political news, including content from untrustworthy sources, to which users are exposed; decreases overall clicks and reactions; and reduces partisan news clicks. Further, we observe that removing reshared content produces clear decreases in news knowledge within the sample, although there is some uncertainty about how this would generalize to all users. Contrary to expectations, the treatment does not significantly affect political polarization or any measure of individual-level political attitudes.
Area of Study

Elite & Mass Political Behavior

Media Consumption

Online Information Environment

Political Polarization
Date Posted

Jul 27, 2023
Tags

2020 Election,

Facebook,

Instagram,

United States,

US 2020 Election Study
Journal Article
Asymmetric Ideological Segregation In Exposure To Political News on Facebook
Sandra González-Bailón,

David Lazer,

Pablo Barberá,

Meiqing Zhang,

Hunt Allcott,

Taylor Brown,

Adriana Crespo-Tenorio,

Deen Freelon,

Matthew Gentzkow,

Andrew M. Guess,

Shanto Iyengar,

Young Mie Kim,

Neil Malhotra,

Devra Moehler,

Brendan Nyhan,

Jennifer Pan,

Carlos Velasco Rivera,

Jaime Settle,

Emily Thorson,

Rebekah Tromble,

Arjun Wilkins,

Magdalena Wojcieszak,

Chad Kiewiet De Jong,

Annie Franco,

Winter Mason,

Joshua A. Tucker,

Natalie Jomini Stroud
Science, 2023
View Article View abstract

Does Facebook enable ideological segregation in political news consumption? We analyzed exposure to news during the US 2020 election using aggregated data for 208 million US Facebook users. We compared the inventory of all political news that users could have seen in their feeds with the information that they saw (after algorithmic curation) and the information with which they engaged. We show that (i) ideological segregation is high and increases as we shift from potential exposure to actual exposure to engagement; (ii) there is an asymmetry between conservative and liberal audiences, with a substantial corner of the news ecosystem consumed exclusively by conservatives; and (iii) most misinformation, as identified by Meta’s Third-Party Fact-Checking Program, exists within this homogeneously conservative corner, which has no equivalent on the liberal side. Sources favored by conservative audiences were more prevalent on Facebook’s news ecosystem than those favored by liberals.
Area of Study

Media Consumption

Elite & Mass Political Behavior

Online Information Environment

Political Polarization
Date Posted

Jul 27, 2023
Tags

2020 Election,

Data Access,

Facebook,

Instagram,

United States,

US 2020 Election Study
Journal Article
Measuring the Ideology of Audiences for Web Links and Domains Using Differentially Private Engagement Data
Cody L. Buntain,

Richard Bonneau,

Jonathan Nagler,

Joshua A. Tucker
Proceedings of the International AAAI Conference on Web and Social Media, 2023
View Article View abstract

This paper demonstrates the use of differentially private hyperlink-level engagement data for measuring ideologies of audiences for web domains, individual links, or aggregations thereof. We examine a simple metric for measuring this ideological position and assess the conditions under which the metric is robust to injected, privacy-preserving noise. This assessment provides insights into and constraints on the level of activity one should observe when applying this metric to privacy-protected data. Grounding this work is a massive dataset of social media engagement activity where privacy-preserving noise has been injected into the activity data, provided by Facebook and the Social Science One (SS1) consortium. Using this dataset, we validate our ideology measures by comparing to similar, published work on sharing-based, homophily- and content-oriented measures, where we show consistently high correlation (>0.87). We then apply this metric to individual links from several popular news domains and demonstrate how one can assess link-level distributions of ideological audiences. We further show this estimator is robust to selection of engagement types besides sharing, where domain-level audience-ideology assessments based on views and likes show no significant difference compared to sharing-based estimates. Estimates of partisanship, however, suggest the viewing audience is more moderate than the audiences who share and like these domains. Beyond providing thresholds on sufficient activity for measuring audience ideology and comparing three types of engagement, this analysis provides a blueprint for ensuring robustness of future work to differential privacy protections.
Area of Study

Data Science Methodology
Date Posted

Jun 02, 2023
Tags

Facebook,

United States
