P296: LLMs and the language sciences: material, semiotic, and linguistic perspectives from STS and linguistic anthropology

Click the star to add/remove an item to/from your individual schedule.
You need to be logged in to avail of this functionality. Log in

P296

LLMs and the language sciences: material, semiotic, and linguistic perspectives from STS and linguistic anthropology

Convenors:: Anna Weichselbraun (University of Vienna)
Michael Castelle (University of Warwick)
Siri Lamoureaux (University of Siegen)
Send message to Convenors

Chair:: Siri Lamoureaux (University of Siegen)

Format:: Traditional Open Panel

Location:: HG-05A24

Sessions:: Friday 19 July, 9:00-10:30, 11:00-12:30
Time zone: Europe/Amsterdam

Short Abstract:

This panel convenes linguistic anthropology and STS to consider how large language models (LLMs) emerge from language-ideological and material-semiotic practices. We invite papers that contribute to better understanding the constitutive work of LLMs at multiple stages and via multiple stakeholders.

Long Abstract:

This panel brings together linguistic anthropology and STS to consider how large language models (LLMS) are transforming both the language sciences (linguistics, computational linguistics, NLP) and technosocial practices. With the release of OpenAI’s ChatGPT in 2022, discussions have focused on what its uncannily human-like generated text means for politics, education, knowledge production, authorship, sociality, care, etc. Despite the interest of social scientists in critiques of LLMs over, e.g., the reproduction of biases in datasets, minority representation, and energy consumption, much of this work now takes place in computing sciences and/or industry. Few of these internalist critiques, however, center language as a form of social and cultural action, the purview of linguistic anthropology. This panel addresses this gap, encouraging both linguistic anthropologists and scholars of the language sciences to interrogate the construction, development, and imaginaries surrounding LLMs.

Linguistic anthropology challenges Enlightenment notions of “language” and “representation” prevalent in the computational and social sciences, and instead emphasizes the situated, pragmatic and indexical functions of language. While interested in the technological mediation of language, it has largely overlooked the transformation of the concept of language by computational linguists, scholars in NLP, and the designers of programming “languages”. In turn, posthumanist trends in STS have favored a “material-semiotics” set in opposition to language-as-representation — a contrast actively dissolved by LLMs. Both fields could contribute to new understandings of LLMs.

These papers contribute to understanding the work of LLMs on questions of (1) language policy and governance (e.g., how do policymakers understand LLMs? What language ideologies motivate the efforts to police LLMs?); (2) R&D in practice (what are the implicit or explicit language ideologies of individuals, professions or companies developing large language models?); (3) users and implementation (with what expectations do users encounter and interact with LLMs? What do these reveal about language practices?)

Accepted papers:

Session 1 Friday 19 July, 2024, 9:00-10:30

Contextualizing high-dimensional communication: the relevance of linguistic anthropology for theorizing large language models

Michael Castelle (University of Warwick)

Send message to Author

Short abstract:

This paper provides an overview of the fundamental relevance of the field of linguistic anthropology to the understanding and/or critique of Large Language Models, clarifying aspects of current debates about LLMs within NLP research as well as among those in the social science and humanities.

Long abstract:

At the core of the controversiality of Large Language Models (LLMs), on the one hand, their implicit rejection of influential theories in mainstream linguistics and cognitive science; and on the other, the unconscious adoption of interactional paradigms — such as the overt dialogicality of the "instruction-tuned" ChatGPT — championed more frequently in the humanities and social sciences as fundamental to sense-making. Indeed, many computer scientists in contemporary NLP do not find it necessary to concern themselves with the wide variety of past or present theories of language and learning. However, a prominent and arguably misguided assumption has been made by members of the AI research community that applying increasing scale to these models' training data, training time, and/or architectural size is likely to lead to the achievement of superhuman intelligence; this perspective, like many individualist approaches to cognition, necessarily downplays the role of indexical embodiment, social interaction, and contextually reflexive cultural practices in already-sociotechnical human communication. I will argue that a better understanding of the field known as linguistic anthropology can help understand both current and future successes and failure modes of LLMs, as well as helping social scientists and humanists to avoid some common, but misguided, avenues of critique for LLMs. From century-old works of American anthropological linguistics to the more contemporary insights of Michael Silverstein's theories of pragmatics and metapragmatics, this resolutely empirical — but semiotically and ethnographically well-grounded — school of thought provides surprising insights into both the intriguing strengths and fundamental limits of these computational artifacts.

LLMs as conversational agents in the interview society

Spencer Kaplan (Yale University)

Send message to Author

Short abstract:

This paper studies the language ideologies of AI research through ethnographic fieldwork in San Francisco. Here, LLM-based conversational agents invoke the ideologies of the interview society. These ideologies inform LLMs’ perceived capacities and authority yet pose certain hazards for their use.

Long abstract:

This paper examines the language ideologies motivating generative AI development in the San Francisco Bay Area. To do so, it offers two case studies from ongoing ethnographic fieldwork among AI researchers in the region. The first case comes from the subfield of AI Safety, which seeks to “align” AI models with so-called human values. Here, researchers employ LLMs as “conversational agents” that supposedly discern human interlocutors’ underlying values through deliberative interaction. The second case comes from researchers’ use of LLMs in their personal lives. It describes efforts to fine-tune models like OpenAI’s ChatGPT with transcripts from discussions about topics like AI’s societal implications, creating purported conversational experts in the topics.

Both cases exemplify the use of LLMs as technologies that collect conversational data for interpretation through abduction. In such an application, LLMs employ the interactional and epistemic techniques of the “interview society” as described by Atkinson, Silverman, and, later, Briggs. Here, interviews offer privileged and authoritative access to knowledge that is otherwise hidden—especially knowledge about persons. To do so, interviews invoke Liberal and Romantic language ideologies about public reason, inner expression, and authenticity. In conversational agents, these ideologies now inform the perceived capacities and authority of LLMs. Yet interviews are always partial and positioned, posing hazards for LLM applications. These hazards are already faced by social researchers engaging in interview methodologies. By approaching interviews as an interactional form common to LLMs and social researchers alike, this paper also raises important reflexive questions for scholars of AI.

Crafting Arabic: the work of metaphor in constructing LLMs within Jordan’s digital content ‘powerhouse'

Tariq Adely (George Washington University)

Send message to Author

Short abstract:

This paper draws on ethnographic research with language workers involved in producing LLMs and other AI technologies in Amman, Jordan to analyze metaphors—and the language ideologies animating them—as crucial material-semiotic practices for rendering Arabic an object of technological advancement.

Long abstract:

In May 2023, OpenAI CEO Sam Altman spoke at the Xpand Technology Conference in Amman, Jordan about artificial intelligence and large language models (LLMs). In his opening remarks, tech mogul and moderator Fouad Jeryes emphasized the significance of the event happening in Jordan: “We remain a powerhouse here for tech…we are the creators of the overwhelming majority of [Arabic] content on the Internet.”

Discourses about Jordan’s inordinate production of digital content have circulated for over a decade but have taken on greater importance with the rise of LLMs and other AI-enabled technologies built on massive language corpora. Today, language workers in Jordan’s tech sector—a historically Anglocentric industry—not only accumulate socio-economic capital through their Arabic competencies; through everyday labor and discursive practices, they craft Arabic into a data-rich language of technological advancement.

This paper draws on a year of ethnographic research with language workers who help build and maintain LLMs in Amman’s tech sector—e.g., annotators, proofreaders, lexicographers—to understand the constitutive work of metaphor in constructing language technologies. Drawing on semi-structured interviews and participant observation, it analyzes the language ideologies that animate these metaphors and the mobilization of metaphor—successful or not—to translate technical concepts grounded in Anglocentric assumptions of how language ought to work. Bringing together STS and anthropological scholarship on metaphor (DeLoughrey 2013), language ideologies (Bauman and Briggs 2003), and language work (Orr 1996), this paper centers metaphor as a crucial material-semiotic practice for producing complex sociotechnical systems like LLMs, especially in linguistic contexts and political economies outside the Global North.

Human tests for machine models: listening for the echo of the archive

Anna Weichselbraun (University of Vienna) Noya Kohavi

Session 2 Friday 19 July, 2024, 11:00-12:30

The problem of alignment

Tsvetelina Hristova (Southampton University) Liam Magee (Western Sydney University)

Send message to Authors

Short abstract:

The paper explores how historical opposition between deep structure and surface statistics in linguistics has organised understanding of the relationship between language and meaning. As LLMs today struggle to align with human norms, revisiting these debates can clarify the aims of machine training.

Long abstract:

Large Language Models produce sequences learned as statistical patterns from large corpora. In order not to reproduce corpus biases, after initial training models must be aligned with human values, preferencing certain continuations over others. This supplementary process can be viewed as the superimposition of normative structure onto a statistical model. We examine one practice of this structuration in how ChatGPT4 redacts and interprets fragments of Joyce’s Ulysses, a text that deliberately contravenes literary norms. We demonstrate that despite observing the form of the text, its idiosyncrasies and ‘literariness’ of the text are smoothed over in the model’s rearticulation. We then situate this alignment problem historically, revisiting earlier postwar linguistic debates which counterposed two views of meaning: as discrete structures, and as continuous probability distributions. We discuss the largely occluded work of the Moscow Linguistic School, which sought to reconcile this opposition by studying language as a communicative system in which its elements are both coordinated relationally (as structuralism argued) and occur with differential frequency, according to extra-linguistic social norms (as speech act and information theory suggested). Our attention to the Moscow School and later related arguments by Searle and Kristeva casts the problem of alignment in a new light: as one involving attention to the social structuration of linguistic practice, including structuration of anomalies that, like the Joycean text, exist in defiance of expressive conventions. These debates around the communicative orientation toward language can help explain some of the contemporary behaviours and interdependencies that take place between users and LLMs.

Large language models in everyday life

Mark Dingemanse (Radboud University) Andreas Liesenfeld

The chatbot’s real self: textuality, cultural concepts, and the archaeology of artificial personas

Courtney Handman (University of Texas at Austin)

Finding consensus in a polarized world: LLMs and public opinion in deliberative democracy

Janet Connor (Leiden University)