P116: Experiments with computer vision: transforming and re-envisioning visual data

Click the star to add/remove an item to/from your individual schedule.
You need to be logged in to avail of this functionality. Log in

P116

Experiments with computer vision: transforming and re-envisioning visual data

Convenors:: Chrys Vilvang (Concordia University)
Gabriel Pereira (University of Amsterdam)
Bruno Moreschi (Collegium Helveticum ETHZ)
Aikaterini Mniestri (London School of Economics and Political Science)
Send message to Convenors

Format:: Combined Format Open Panel

Location:: Agora 4, main building

Sessions:: Friday 19 July, 14:30-16:00, 16:30-18:00
Time zone: Europe/Amsterdam

Short Abstract:

This panel looks at the theoretical and practical aspects of algorithmic image processing, exploring the data techniques that train and enable machine learning and computer vision. How can these sociotechnical processes be reimagined to foster more radical ways of seeing the world through machines?

Long Abstract:

An age-old adage says that a picture is worth a thousand words. Although this has taken the meaning that an image can hold much information, it also reminds us that images are multifaceted and may contain within them multiple interpretations, practices, and subjective perceptions.

This panel engages with the way images have become a constitutive part of algorithmic processing systems today, particularly as they are variously used to constitute training data sets for machine learning. It builds upon much recent STS work that has sought to understand (and transform) the relations between images and algorithms, particularly within "critical data set studies" (Thylstrup), "ways of machine seeing" (Azar et al), or even "platform seeing" (Mackenzie & Munster).

The panel deals critically with the way images are organized, tagged, curated, and otherwise made to work within algorithmic pipelines, and the sociotechnical processes that they enable. Questions may include: How do image data sets constitute computer vision? How do image tracking algorithms define and represent minoritized bodies? What are other, more critical ways that data sets could be constituted? What human practices (beyond the images themselves) are not being highlighted in computer vision? How is/could fake or synthetic data enable alternative data sets?

This Combined Format Open Panel welcomes academic paper presentations, but also encourages scholars, artists, and activists to experiment with other forms of knowledge expression, particularly artistic and practice-based methodologies. These can be shown as, e.g., video essays, net art, short workshops, interactive modes of presentations, etc. Please include details on how your contribution would be best performed and we'll work to manage the different needs of selected contributors. We are open to academic research, but welcome more artistic and experimental formats, especially those that "think outside the box".

Accepted contributions:

Session 1 Friday 19 July, 2024, 14:30-16:00

Imagi(ni)ng realities: enacting referential chains within computer vision

Isak Engdahl (Lund University)

What the good memory looks like: how computer vision resurfaces your photos (and your past)

Chrys Vilvang (Concordia University)

Multimodal fabulations in understanding image recognition algorithms.

Ildikó Plájás (University of Amsterdam)

Green crossings: student/machine learning in the life sciences

Amy Cheatle (Cornell University) Bernie Boscoe (Southern Oregon University)

Navigating and drawing through a data-set: An effort to circumvent the meaning boxes way of seeing

Gabriella Gonçalles (Hochschule für Kunst Bremen)

Session 2 Friday 19 July, 2024, 16:30-18:00

The algorithm will see you now: on the distorted re-envisioning of trans bodies through reverse image searching algorithms

Aikaterini Mniestri (London School of Economics and Political Science)

Send message to Author

Short abstract:

How do image tracking algorithms represent minoritized bodies? I have used situational mapping to capture the misappropriation of trans bodies across different internet locations, as those resurface through reverse search algorithms. What we see through the software becomes the focus of critique.

Long abstract:

It is common practice to post images of one’s body on social media. For some, this constitutes a routine, a naturalized aspect of networked social existence. For others, the digital publication of images of their bodies represents a contentious practice, a call for solidarity. In particular, trans-identifying content creators post images of their bodies to their followers to normalize the diversity of the trans experience. Their images invite viewers to embrace the presentation of different gender expressions and, potentially, to embrace their own non-normative physique in a society that is otherwise dominated by strict stereotypes of gender presentation. However, these images are downloaded, re-uploaded and misappropriated by various actors, who are not necessarily sympathetic or mindful of the original creator’s intentions. I used reverse-image-searching tools to investigate the misappropriation of images of trans bodies across the web. This method yields an algorithmically-curated mapping of the locations, where these images resurface online. Using situational mapping, I captured the misappropriation of these images by third-party-actors so as to draw attention to the double bind of the online representation of trans bodies. On one hand, trans content creators use social media platforms to achieve the broadest possible visibility. On the other hand, third parties take advantage of the public nature of social media platforms to misappropriate images of trans bodies for their own ends. Ultimately, this paper encourages readers to mind the numerous ways in which the non-normative body is treated online and to question how the attention economy affects different bodies.

I've never been to Acapulco. But I like it there.

Bruno Moreschi (Collegium Helveticum ETHZ)

Seeing like algorithmic surveillance

Gabriel Pereira (University of Amsterdam)

Excavating the interesting: surfacing scenarios in autonomous vehicle training datasets

Sam Hind (University of Manchester)

Send message to Author

Short abstract:

Offering a comparison between two training datasets, this paper considers the role of ‘interestingness’ as an empirical quality sought after by machine vision researchers. In such cases, the search for interestingness leads researchers to design elaborate ways to define, categorize, and quantify it.

Long abstract:

In 2012, the KITTI Vision Benchmark Suite was launched, a training dataset used to compare real-world benchmarks useful for the development of autonomous vehicles. Funded through a collaboration between the Karlsruhe Institute of Technology (KIT) in Germany and the Toyota Technological Institute at Chicago (TTI-C) in the USA – hence KIT-TI – the Vision Benchmark Suite provided the foundation for the early ‘benchmark era’ of autonomous driving in the 2010s. Seven years later in 2019, Google/Alphabet’s autonomous vehicle division launched the Waymo Open Dataset, indebted to KITTI and other such open-source benchmark projects, establishing a new ‘incrementalist’ phase of autonomous vehicle development. Tied to annual iterations of their Open Dataset Challenges, Waymo published updates to the dataset in 2021 and 2022, adding unrivalled 'domain diversity' to their offering. Together, both dataset and challenge constitute Waymo’s vision to ‘platformize’ autonomous driving, mobilizing open data initiatives and logics as the basis for commercial development, locking prospective users into their plug-and-play machine learning (ML) stack. Offering a comparison between these two training datasets, representative of different phases in the development of autonomous vehicles, this paper considers the role of ‘interestingness’ as an empirical quality sought after by machine vision researchers in the compilation of such training datasets. In these cases, the search for interestingness leads researchers to design and test ever-more elaborate ways to define the kinds of scenes, situations and scenarios captured in the training datasets themselves, resulting in the quantification of interestingness as an increasing degree of interaction between agents.