T0132


Data-Driven Humanities in Japan: From AI Infrastructure to Computational Literary Analysis 
Convenor:
Nobuhiko Kikuchi (National Institute of Japanese Literature)
Send message to Convenor
Chair:
Nobuhiko Kikuchi (National Institute of Japanese Literature)
Discussant:
Nobuhiko Kikuchi (National Institute of Japanese Literature)
Format:
Panel
Section:
Interdisciplinary Section: Digital Humanities

Short Abstract

Showcasing "The Model Building in Humanities through Data-Driven Problem Solving," this panel progresses from AI infrastructure to literary analysis. Papers cover CLIP image retrieval, RAG-based learning tools, gender in Kokinshu, and modern novel evolution, offering new digital pathways.

Long Abstract

This panel presents the latest achievements of the National Institute of Japanese Literature’s large-scale project, "The Model Building in Humanities through Data-Driven Problem Solving." Structured to trace the progression from digital infrastructure to analytical application, the four presentations demonstrate how computational approaches are reshaping Japanese humanities research.

The first half addresses the construction of AI-driven infrastructure for accessibility and education. The first presentation introduces a deep learning method using CLIP to retrieve illustrations via natural language queries. By fine-tuning models to recognize historical items like kichou, the study enables the efficient retrieval of visual materials lacking metadata. This approach supports intuitive exploration of classical materials and contributes to advanced historical and humanities research. Bridging retrieval and learning, the second study proposes an interactive search system using Retrieval-Augmented Generation (RAG). By visualizing textual variations across editions, this system supports secondary-level learners in understanding the complexity of textual transmission in classical books.

Building on these technological foundations, the second half applies quantitative methods to derive new literary insights. The third paper conducts a collocation network analysis of The Kokin Wakashuu, focusing on the verb omou. This systematic investigation not only offers a new perspective on Heian gender ideology but also sheds light on how quantitative methods reveal patterns that would otherwise remain unidentified by previous scholarship. Finally, the fourth presentation expands the scope to a macro-analysis of one million newspaper novels (1875–2025). By evidencing a genre-wide evolution toward psychological interiority, the study illustrates how quantitative measures, when combined with close reading, can reposition forgotten works and open new comparative perspectives on media formats, genres, and narrative techniques.

As "Data-Driven Humanities" rapidly establishes itself as a key term in Japanese academia, this panel seeks to go beyond mere presentation. By sharing these methodologies—ranging from AI-based archival retrieval to large-scale literary history—we aim to initiate a collaborative dialogue with European scholars to co-create a new paradigm of humanities research that transcends geographical boundaries and traditional methodological confines.

Abstract in Japanese (if needed)

Accepted papers