Draw Me Like My Triples: Leveraging Generative AI for Wikidata Image Completion (Poster)

Raia Abu Ahmad; Martin Critelli; Şefika Efeoğlu; Eleonora Mancini; Célian Ringwald; Xinyue Zhang; Albert Meroño-Peñuela

Poster De Conférence Année : 2023

Draw Me Like My Triples: Leveraging Generative AI for Wikidata Image Completion (Poster)

(1) , (2) , (3, 4) , (5) , (6) , (7) , (8)

1
2
3
4
5
6
7
8

Raia Abu Ahmad

Fonction : Auteur

Deutsches Forschungszentrum für Künstliche Intelligenz GmbH = German Research Center for Artificial Intelligence

Martin Critelli

Fonction : Auteur

University of Ca’ Foscari [Venice, Italy]

Şefika Efeoğlu

Fonction : Auteur

Freie Universität Berlin

Technical University of Berlin / Technische Universität Berlin

Eleonora Mancini

Fonction : Auteur

Dept. Of Electrical, Electronic And Information Engineering, University Of Bologna

Célian Ringwald

Fonction : Auteur

Web-Instrumented Man-Machine Interactions, Communities and Semantics

Xinyue Zhang

Fonction : Auteur

University of Oxford

Albert Meroño-Peñuela

Fonction : Auteur

King‘s College London

Résumé

We leverage generative AI for the task of creating images for Wikidata items that do not have them. Our approach uses knowledge contained in Wikidata triples of items describing fictional characters and uses the fine-tuned T5 model based on the WDV dataset to generate natural text descriptions of items about fictional characters with missing images. We use those natural text descriptions as prompts for a transformer-based text-to-image model, Stable Diffusion (SD) v2.1, to generate plausible candidate images for Wikidata image completion. We motivate this choice by the fact that querying Wikidata shows that only 7% out of the 83.7K instances of the fictional character class have an image. Our work addresses the following Research Questions (RQs): - RQ1: To what extent can different types of prompts based on triples be used in text-to-image models to produce high-quality images? - RQ2: To what extent can the output of generative AI be used for Wikidata image completion? - RQ3: How can generative text-to-image models be evaluated?

Domaines

Traitement du texte et du document Apprentissage [cs.LG] Traitement des images [eess.IV]

Fichier principal

ISWC_poster_DrawMeLikeMyTriples_me-12.pdf (954.24 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Celian RINGWALD : Connectez-vous pour contacter le contributeur

https://hal.science/hal-04526119

Soumis le : vendredi 29 mars 2024-10:02:40

Dernière modification le : dimanche 31 mars 2024-03:15:39

Dates et versions

hal-04526119 , version 1 (29-03-2024)

Identifiants

HAL Id : hal-04526119 , version 1

Citer

Raia Abu Ahmad, Martin Critelli, Şefika Efeoğlu, Eleonora Mancini, Célian Ringwald, et al.. Draw Me Like My Triples: Leveraging Generative AI for Wikidata Image Completion (Poster). ISWC23 - 22nd International Semantic Web Conference, Nov 2023, Athens, Greece. . ⟨hal-04526119⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA I3S WIMMICS INRIA2 UNIV-COTEDAZUR 3IA-COTEDAZUR ANR

5 Consultations

3 Téléchargements

Draw Me Like My Triples: Leveraging Generative AI for Wikidata Image Completion (Poster)

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager