MAP-Elites with Transverse Assessment for Multimodal Problems in Creative Domains

Zammit, Marvin; Liapis, Antonios; Yannakakis, Georgios N.

doi:10.1007/978-3-031-56992-0_26

Computer Science > Neural and Evolutionary Computing

arXiv:2403.07182 (cs)

[Submitted on 11 Mar 2024]

Title:MAP-Elites with Transverse Assessment for Multimodal Problems in Creative Domains

Authors:Marvin Zammit, Antonios Liapis, Georgios N. Yannakakis

View PDF HTML (experimental)

Abstract:The recent advances in language-based generative models have paved the way for the orchestration of multiple generators of different artefact types (text, image, audio, etc.) into one system. Presently, many open-source pre-trained models combine text with other modalities, thus enabling shared vector embeddings to be compared across different generators. Within this context we propose a novel approach to handle multimodal creative tasks using Quality Diversity evolution. Our contribution is a variation of the MAP-Elites algorithm, MAP-Elites with Transverse Assessment (MEliTA), which is tailored for multimodal creative tasks and leverages deep learned models that assess coherence across modalities. MEliTA decouples the artefacts' modalities and promotes cross-pollination between elites. As a test bed for this algorithm, we generate text descriptions and cover images for a hypothetical video game and assign each artefact a unique modality-specific behavioural characteristic. Results indicate that MEliTA can improve text-to-image mappings within the solution space, compared to a baseline MAP-Elites algorithm that strictly treats each image-text pair as one solution. Our approach represents a significant step forward in multimodal bottom-up orchestration and lays the groundwork for more complex systems coordinating multimodal creative agents in the future.

Comments:	18 pages, 5 figures To be published in the proceedings of the 13th International Conference on Artificial Intelligence in Music, Sound, Art and Design (EvoMUSART) 2024
Subjects:	Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:2403.07182 [cs.NE]
	(or arXiv:2403.07182v1 [cs.NE] for this version)
	https://doi.org/10.48550/arXiv.2403.07182
Related DOI:	https://doi.org/10.1007/978-3-031-56992-0_26

Submission history

From: Marvin Zammit [view email]
[v1] Mon, 11 Mar 2024 21:50:22 UTC (8,618 KB)

Computer Science > Neural and Evolutionary Computing

Title:MAP-Elites with Transverse Assessment for Multimodal Problems in Creative Domains

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Neural and Evolutionary Computing

Title:MAP-Elites with Transverse Assessment for Multimodal Problems in Creative Domains

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators