From Sound to Sight: Towards AI-authored Music Videos

Authors: Leo Vitasovic, Stella Graßhof, Agnes Mercedes Kloft, Ville V. Lehtola, Martin Cunneen, Justyna Starostka, Glenn McGarry, Kun Li, Sami Sebastian Brandt

2025 IEEE/CVF International Conference on Computer Vision (ICCV)

Best Demo Award · ICCV 2025 Workshop on Generative AI for Storytelling (AISTORY)

Link to the article

Extending perceived stereo baseline with vector-base amplitude panning and polarity inversion

Authors: Christopher Gjørup, Caroline Gaudeoso, Sami S. Brandt

The 11th Congress of the Alps Adria Acoustics Association (AAAA), 2025

Link to the article

Pruning by Block Benefit: Exploring the Properties of Vision Transformer Blocks during Domain Adaptation

Authors: Patrick Glandorf, Bodo Rosenhahn

Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, 2025

Link to the article

SPAN: Learning Similarity between Scene Graphs and Images with Transformers

Authors: Yuren Cong, Wentong Liao, Bodo Rosenhahn, Michael Ying Yang

IEEE Transactions on Pattern Analysis and Machine Intelligence, 2025

Link to the article

CHiQPM: Calibrated Hierarchical Interpretable Image Classification

Authors: Thomas Norrenbrock, Timo Kaiser, Sovan Biswas, Neslihan Kose, Ramesh Manuvinakurike, Bodo Rosenhahn

The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025)

Link to the article

Planner3D: LLM-enhanced graph prior meets 3D indoor scene explicit regularization

Authors: Yao Wei, Martin Renqiang Min, George Vosselman, Li Erran Li, Michael Ying Yang

IEEE Transactions on Pattern Analysis and Machine Intelligence, 2025

Link to the article

DVLO4D: Deep Visual-Lidar Odometry with Sparse Spatial-temporal Fusion

Authors: Mengmeng Liu, Michael Ying Yang, Jiuming Liu, Yunpeng Zhang, Jiangtao Li, Sander Oude Elberink, George Vosselman, Hao Cheng

2025 IEEE International Conference on Robotics and Automation (ICRA)

Link to the article

Multi-Agent Reinforcement Learning for Inverse Design in Photonic Integrated Circuits

Authors: Yannik Mahlau, Maximilian Schier, Christoph Reinders, Frederik Schubert, Marco Bügling, Bodo Rosenhahn

Reinforcement Learning Conference (RLC’25)

Link to the article

A Rotation-Invariant Embedded Platform for (Neural) Cellular Automata

Authors: Dominik Woiwode, Jakob Marten, Bodo Rosenhahn

The 2025 Conference on Artificial Life (ALIFE)

Link to the article

Explainable Reinforcement Learning via Dynamic Mixture Policies

Authors: Maximilian Schier, Frederik Schubert, Bodo Rosenhahn

2025 IEEE International Conference on Robotics and Automation (ICRA)

Link to the article

Direction-Aware Room Impulse Response Estimation for Immersive Audio Rendering in Real Environments

Authors: Giovanni Zanin, Ritujoy Biswas, Pietro Morerio, Sylvio Barbon Junior, Alberto Carini, Alessio Del Bue, Vittorio Murino

ACM Multimedia, Dublin, Ireland, 2025

Link to the article

QPM: Discrete Optimization for Globally Interpretable Image Classification

Authors: Thomas Norrenbrock, Timo Kaiser, Sovan Biswas, Ramesh Manuvinakurike, Bodo Rosenhahn

The Thirteenth International Conference on Learning Representations (ICLR’25)

Link to the article

UncertainSAM: Fast and Efficient Uncertainty Quantification of the Segment Anything Model

Authors: Timo Kaiser, Thomas Norrenbrock, Bodo Rosenhahn

2025 International Conference on Machine Learning (ICML’25)

Link to the article

Multi-Flow: Multi-View-Enriched Normalizing Flows for Industrial Anomaly Detection

Authors: Mathis Kruse, Bodo Rosenhahn

Conference on Computer Vision and Pattern Recognition – Visual Anomaly and Novelty Detection Workshop, 2025

Link to the article

In the middle of the music: A qualitative study of headset and loudspeaker distributed interactive spatial audio within a musical mixed-reality environment

Authors: Laurence Cliffe, Mairah Khan, Steve Benford

Audio Mostly 2025, Coimbra, Portugal, 2025

Link to the article

Scale-wise Bidirectional Alignment Network for Referring Remote Sensing Image Segmentation

Authors: Kun Li, George Vosselman, Michael Ying Yang

ISPRS Journal of Photogrammetry and Remote Sensing, 2025

Link to the article

Don't They Really Hear Us? A Design Space for Private Conversations in Social Virtual Reality

Authors: Josephus Jasper Limbago, Robin Welsch, Florian Müller, Mario Di Francesco

IEEE Transactions on Visualization and Computer Graphics, 2025

Link to the article

Multimodal Rationales for Explainable Visual Question Answering

Authors: Kun Li, George Vosselman, Michael Ying Yang

CVPR Workshops 2025

Link to the article

Attribute-Centric Compositional Text-to-Image Generation

Authors: Yuren Cong, Martin Renqiang Min, Li Erran Li, Bodo Rosenhahn, Michael Ying Yang

International Journal of Computer Vision, 2025

Link to the article

Utilizing Uncertainty in 2D Pose Detectors for Probabilistic 3D Human Mesh Recovery

Authors: Tom Wehrbein, Marco Rudolph, Bodo Rosenhahn, Bastian Wandt

Proceedings of the Winter Conference on Applications of Computer Vision (WACV), February 2025, p. 5852–5862.

Link to the article

Into the Here and Now: Explorations within a New Acoustic Virtual Reality

Author: Laurence Cliffe

Leonardo, 2025, p. 118–124.

Link to the article

Interfacing with history: curating with audio augmented objects

Author: Laurence Cliffe

Museum Management and Curatorship, 2024, p. 1–19.

Link to the article

"AI enhances our performance, I have no doubt this one will do the same": The Placebo effect is robust to negative descriptions of AI

Authors: Agnes Mercedes Kloft, Robin Welsch, Thomas Kosch, Steeven Villa

CHI ’24: Proceedings of the CHI Conference on Human Factors in Computing Systems, May 2024, Article No.: 299, Pages 1–24

Link to the article

Navigating the Virtual Gaze: Social Anxiety's Role in VR Proxemics

Authors: Beatriz Mello, Robin Welsch, Marissa Christien Verbokkem, Pascal Knierim, Martin Johannes Dechant

CHI ’24: Proceedings of the CHI Conference on Human Factors in Computing Systems, May 2024, Article No.: 598, Pages 1–15

Link to the article