From Sound to Sight: Towards AI-authored Music Videos
2025 IEEE/CVF International Conference on Computer Vision (ICCV)
Best Demo Award · ICCV 2025 Workshop on Generative AI for Storytelling (AISTORY)
Extending perceived stereo baseline with vector-base amplitude panning and polarity inversion
The 11th Congress of the Alps Adria Acoustics Association (AAAA), 2025
Pruning by Block Benefit: Exploring the Properties of Vision Transformer Blocks during Domain Adaptation
Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, 2025
SPAN: Learning Similarity between Scene Graphs and Images with Transformers
IEEE Transactions on Pattern Analysis and Machine Intelligence 2025
CHiQPM: Calibrated Hierarchical Interpretable Image Classification
The Thirty-Ninth Annual Conference on Neural Information Processing Systems (NeurIPS 2025)
Planner3D: LLM-enhanced graph prior meets 3D indoor scene explicit regularization
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2025
DVLO4D: Deep Visual-Lidar Odometry with Sparse Spatial-temporal
2025 IEEE International Conference on Robotics and Automation (ICRA)
Multi-Agent Reinforcement Learning for Inverse Design in Photonic Integrated Circuits
Reinforcement Learning Conference (RLC’25)
A Rotation-Invariant Embedded Platform for (Neural) Cellular Automata
The 2025 Conference on Artificial Life (ALIFE)
Explainable Reinforcement Learning via Dynamic Mixture Policies
2025 IEEE International Conference on Robotics and Automation (ICRA)
Direction-Aware Room Impulse Response Estimation for Immersive Audio Rendering in Real Environments
ACM Multimedia, Dublin, Ireland, 2025
QPM: Discrete Optimization for Globally Interpretable Image Classification
The Thirteenth International Conference on Learning Representations (ICLR’25)
UncertainSAM: Fast and Efficient Uncertainty Quantification of the Segment Anything Model
2025 International Conference on Machine Learning (ICML’25)
Multi-Flow: Multi-View-Enriched Normalizing Flows for Industrial Anomaly Detection
Conference on Computer Vision and Pattern Recognition – Visual Anomaly and Novelty Detection Workshop, 2025
In the middle of the music: A qualitative study of headset and loudspeaker distributed interactive spatial audio within a musical mixed-reality environment
Audio Mostly 2025, Coimbra, Portugal, 2025
Scale-wise Bidirectional Alignment Network for Referring Remote Sensing Image Segmentation
ISPRS Journal of Photogrammetry and Remote Sensing, 2025
Don't They Really Hear Us? A Design Space for Private Conversations in Social Virtual Reality
IEEE Transactions on Visualization and Computer Graphics, 2025
Multimodal Rationales for Explainable Visual Question Answering
CVPR Workshops 2025
Link to the article:
Attribute-Centric Compositional Text-to-Image Generation
International Journal of Computer Vision (2025)
Utilizing Uncertainty in 2D Pose Detectors for Probabilistic 3D Human Mesh Recovery
Proceedings of the Winter Conference on Applications of Computer Vision (WACV), February 2025, p. 5852-5862.
Into the Here and Now: Explorations within a New Acoustic Virtual Reality
Leonardo, 2025, p. 118–124.
Interfacing with history: curating with audio augmented objects
Museum Management and Curatorship, 2024, p. 1–19.
AI enhances our performance, I have no doubt this one will do the same": The Placebo effect is robust to negative descriptions of AI
CHI ’24: Proceedings of the CHI Conference on Human Factors in Computing Systems, May 2024, Article No.: 299, Pages 1–24
Navigating the Virtual Gaze: Social Anxiety's Role in VR Proxemics
CHI ’24: Proceedings of the CHI Conference on Human Factors in Computing Systems, May 2024, Article No.: 598, Pages 1–15