Publications

Things I have been writing

Overview

Journal Papers

2026
Gaussian Process Regression of Steering Vectors With Physics-Aware Deep Composite Kernels for Augmented Listening
Diego Di Carlo, Shoichi Koyama, Arie Aditya Nugraha, Mathieu Fontaine, Yoshiaki Bando, Kazuyoshi Yoshii
in IEEE Transactions on Audio, Speech, and Language Processing, Vol. ???, Num. ???, pp. ???, 2026.

2021
Mean absorption estimation from room impulse responses using virtually supervised learning
Cedric Foy, Antoine Deleforge, Diego Di Carlo
in The Journal of the Acoustical Society of America, Vol. 150, Num. 2, pp. 1286--1299, 2021.

2021
dEchorate: a calibrated room impulse response dataset for echo-aware signal processing
Diego Di Carlo, Pinchas Tandeitnik, Cedric Foy, Nancy Bertin, Antoine Deleforge, Sharon Gannot
in IEEE Signal Processing Magazine, Vol. 2021, Num. 5, pp. 1--15, 2021.

2019
Audio-Based Search and Rescue With a Drone: Highlights From the IEEE Signal Processing Cup 2019 Student Competition
Antoine Deleforge, Diego Di Carlo, Martin Strauss, Romain Serizel, Lucio Marcenaro
in IEEE Signal Processing Magazine, Vol. 36, Num. 5, pp. 138--144, 2019.

Conference Papers

2026
SIRUP: A diffusion-based virtual upmixer of steering vectors for highly-directive spatialization with first-order ambisonics
Emilio Picard, Diego Di Carlo, Aditya Arie Nugraha, Mathieu Fontaine, Kazuyoshi Yoshii
in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2026.

2026
Physics-informed Learning Of Neural Scattering Fields Towards Measurement-free Mesh-to-HRTF Estimation
Tancrède Martinez, Diego Di Carlo, Aditya Arie Nugraha, Mathieu Fontaine, Kazuyoshi Yoshii
in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2026.

2025
Visually-Informed Multichannel Sound Source Separation Based on 3D Gaussian Primitives
Haruaki Asano, Ryunosuke Nihei, Yoshiaki Bando, Aditya Arie Nugraha, Diego Di Carlo, Hiroyuki Ueda, Yosuke Ito, Kazuyoshi Yoshii
in IEEE Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 2025.

2025
Physically Informed Spatial Regularization for Sound Event Localization and Detection
Haocheng Liu, Diego Di Carlo, Aditya Arie Nugraha, Kazuyoshi Yoshii, Gaël Richard, Mathieu Fontaine
in IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2025.

2025
SHAMaNS: Sound Localization with Hybrid Alpha-Stable Spatial Measure and Neural Steerer
Diego Di Carlo (RIKEN AIP), Mathieu Fontaine (LTCI, IP Paris), Aditya Arie Nugraha (RIKEN AIP), Yoshiaki Bando (RIKEN AIP), Kazuyoshi Yoshii
in European Signal Processing Conference (EUSIPCO), 2025.

2024
Run-Time Adaptation of Neural Beamforming for Robust Speech Dereverberation and Denoising
Yoto Fujita, Aditya Arie Nugraha, Diego Di Carlo, Yoshiaki Bando, Mathieu Fontaine, and Kazuyoshi Yoshii
in Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2024.

2024
RIR-in-a-Box: Estimating Room Acoustics from 3D Mesh Data through Shoebox Approximation
Liam Kelley, Diego Di Carlo, Aditya Arie Nugraha, Mathieu Fontaine, Yoshiaki Bando, and Kazuyoshi Yoshii
in Annual Conference of the International Speech Communication Association (Interspeech), 2024.

2024
Joint Audio Source Localization and Separation with Distributed Microphone Arrays Based on Spatially-Regularized Multichannel NMF
Yoshiaki Sumura, Diego Di Carlo, Aditya Arie Nugraha, Yoshiaki Bando, and Kazuyoshi Yoshii
in International Workshop on Acoustic Signal Enhancement (IWAENC), 2024.

2024
Neural Steerer: Novel Steering Vector Synthesis with a Causal Neural Field over Frequency and Direction
Diego Di Carlo, Aditya Arie Nugraha, Mathieu Fontaine, Yoshiaki Bando, and Kazuyoshi Yoshii
in IEEE International Conference on Acoustics, Speech and Signal Processing Workshops (ICASSPW),, 2024.

2024
Implicit neural representation for change detection
Peter Naylor, Diego Di Carlo, Arianna Traviglia, Makoto Yamada, Marco Fiorucci
in IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024.

2023
Time-Domain Audio Source Separation Based on Gaussian Processes with Deep Kernel Learning
Aditya Arie Nugraha, Diego Di Carlo, Yoshiaki Bando, Mathieu Fontaine, and Kazuyoshi Yoshii
in IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 2023.

2022
Elliptically Contoured Alpha-Stable Representation for MUSIC-Based Sound Source Localization
Mathieu Fontaine, Diego Di Carlo, Kouhei Sekiguchi, Aditya Arie Nugraha, Yoshiaki Bando, and Kazuyoshi Yoshii
in European Signal Processing Conference (EUSIPCO), 2022.

2022
Post processing sparse and instantaneous 2D velocity fields using physics-informed neural networks
Diego Di Carlo, Dominique Heitz, Thomas Corpetti
in 20th International Symposium on Application of Laser and Imaging Techniques to Fluid Mechanics (LXLASER), 2022.

2020
BLASTER: An Off-Grid Method for Blind and Regularized Acoustic Echoes Retrieval
Di Carlo, Diego and Elvira, Clement and Deleforge, Antoine and Bertin, Nancy and Gibonval, Remi
in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020.

2019
MIRAGE: 2D Source Localization Using Microphone Pair Augmentation with Echoes
Di Carlo, Diego and Deleforge, Antoine and Bertin, Nancy
in IEEE International Conference on Acoustics, Speech and Signal Processing, 2019.

2018
SEPARAKE: Source Separation with a Little Help from Echoes
Scheibler, Robin and Di Carlo, Diego and Deleforge, Antoine and Dokmanic, Ivan
in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018.

2018
Evaluation of an Open-Source Implementation of the SPR-PHAT Algorithm Within the 2018 Locata Challenge
Lebarbenchon, Romain and Camberlein, Ewen and Di Carlo, Diego and Deleforge, Antoine and Bertin, Nancy
in LOCATA Challenge Workshop - a satellite event of International Workshop on Acoustic Signal Enhancement (IWAENC), 2018.

2018
Interference reduction on full-length live recordings
Di Carlo, Diego and Liutkus, Antoine and Déguernel, Ken
in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018.

2017
Gaussian framework for interference reduction in live recordings
Di Carlo, Diego and Déguernel, Ken and Liutkus, Antoine
in AES International Conference on Semantic Audio, 2017.

2016
Gestural Control Of Wavefield synthesis
Grani, Francesco and Di Carlo, Diego and Portillo, Jorge Madrid and Girardi, Matteo and Paisa, Razvan and Banas, Jian Stian and Vogiatzoglou, Iakovos and Overholt, Dan and Serafin, Stefania
in Sound and Music Computing Conference (SMC), 2016.

2014
Automatic music listening for automatic music performance: a grandpiano dynamics classifier
Di Carlo, Diego and Rodá, Antonio
in Proceedings of the 1st International Workshop on Computer and Robotic Systems for Automatic Music Performance (SAMP 14), 2014.

Invited Talks

2025
Augmented Listening with Physics-Coherent Neural Fields
Telecom Paris, France, 2025, April 25.
2024
from Neural Field for Augmented Listening
Kyoto University, Enginnering School, 2024, December.
2024
from Neural Fields to PINNs, ... and beyond.
Prism, CNRS, France, 2024, September.
2024
from Neural Fields to PINNs, ... and beyond.
Telecom Paris, France, 2024, September.
Neural Fields
Kyoto University.
Neural Fields
Strasbourg.
Neural Fields
Telecom.
Neural Fields for Urban Change detection
Kyoto Univesity.
2021
Echo-aware Signal Processing for Audio Scene Analysis
Riken AIP Center, Kyoto (Japan), 2021, July.
2019
Hunting Echoes for Auditory Scene Analysis
Bar-Ilan University, Israel, 2019, November.
What is an Hackathon?
Journeé Science et Musisque, BU Univ Rennes 2, Rennes.
2019
My Pythonic Workflow
Seminaire Au Vert (Team Building Seminar), 2019, August.
2019
Hunting Echoes for Auditory Scene Analysis
Rosckoff, Frances, 2019, July.

Thesis

2020
Echo-aware signal processing for audio scene analysis
Di Carlo, Diego - supervised by Deleforge, Antoine and Bertin, Nancy
Université de Rennes 1 - Panama Team (INRIA/IRISA) [France], 2020.

2017
Gaussian Framework for Interference Reduction in Live Recordings
Di Carlo, Diego - supervised by Orio, Nicola and Liutkus, Antoine
Universitá degli Studi di Padova [Italy], 2017.

2014
Sequential Feature Selection: Algorithms and Applications for Audio Information Retrieval
Di Carlo, Diego - supervised by Antonio Rodá
Universitá degli Studi di Padova [Italy], 2014.

Miscellaneous

2018
Etude des propriétés acoustiques de la guitare Black Flag
Denis Thouret, Loïc Le Marrec, Diego Di Carlo, Ewen Camberlein, Clements Gaultier and Frédéric Bimbot - supervised by
Journee Science et Musique, Rennes (Fr), 2018.