12023 HE

Goodbye, year 12,023 of the Human Era, and welcome, 12,024. Before the pandemic, things were moving way too fast, and months basically felt like weeks.

Queenstown, New Zealand

The Best of ISMIR 2023

ISMIR is back in full swing with over 400 registrations, most of them in-person attendees in the beautiful city of Milan.

The Duomo of Milan

Efficient Spoken Language Recognition via Multilabel Classification

TL;DR: we present efficient models for the task of Spoken Language Recognition plus an effective strategy to gracefully handle unsupported languages via multilabel classification.

ICASSP 2023: The Good, the Bad, and the Ugly

The beautiful island of Rhodes welcomed the most prestigious signal processing conference in the world this past week.

Audio-Text Models Do Not Yet Leverage Natural Language

TL;DR: we thoroughly analyzed state-of-the-art audio-text multimodal models and found that they do not yet fully leverage natural language.

12022 HE

And there you go: the year 12,022 of the Human Era comes to an end.

Hampi in India

The Best of ISMIR 2022

And finally, after two years of virtual-only ISMIRs, this year we had the first post-pandemic in-person ISMIR.

Burning Man Intensifies

I keep seeing wonderful pictures from this year’s Burning Man, and they inspired me to share a terrible one, to counterbalance.
