Cascina Elisa, Politecnico di Torino

Pellegrino Andrea, Politecnico di Torino


What is intent detection?

Intent detection is the task of identifying the goal or purpose behind a spoken utterance. When we speak, we are typically trying to accomplish something, such as asking a question, making a request, or expressing a desire. The aim of intent detection is to determine that intention algorithmically from natural language.

For example, if a user says "Turn up the volume", the intent is to increase the audio output level. If the user says "What time is it?", they are asking a question to obtain the current time. Intent detection involves analyzing linguistic and acoustic signals to categorize the intent. This enables voice interfaces and assistants to understand what the user wants and take appropriate actions.

Example: “Decrease heat” audio

[Figure: the "Decrease heat" audio shown in different domains to better visualize its characteristics.]

Using methods that will be explained later, this audio signal is processed and passed to our machine learning model, whose goal is to classify it as an utterance whose intent is to decrease the heat.
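To make the idea of viewing audio "in different domains" concrete, here is a minimal sketch that moves a signal from the time domain to the frequency domain. It assumes the domains in question are time and frequency, which the text does not state explicitly. The signal is a synthetic 440 Hz tone standing in for a real recording, and the helper names (`dft_magnitudes`, `dominant_frequency`) are hypothetical, not part of the project's code. A naive transform is used only to keep the example dependency-free; a real pipeline would normally rely on an FFT library.

```python
import cmath
import math


def dft_magnitudes(samples):
    """Naive O(n^2) discrete Fourier transform.

    Returns the magnitude of each frequency bin from 0 up to n // 2.
    """
    n = len(samples)
    magnitudes = []
    for k in range(n // 2 + 1):
        acc = sum(
            samples[t] * cmath.exp(-2j * math.pi * k * t / n)
            for t in range(n)
        )
        magnitudes.append(abs(acc))
    return magnitudes


def dominant_frequency(samples, sample_rate):
    """Return the frequency (Hz) of the strongest non-DC bin."""
    magnitudes = dft_magnitudes(samples)
    peak_bin = max(range(1, len(magnitudes)), key=lambda k: magnitudes[k])
    return peak_bin * sample_rate / len(samples)


# Assumption: a synthetic 440 Hz sine wave sampled at 8 kHz,
# used here in place of the "Decrease heat" recording.
SAMPLE_RATE = 8000
NUM_SAMPLES = 400  # bin width = 8000 / 400 = 20 Hz

tone = [
    math.sin(2 * math.pi * 440 * t / SAMPLE_RATE)
    for t in range(NUM_SAMPLES)
]

peak_hz = dominant_frequency(tone, SAMPLE_RATE)
print(f"Dominant frequency: {peak_hz:.1f} Hz")
```

The time-domain view is the raw list of samples, while the frequency-domain view shows how the signal's energy is spread across frequencies. Speech is far richer than a single tone, but the same transformation underlies the spectral representations commonly fed to audio classifiers.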

What were our goals?

For this project, we had two main goals: