Fix typos <noupdate>

2024-12-27 16:21:47 +01:00
parent b7d538b9ac
commit 8837682582
2 changed files with 4 additions and 4 deletions

@@ -103,7 +103,7 @@
\section{Tasks}
\begin{description}
\item[Automatic speech recognition (ASP)]
\item[Automatic speech recognition (ASR)]
Convert a sound signal into text.
\begin{example}
@@ -160,7 +160,7 @@
\begin{description}
\item[Speech foundation model (SFM)] \marginnote{Speech foundation model (SFM)}
Transformer-based model pre-trained on speech. A common architecture is composed by:
Transformer-based model pre-trained on speech. A common architecture is composed of:
\begin{descriptionlist}
\item[Feature extractor]
Converts the waveform into a low-dimensional representation (e.g., by using convolutions).
@@ -179,7 +179,7 @@
\item[Multimodal model] \marginnote{Multimodal model}
Model able to handle multiple modalities (e.g., speech and text).
The main considerations to take into account when working with multimodel models are:
The main considerations to take into account when working with multimodal models are:
\begin{descriptionlist}
\item[Representation]
Decide how to encode different modalities into the same embedding space.