From 90b014ad0ee18a9fa3dc2e182cd791bed5d005d7 Mon Sep 17 00:00:00 2001 From: =?UTF-8?q?Pawe=C5=82=20Sk=C3=B3rzewski?= Date: Fri, 17 Feb 2023 10:57:35 +0100 Subject: [PATCH] Update diagram files in README --- README.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/README.md b/README.md index 6155b75..6a775b9 100644 --- a/README.md +++ b/README.md @@ -13,7 +13,7 @@ The goal of this competition is to develop Natural Language Understanding models The approach used to prepare data for the challenge is meant to promote models robust to various types of errors in the input, making it impossible to solve the task by simply learning a shallow mapping from incorrectly recognized words to the correct ones. This reflects real-world scenarios where the NLU system is presented with inputs that exhibit various disturbances due to changes in the ASR model, acoustic conditions, speaker variation, and other causes. -[](https://raw.githubusercontent.com/kubapok/cnlps-caiccaic) +[](https://raw.githubusercontent.com/kubapok/cnlps-caiccaic) ## Dates @@ -31,7 +31,7 @@ The approach used to prepare data for the challenge is meant to promote models r The training set is derived from [Leyzer: A Dataset for Multilingual Assistants](https://github.com/cartesinus/leyzer). It consists of user utterances along with the semantic representation of the commands targeted at a virtual assistant. A fraction of the utterances in the training set is contaminated with speech recognition errors; however, to make the task more challenging, we left the majority of the utterances intact. The erroneous samples were obtained from user utterances using a TTS model followed by an ASR system. - + The training data are located in the `train` directory of the `cnlps-caiccaic` repository. The `train` directory contains two files: