The main objective of CLARA-FINT was to describe the structure of financial reports so that their content could be compared. For this reason, our approach to simplification involved, above all, clear discourse and syntactic structure, and not just basic vocabulary. New texts have been collected to increase both the size and, especially, the variety of the FinT-esp corpus. A more complete and varied corpus has made it possible to develop financial language models for Spanish.

A second specific objective was to take part in shared evaluation tasks within the framework of the Workshops on Financial Narrative Processing and MultiLing Financial Summarisation, organized by the UCREL researchers at Lancaster. This has made it possible to include Spanish texts in the automatic summarization and cause-and-effect detection competitions. A third objective was to advance the understanding of financial narrative from both an economic and a linguistic perspective. The substantial body of data collected has been a significant source for building specialized lexicons or glossaries of financial terms, as well as for publishing studies on the characteristics of financial discourse and the ways it organizes information and argumentation. This knowledge may have an impact on applied language disciplines such as Translation or Communication.

May 14, 2025
Carbajo-Coronado, Blanca; Moreno-Sandoval, Antonio, 2025, "Guía de anotación de terminología financiera (FINTERM)", https://doi.org/10.21950/ZF4PKF, e-cienciaDatos, V1
The creation of this document of annotation guidelines is framed in the Spanish national project CLARA-FINT. It consists of annotation guidelines that establish some indications and rules to create a dataset. The dataset is made up of financial texts from the annual reports of th...
Apr 10, 2025
Torterolo Orta, Yanco Amor; Moreno-Sandoval, Antonio, 2025, "SIMFIN: Simplificador y detector léxico financiero automático (aplicación web en Python)", https://doi.org/10.21950/7N2D0H, e-cienciaDatos, V1
This project is a master's thesis (TFM) carried out as part of the student Yanco Amor Torterolo Orta's internship at the LLI-UAM. It belongs to the CLARA-FINT project, which focuses on plain language. More specifically, it explores several simplification techniques in the financial domain. This...
Apr 1, 2025
Moreno-Sandoval, Antonio; Porta, Jordi; García Toro, Ana, 2025, "Automatic discourse markers extractor", https://doi.org/10.21950/7UBNGJ, e-cienciaDatos, V1
This work is framed in the Spanish national project CLARA-FINT. The aim of this task within the project was to create an automatic discourse markers extractor for Spanish. In order to do so, the first step was to apply linguistic annotation on texts containing said markers. The n...
Apr 1, 2025
Carbajo-Coronado, Blanca; Moreno-Sandoval, Antonio; Porta, Jordi, 2025, "List of financial terms", https://doi.org/10.21950/JXFKRB, e-cienciaDatos, V1
The creation of this dataset is framed in the Spanish national project CLARA-FINT. It is a dataset with financial texts from the main Spanish listed companies' annual reports. Usually, said reports are publicly available under their respective shareholders website sections. The c...
Apr 1, 2025
Moreno-Sandoval, Antonio; Carbajo-Coronado, Blanca, 2025, "The financial narrative summarisation shared task (FNS 2022 & 2023): Datasets", https://doi.org/10.21950/WRH0SO, e-cienciaDatos, V1
Financial Narrative Processing (FNP) consists of workshops organized by Lancaster University at international NLP conferences to address various aspects of automatic processing of financial narratives, including automatic summarization. The LLI-UAM participated in 2022 and 2023 b...
Mar 28, 2025
Moreno-Sandoval, Antonio; Porta, Jordi; García Toro, Ana, 2025, "Discourse markers: Annotation guidelines", https://doi.org/10.21950/NWANNV, e-cienciaDatos, V1
This work is framed in the Spanish national project CLARA-FINT. The aim of this task within the project was to create an automatic discourse markers extractor for Spanish. In order to do so, the first step was to create these Annotation Guidelines to apply linguistic annotation o...
Mar 27, 2025
Moreno-Sandoval, Antonio; Carbajo-Coronado, Blanca; Porta, Jordi, 2025, "The financial document causality detection shared task (FinCausal 2023): Dataset", https://doi.org/10.21950/2JOAZJ, e-cienciaDatos, V1
The Financial Document Causality Detection Task (FinCausal 2023) aims at improving the detection of causality in the financial domain through its texts. Participants are asked to identify, in causal sentences, which elements of the sentence relate to the cause, and which relate to the effect....
Mar 27, 2025
Moreno-Sandoval, Antonio; Porta, Jordi; Carbajo-Coronado, Blanca, 2025, "Automatic financial term extractor", https://doi.org/10.21950/FWEML6, e-cienciaDatos, V1
The creation of this dataset is framed in the Spanish national project CLARA-FINT. The aim of this task within the project was to create an automatic financial term extractor for Spanish. In order to do so, the first step was to apply linguistic annotation on texts, namely annual...
Mar 21, 2025
Moreno-Sandoval, Antonio; Torterolo Orta, Yanco Amor; Roseti, Sofía Micaela; Carbajo-Coronado, Blanca; Porta, Jordi, 2025, "Financial ES-EN parallel corpus from annual reports", https://doi.org/10.21950/85MWYP, e-cienciaDatos, V1
The creation of this dataset is framed in the Spanish national project CLARA-FINT. It is a dataset with parallel bilingual texts (EN-ES). These texts are the main Spanish listed Companies' annual reports. Usually, said reports are publicly available under their respective shareho...