Resultados

Herramienta SocialFairness

El proyecto coordinado SocialFairness ha desarrollado una prueba de concepto innovadora orientada al análisis automático de la honestidad en los medios digitales, integrando la evaluación de la confiabilidad de las noticias y la toxicidad de los comentarios asociados. Coordinado por la Universidad de Jaén (UJA) y con la participación de la Universidad de Alicante (UA), el proyecto ha logrado importantes avances tanto científicos como tecnológicos en el ámbito del procesamiento del lenguaje natural (PLN) aplicado a la desinformación y al discurso de odio.

El principal hito alcanzado ha sido el desarrollo de un prototipo funcional en la nube capaz de analizar, en tiempo real, noticias publicadas en distintos medios digitales —entre ellos El País, El Mundo, ABC, La Vanguardia, El Diario, El Español, 20 Minutos, Ok Diario, Voz Pópuli, El Confidencial o Marca—, ofreciendo métricas automáticas sobre la confiabilidad del contenido y la toxicidad y constructividad de los comentarios asociados. Este sistema constituye una herramienta pionera en lengua española para la detección de desinformación y discurso de odio en entornos digitales.

La plataforma está operativa en el enlace https://socialfairness.demos.gplsi.es/

Recursos

I. Cabrera-de Castro, A. M. Mármol-Romero, A. Bonet-Jover, R. Sepúlveda-Torres, M. T. Martín-Valdivia, L. A. Ureña-López, E. Saquete, P. Martínez-Barco, Constructive classifier fine-tuned from roberta-base-bne, 2025. URL: https://huggingface.co/gplsi/Constructive_model, accessed: 2025-10-28.
R. Sepúlveda-Torres, A. Bonet-Jover, A. M. Mármol-Romero, I. Cabrera-de Castro, E. Saquete, P. Martínez-Barco, M. T. Martín-Valdivia, L. A. Ureña-López, 5W1H Extractor Fine-Tuned from Llama-3B-Instruct, 2025. URL: https://huggingface.co/gplsi/5W1H_Llama_3B, accessed: 2025-10-28.
A. Bonet-Jover, R. Sepúlveda-Torres, I. Cabrera-de Castro, A. M. Mármol-Romero, E. Saquete, P. Martínez-Barco, M. T. Martín-Valdivia, L. A. Ureña-López, 5W1H reliability classifier Fine-Tuned from RoBERTabase-bne, 2025. URL: https://huggingface.co/gplsi/reliability_5W1H, accessed: 2025-10-28
A. M. Mármol-Romero, R. Sepúlveda-Torres, I. Cabrera-de Castro, A. Bonet-Jover, M. T. Martín-Valdivia, L. A. Ureña-López, E. Saquete, P. Martínez-Barco, Toxicity classifier fine-tuned from roberta-base-bne), 2025. URL: https://huggingface.co/gplsi/Toxicity_model, accessed: 2025-05-28.
Sepúlveda-Torres, R., Botella-Gil, Beatriz., Bonet-Jover, A., Martínez-Barco, P., Saquete, E. Modelo de clasificación de Violencia. URL: https://github.com/rsepulveda911112/violent_message_detection.
Sepúlveda-Torres, R., Espinosa Zaragoza, S. Web crawler para descargar noticias. Link: https://github.com/gplsi/crawlers_socialfairness/tree/main.
A. Bonet-Jover, I. Cabrerade Castro, R. Sepúlveda-Torres ,A. M. Mármol-Romero, M. T. Martín-Valdivia, L. A. Ureña-López, E. Saquete, P. Martínez-Barco, 5W1Hs dataset , 2025. URL: https://github.com/rsepulveda911112/Flares-dataset, accessed: 2025-10-28.
Cabrera-de Castro, Isabel | Mamchur, Kateryna | Bonet-Jover, Alba | Sepúlveda-Torres, Robiert | Saquete, Estela | Martínez-Barco, Patricio | Martín-Valdivia, María Teresa | Ureña-López, Alfonso. Guía de toxicidad para la anotación de comentarios de noticias en medios digitales. 2025. URL:https://huggingface.co/datasets/gplsi/SocialTOX/blob/main/Gui%CC%81a_Toxicidad_SocialFairness.pdf
Bonet-Jover, Alba | Cabrera-de Castro, Isabel | Mamchur, Kateryna | Sepúlveda-Torres, Robiert | Saquete, Estela | Martínez-Barco, Patricio | Martín-Valdivia, María Teresa | Ureña-López, Alfonso. Guía de confiabilidad para la anotación de noticias en medios digitales. 2025. URL:https://github.com/rsepulveda911112/Flares-dataset/blob/main/Gui%CC%81a_Confiabilidad_SocialFairness.pdf

Publicaciones

Sepúlveda-Torres, R., Bonet-Jover, A., & Saquete, E. (2023). Detecting misleading headlines through the automatic recognition of contradiction in spanish. IEEE Access, 11, 72007-72026. https://doi.org/10.1109/ACCESS.2023.3295781
Bonet-Jover, A., Sepúlveda-Torres, R., Saquete, E., Martínez-Barco, P., Piad-Morffis, A., & Estevez-Velarde, S. (2023). Applying Human-in-the-Loop to construct a dataset for determining content reliability to combat fake news. Engineering applications of artificial intelligence, 126, 107152. https://doi.org/10.1016/j.engappai.2023.107152
Bonet-Jover, A., Sepúlveda-Torres, R., Saquete, E., & Martínez-Barco, P. (2023). A semi-automatic annotation methodology that combines Summarization and Human-In-The-Loop to create disinformation detection resources. Knowledge-Based Systems, 275, 110723. https://doi.org/10.1016/j.knosys.2023.110723
Bonet-Jover, A., Sepúlveda-Torres, R., Saquete, E., Martinez-Barco, P. (2023). Annotating reliability to enhance disinformation detection: annotation scheme, resource and evaluation. Procesamiento del Lenguaje Natural, 70, 15-26. http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/6475
Bonet-Jover, A., Sepúlveda-Torres, R., Saquete, E., Martínez-Barco, P., & Nieto-Pérez, M. (2024). RUN-AS: a novel approach to annotate news reliability for disinformation detection. Language Resources and Evaluation, 58(2), 609-639. https://doi.org/10.1007/s10579-023-09678-9
Botella-Gil, B., Sepúlveda-Torres, R., Bonet-Jover, A., Martínez-Barco, P., & Saquete, E. (2024). Semi-automatic dataset annotation applied to automatic violent message detection. IEEE Access, 12, 19651-19664. https://doi.org/10.1109/ACCESS.2024.3361404
Sepúlveda-Torres, R., Bonet-Jover, A., Diab, I., Guillén-Pacho, I., Cabrera-de Castro, I., Badenes-Olmedo, C., … & Ureña-López, L. A. (2024). Overview of flares at iberlef 2024: Fine-grained language-based reliability detection in spanish news. Procesamiento del lenguaje natural, 73, 369-379. http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/6624
Fernández, J., Gutiérrez, Y., & Martinez-Barco, P. (2023). Generación y pesado de skipgrams y su aplicación al análisis de sentimientos. Procesamiento del Lenguaje Natural: 70, 213-223. http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/6491
Botella, B., Sepúlveda-Torres, R., Martínez-Barco, P., Saquete, E. (2023) Violencia Identificada en el Lenguaje (VIL). Creación de recurso para mensajes violentos. Procesamiento del Lenguaje Natural. 70: 187-198. http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/6489
Sepúlveda-Torres, R., Vicente, M., Saquete, E., Lloret, E., & Palomar, M. (2023). Leveraging relevant summarized information and multi-layer classification to generalize the detection of misleading headlines. Data & Knowledge Engineering, 102176. https://doi.org/10.1016/j.datak.2023.102176
Martin, T. J., Vázquez, Y. G., Sepúlveda-Torres, R., & Abreu Salas, J. I. (2024). The risky news sharing quotient (RNSQ): A research instrument for exploring news-sharing behaviour that spreads fake news. Education, Citizenship and Social Justice, 17461979231218652. https://doi.org/10.1177/17461979231218652
Consuegra-Ayala, J. P., Gutiérrez, Y., Almeida-Cruz, Y., & Palomar, M. (2024). Automatic Annotation of Protected Attributes to Support Fairness Optimization. Information Sciences, 120188. https://doi.org/10.1016/j.ins.2024.120188
Yáñez-Romero, F., Montoyo, A., Muñoz, R., Gutiérrez, Y., & Suárez, A. (2024). OntoLM: Integrating Knowledge Bases and Language Models for classification in the medical domain. Procesamiento del Lenguaje Natural, 72, 137-148. http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/6584
Consuegra‐Ayala, J. P., Gutiérrez, Y., Almeida‐Cruz, Y., & Palomar, M. (2025). Bias mitigation for fair automation of classification tasks. Expert Systems, 42(2), e13734. https://doi.org/10.1111/exsy.13734
Rodríguez-Ferrándiz, R. (2025). Beyond detection and correction: Fake news’«news-ness» and «shareworthiness» as alternative ways to tackle disinformation. Communication & Society. https://doi.org/10.15581/003.38.1.005
Sepúlveda-Torres, R., Martínez-Murillo, I., Saquete, E., Lloret, E., & Palomar, M. (2025). To Write or Not to Write as a Machine? That’s the Question. IEEE Transactions on Big Data. https://doi.org/10.1109/TBDATA.2025.3536938
Rodríguez-Ferrándiz, R., Sánchez-Olmos, C., & Hidalgo-Marí, T. (2023). For the sake of sharing: Fake news as memes. In Information disorder (pp. 46-68). Routledge. https://doi.org/10.4324/9781003299936-3
Castellanos-Trujillo, V., & Rodríguez-Ferrándiz, R. (2025). Publicidad programática y desinformación en España. El rol de marcas y plataformas AdTech en pseudomedios. Revista Mediterránea de Comunicación. https://doi.org/10.14198/MEDCOM.29240
Sepúlveda-Torres, R., Bonet-Jover, A., Espinosa-Zaragoza, S., Mamchur, K., Cabrera-de-Castro, I., Mármol-Romero, A. María, Martínez-Barco, P., & Saquete, E., Martín-Valdivia, María T., Ureña, L. Alfonso (2025). Reliability and Toxicity Detection Tool in Digital Media. In Proceedings of the 41th International Conference of the Spanish Society for Natural Language Processing. Enlace: https://ceur-ws.org/Vol-3846/paper14.pdf
Botella-Gil, B., Bonet-Jover, A., Sepúlveda-Torres, R., Martínez-Barco, P., & Saquete, E. (2024). Exploring the Relationship between News Reliability and Violent Comments in Digital Media. In Proceedings of the 40th International Conference of the Spanish Society for Natural Language Processing. Enlace: https://ceur-ws.org/Vol-3846/paper14.pdf
Ureña López, L. A., Martín Valdivia, M. T., Saquete Boró, E., & Martínez-Barco, P. (2024). SocialFairness: Assessing Fairness in Digital Media. In SEPLN-CEDI-PD 2024: Seminar of the Spanish Society for Natural Language Processing: Projects and System Demonstrations, June 19-20, 2024, A Coruña, Spain. CEUR Workshop Proceedings, Vol-3729. Enlace: https://ceur-ws.org/Vol-3729/p5_rev.pdf
Botella-Gil, B., Consuegra-Ayala, J. P., Bonet-Jover, A., & Moreda-Pozo, P. Balancing the Scales: Addressing Gender Bias in Social Media Toxicity Detection. Proceedings of Recent Advances in Natural Language Processing,pages 194–203 Varna, Sep 8–10, 2025. Enlace: https://acl-bg.org/proceedings/2025/RANLP%202025/pdf/2025.ranlp-1.23.pdf
Castellanos-Trujillo, V. G., Hidalgo-Marí, T., Palomares-Sánchez, P., & Rodríguez-Ferrándiz, R. (2024). Confiabilidad en los estudios sobre fake news: datasets y métricas.
Sánchez-Olmos, C., Rodríguez-Ferrándiz, R., & Hidalgo-Marí, T. (2024). Desinformación y memética: réplica y mutación del argumentario antivacunas en contenidos informativos.
Castellanos-Trujillo, V. G., & Rodriguez-Ferrándiz, R. (2025). Dataset del Artículo Publicidad programática y desinformación en España.
Cabrera-de Castro, Isabel | Bonet-Jover, Alba | Mamchur, Kateryna | Sepúlveda-Torres, Robiert | Martín-Valdivia, María Teresa | Ureña-López, Alfonso. New proposal and comprehensive linguistic review of an annotation guideline for thereliability detection in news. Enlace: https://eventos.adeit.es/110309/section/48262/4th-international-conference-entretextos.html