Publications

Miguel Romero-Arjona, Pablo Valle, Juan C. Alonso, Ana B. Sánchez, Miriam Ugarte, Antonia Cazalilla, Vicente Cambrón, José A. Parejo, Aitor Arrieta, Sergio Segura (2025). Red Teaming Contemporary AI Models: Insights from Spanish and Basque Perspectives. JISBD'25.

Miguel Romero-Arjona, José A. Parejo, Juan C. Alonso, Ana B. Sánchez, Aitor Arrieta, Sergio Segura (2025). Meta-Fair: Metamorphic Testing of Fairness in Large Language Models. JISBD'25.

Miriam Ugarte, Pablo Valle, Jose Antonio Parejo, Sergio Segura, Aitor Arrieta (2025). ASTRAL. A Tool for the Automated Safety Testing of Large Language Models.

Miguel Romero-Arjona, Aitor Arrieta (2025). Testing the Evilness of Large Language Models.

Aitor Arrieta, Miriam Ugarte, Pablo Valle, Jose Antonio Parejo, Sergio Segura (2025). O3-Mini vs. DeepSeek-R1. Which One Is Safer?.

Aitor Arrieta, Miriam Ugarte, Pablo Valle, Jose Antonio Parejo, Sergio Segura (2025). Early External Safety Testing of OpenAI's O3-Mini. Insights from Pre-Deployment Evaluation..

Miriam Ugarte, Pablo Valle, Jose Antonio Parejo, Sergio Segura, Aitor Arrieta (2025). ASTRAL. Automated Safety Testing of Large Language Models.

Miguel Romero-Arjona, José A. Parejo, Juan C. Alonso, Ana B. Sánchez, Aitor Arrieta, Sergio Segura (2025). AI-Driven Fairness Testing of Large Language Models. A Preliminary Study.

Miguel Romero-Arjona, Sergio Segura, Aitor Arrieta (2024). Toward Trustworthy AI-Enabled Internet Search. JISBD'24.