Publications

(2025). Red Teaming Contemporary AI Models: Insights from Spanish and Basque Perspectives. JISBD'25.

PDF

(2025). Meta-Fair: Metamorphic Testing of Fairness in Large Language Models. JISBD'25.

PDF

(2025). ASTRAL. A Tool for the Automated Safety Testing of Large Language Models.

PDF

(2025). Testing the Evilness of Large Language Models.

Slides

(2025). O3-Mini vs. DeepSeek-R1. Which One Is Safer?.

PDF

(2025). Early External Safety Testing of OpenAI's O3-Mini. Insights from Pre-Deployment Evaluation..

PDF

(2025). ASTRAL. Automated Safety Testing of Large Language Models.

PDF Podcast

(2025). AI-Driven Fairness Testing of Large Language Models. A Preliminary Study.

PDF Podcast

(2024). Toward Trustworthy AI-Enabled Internet Search. JISBD'24.

PDF Slides