Trustworthy LLMs for Ethically Aligned AI-based Systems: A PhD Research Plan

nbnfi-fe202601269095.pdf
Lopullinen julkaistu versio - 568.27 KB
https://creativecommons.org/licenses/by/4.0/

Kuvaus

© 2024 Copyright for this paper by its authors. Use permitted under Creative Commons License Attribution 4.0 International (CC BY 4.0).
In response to growing concerns around trustworthiness and ethical alignment in AI systems, this PhD aims to investigate how Large Language Models (LLMs) can be leveraged to support ethically aligned AI development in software engineering. Despite advancements, integrating ethical principles into AI workflows remains challenging, particularly in real-world applications that require compliance with emerging regulations, such as the EU AI Act. We will develop a Visual Studio Code (VSCode) Generative AI (GenAI) Extension powered by a multi-agent LLM system with Retrieval-Augmented Generation (RAG) capabilities. The extension will be designed to aid developers by evaluating code compliance with ethical standards, providing actionable recommendations to embed trustworthiness from early stages of development. The GenAI Extension will be evaluated through an iterative design science approach, encompassing dataset generation, ethical benchmarking, and practitioner testing. A dataset of over 2000 ethically aligned AI systems, will be created in compliance with leading regulatory frameworks, serving as a foundation for this tool’s assessments. With this work, we hope to assist developers, particularly in startups and SMEs, by providing practical resources for building ethically aligned AI within limited resources. Through this approach, we aim to bridge the gap between abstract ethical principles and actionable software development practices, making ethical AI more accessible across industry contexts.

Emojulkaisu

ICSOB-C 2024 Software Business: PhD Retreat and Posters & Demos Track 2024

ISBN

ISSN

1613-0073

Aihealue

Kausijulkaisu

CEUR workshop proceedings|3921

OKM-julkaisutyyppi

A4 Vertaisarvioitu artikkeli konferenssijulkaisussa