Leveraging NLP Techniques for Privacy Requirements Engineering in User Stories

Herwanto, Guntur Budi and Quirchmayr, Gerald and Tjoa, A Min (2024) Leveraging NLP Techniques for Privacy Requirements Engineering in User Stories. IEEE Access, 12. 22167-22189. ISSN 2169-3536

Abstract

Privacy requirements engineering serves to systematically elicit privacy requirements from system requirements and legal requirements such as the GDPR. Many methodologies have been proposed, but most of them target the waterfall approach, which makes adopting privacy engineering in agile software development difficult. A second major issue is that the process is currently manual to a high degree. This paper aims to close these gaps by developing a machine learning-based approach for identifying privacy requirements in an agile software development environment, employing natural language processing (NLP) techniques. Our method aims to allow agile teams to focus on functional requirements while NLP tools assist them in generating privacy requirements. The main input for our method is a collection of user stories, which are typically used to identify functional requirements in agile software development. The NLP approach is then used to automate some human-intensive tasks, such as identifying personal data and creating data flow diagrams from user stories. The data flow diagram forms the basis for the automatic creation of privacy requirements. Our evaluation shows that our NLP method achieves fairly good performance in terms of F-measure. We also demonstrate the feasibility of our NLP approach in the CamperPlus project. Lastly, we are developing a tool to integrate our NLP approach into the privacy requirements engineering pipeline, allowing for manual editing of results so that agile teams can maintain control over the automated approach.

Item Type: Article
Uncontrolled Keywords: agile software development; natural language processing; privacy requirements engineering; user stories
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Faculty of Mathematics and Natural Sciences > Computer Science & Electronics Department
Depositing User: Masrumi Fathurrohmah
Date Deposited: 03 Mar 2025 03:38
Last Modified: 03 Mar 2025 03:38
URI: https://ir.lib.ugm.ac.id/id/eprint/15443
