Data Protection in the Utilization of Natural Language Processors for Trend Analysis and Public Opinion: ryptographic Aspect

Rozlomii, I. and Yehorchenkova, N. and Yarmilko, A. and Naumenko, S. (2023) Data Protection in the Utilization of Natural Language Processors for Trend Analysis and Public Opinion: ryptographic Aspect. Proceedings of the 2nd International Workshop on Social Communication and Information Activity in Digital Humanities (SCIA-2023). pp. 1-11.

[img] Text
paper_3 (1).pdf

Download (539kB)

Abstract

In the digital age, the significant increase in information generation and processing is accompanied by a growing threat of unauthorized access, illegal distribution, and use. One of the most promising strategies for protecting information from various cyber threats and malicious attacks is the use of Natural Language Processing (NLP) processors. This article focuses on the methodology of data protection in the context of utilizing Natural Language Processing for sentiment analysis and trend detection. Emphasis is placed on the relevance of using NLP to address tasks related to text content analysis for identifying suspicious or dangerous information. The article covers the stages of text data collection and processing, including data gathering from various sources such as social media, news portals, forums, and blogs. Subsequently, preliminary processing is performed, involving noise removal, tokenization, stemming, and lemmatization of the text to prepare the data for further analysis. The application of NLP allows for the identification of keywords, topics, sentiment, and text structure, facilitating categorization and trend identification in public opinion. Additionally, a mathematical model for detecting phishing indicators is presented, along with an example of identifying suspicious text features. It is noted that the use of cryptographic methods can effectively secure processed data, reducing the risk of unauthorized access or misuse. The article provides a detailed description of data protection methods in the process of sentiment analysis using NLP and underscores the necessity of employing cryptographic techniques to ensure the security of processed text data.

Item Type: Article
Uncontrolled Keywords: Natural language processing ; natural language processing technologies ; information security ; analysis of global rends ; cybersecurity ; disinformation ; phishing, automatic text analysis, text classification ; threat detection ; digital security ; cyber threats
Subjects: Фізико-математичні науки
Комп'ютерні науки
Комп'ютерні науки
Divisions: Факультет обчислювальної техніки, інтелектуальних та управляючих систем
Depositing User: Наукова Бібліотека
Date Deposited: 23 Feb 2024 10:25
Last Modified: 23 Feb 2024 10:25
URI: https://eprints.cdu.edu.ua/id/eprint/6031

Actions (login required)

View Item View Item