Hệ thống phát hiện email lừa đảo phishing sử dụng temporal analysis và mô hình transformer

Đặng Thị Hiền, Trần Thị Duyên

Phishing email detection using temporal behavioral modeling and transformer architectures

Authors:

Hien Dang Thi, Duyen Tran Thi

Pages:

View:

156

Position:

7/7

Download:

Download PDF

Download JournalTOCs

Abtract

This study addresses the challenge of effectively detecting phishing email campaigns that exhibit increasingly sophisticated and sequential behaviors. The primary objective is to develop a detection approach that not only analyzes individual emails but also captures behavioral patterns across sequences of related messages. To achieve this goal, a hybrid deep learning framework is proposed that integrates advanced neural architectures for multi-email sequence analysis. Specifically, the model employs DistilBERT to extract semantic representations from email content, while a Bidirectional Long Short-Term Memory (BiLSTM) network is utilized to model temporal dependencies within consecutive email streams. The training dataset is constructed by aggregating four publicly available phishing and spam corpora, including CEAS_08, Nazario, Nigerian Fraud, and SpamAssassin, resulting in a cleaned dataset of 46,616 emails spanning the period from 2000 to 2022. In addition, two heuristic scoring metrics—Urgency_score and Suspicious_score—are introduced to quantify latent phishing-related cues commonly observed in malicious emails. Experimental results demonstrate that the proposed framework achieves an accuracy of 99.36% and an AUC-ROC of 0.9991 on the validation set, outperforming several baseline approaches. Furthermore, ablation experiments verify the contribution of each model component, while sensitivity analysis provides empirical justification for the selected sequence window size.

Xem thêm Ẩn bớt

Relate

Keyword

Phishing detection email security temporal analysis transformer models multi-email sequence analysis BiLSTM urgency score suspicious score

Articles in the same issue

Morphological characteristics of the population of the donkey croaker Pennahia aneus (Bloch, 1793) in the estuary and Coastal areas of Thanh Hoa province

Thao Hoang Ngoc

Volume 55, Issue 2A, 06/2026

Vinh University journal of science

Tạp chí khoa học Trường Đại học Vinh

ISSN: 1859 - 2228

Governing body: Vinh University

Address: 182 Le Duan - Vinh City - Nghe An province
Phone: (+84) 238.3855.452 - Fax: (+84) 238.3855.269
Email: vinhuni@vinhuni.edu.vn
Website: https://vinhuni.edu.vn

License: 163/GP-BTTTT issued by the Minister of Information and Communications on May 10, 2023

Open Access License: Creative Commons CC BY NC 4.0

CONTACT

Editor-in-Chief: Assoc. Prof., Dr. Tran Ba Tien
Email: tientb@vinhuni.edu.vn

Deputy editor-in-chief: Assoc. Prof., Dr. Phan Van Tien
Email: vantienkxd@vinhuni.edu.vn

Sub-Editor: Dr. Do Mai Trang
Email: domaitrang@vinhuni.edu.vn

Editorial assistant: Msc. Le Tuan Dung, Msc. Phan The Hoa, Msc. Pham Thi Quynh Nga, Msc. Tran Thi Thai

Address: 4th Floor, Executive Building, No. 182, Le Duan street, Vinh city, Nghe An province.
Phone: (+84) 238-385-6700 | Hotline: (+84) 97-385-6700
Email: editors@vujs.vn
Website: https://vujs.vn

Vinh University Journal of Science

ISSN: 1859-2228

Phishing email detection using temporal behavioral modeling and transformer architectures

Design and fabrication of a dual-axis solar tracking system

Application of ByteTrack and YOLOv10 models in solving the object tracking problem

Machine learning-based prediction of construction cost overruns using Random Forest

A hybrid LightGBM-LSTM machine learning model for short-term water level forecasting in the Mekong River Basin

Analyzing the impact of network failures on the routing performance of OSPF and EIGRP

Context representation for LLM based code generation in visual studio: a systematic review

Morphological characteristics of the population of the donkey croaker Pennahia aneus (Bloch, 1793) in the estuary and Coastal areas of Thanh Hoa province

Proposing solutions to support the accounting of cutting tables used in garment manufacturing

Adaptive Reflection Control for Anti-Jamming Backscatter Communication under Dynamic Interference

An Empirical Analysis of Code Complexity and Its Impact on Software Defect Density

Vinh University journal of science

Tạp chí khoa học Trường Đại học Vinh

ISSN: 1859 - 2228