TY - JOUR ID - 49350 TI - Phishing website detection using weighted feature line embedding JO - The ISC International Journal of Information Security JA - ISECURE LA - en SN - 2008-2045 AU - Imani, M. AU - Montazer, Gh. A. AD - Faculty of Electrical and Computer Engineering, Tarbiat Modares University, Tehran, Iran AD - Faculty of Information Technology Engineering, Tarbiat Modares University, Tehran, Iran Y1 - 2017 PY - 2017 VL - 9 IS - 2 SP - 147 EP - 159 KW - Phishing Detection KW - Feature Extraction KW - Feature Line KW - Virtual Training DO - 10.22042/isecure.2017.83439.377 N2 - The aim of phishing is tracing the users' s private information without their permission by designing a new website which mimics the trusted website. The specialists of information technology do not agree on a unique definition for the discriminative features that characterizes the phishing websites. Therefore, the number of reliable training samples in phishing detection problems is limited. Moreover, among the available training samples, there are abnormal samples that cause classification error. For instance, it is possible that there are phishing samples with similar features to legitimate ones and vice versa. A supervised feature extraction method, called weighted feature line embedding, is proposed in this paper to solve these problems. The proposed method virtually generates training samples by utilizing the feature line metric. Hence, it can solve the small sample size problem. Moreover, by assigning appropriate weights to each pair of feature points, it corrects the undesirable quality of abnormal samples. The features extracted by our method improve the performance of phishing website detection specially by using small training sets. UR - https://www.isecure-journal.com/article_49350.html L1 - https://www.isecure-journal.com/article_49350_c28c74eaa7e5791d181e534e06aa9789.pdf ER -