Credit Card Fraud Detection Using Machine Learning with Integration of Contextual Knowledge
Author | : Yvan Lucas |
Publisher | : |
Total Pages | : 125 |
Release | : 2019 |
ISBN-10 | : OCLC:1193290241 |
ISBN-13 | : |
Rating | : 4/5 ( Downloads) |
Download or read book Credit Card Fraud Detection Using Machine Learning with Integration of Contextual Knowledge written by Yvan Lucas and published by . This book was released on 2019 with total page 125 pages. Available in PDF, EPUB and Kindle. Book excerpt: The detection of credit card fraud has several features that make it a difficult task. First, attributes describing a transaction ignore sequential information. Secondly, purchasing behavior and fraud strategies can change over time, gradually making a decision function learned by an irrelevant classifier. We performed an exploratory analysis to quantify the day-by-day shift dataset and identified calendar periods that have different properties within the dataset. The main strategy for integrating sequential information is to create a set of attributes that are descriptive statistics obtained by aggregating cardholder transaction sequences. We used this method as a reference method for detecting credit card fraud. We have proposed a strategy for creating attributes based on Hidden Markov Models (HMMs) characterizing the transaction from different viewpoints in order to integrate a broad spectrum of sequential information within transactions. In fact, we model the authentic and fraudulent behaviors of merchants and cardholders according to two univariate characteristics: the date and the amount of transactions. Our multi-perspective approach based on HMM allows automated preprocessing of data to model temporal correlations. Experiments conducted on a large set of data from real-world credit card transactions (46 million transactions carried out by Belgian cardholders between March and May 2015) have shown that the proposed strategy for pre-processing data based on HMMs can detect more fraudulent transactions when combined with the Aggregate Data Pre-Processing strategy.