Machine Learning Methods in Payment Card Fraud Detection

Protection of clients from fraudulent transactions is a complicated task. Banks tend to rely on rule-based systems which require manual creation of rules to identify fraud. These rules have to be set up by employees of the bank who need to look for any trends in fraudulent transactions themselves. T...

Full description

Bibliographic Details
Main Author:	Sinčák, Jan
Other Authors:	Baruník, Jozef, Vácha, Lukáš
Format:	Thesis
Language:	English
Published:	Univerzita Karlova, Fakulta sociálních věd 2023
Subjects:	machine learning card fraud fraud detection imbalanced data strojové učení karetní podvody detekce podvodů nevyvážená data sami
Online Access:	https://hdl.handle.net/20.500.11956/182600

_version_	1835020571099791360
author	Sinčák, Jan
author2	Baruník, Jozef Vácha, Lukáš
author_facet	Sinčák, Jan
author_sort	Sinčák, Jan
collection	Unknown
description	Protection of clients from fraudulent transactions is a complicated task. Banks tend to rely on rule-based systems which require manual creation of rules to identify fraud. These rules have to be set up by employees of the bank who need to look for any trends in fraudulent transactions themselves. This thesis deals with the problem of detection of fraudulent card transactions as it com- pares multiple machine learning models for fraud detection. These models can find complex relationships in the data and potentially outperform standard fraud detection systems, Logistic regression, neural network, random forest, and extreme gradient boosting (XGBoost) models are trained on a simulated dataset that closely follows properties of real card transactions. Performance of the models is measured by sensitivity, specificity, precision, AUC, and time to predict on the testing dataset. XGBoost shows the highest performance among the tested models. It is then compared to a standard fraud detection system used in a Czech bank. The bank system achieves higher specificity but XGBoost still shows promising performance. It is possible that certain machine learning models could outperform today's fraud detection systems if they are well-tuned. JEL Classification G21, K42 Keywords machine learning, card fraud, fraud. Ochrana klientů před podvodnými transakcemi je náročný úkol. Banky se ob- vykle spoléhají na systémy založené na pravidlech, které vyžadují ruční tvorbu těchto pravidel pro identifikaci podvodu. Tato pravidla musí nastavit zaměst- nanci banky, kteří musí sami vyhledávat trendy v podvodných transakcích. Tato práce se zabývá problémem odhalování podvodných karetních transakcí a porovnává několik modelů strojového učení pro detekci podvodů. Tyto mod- ely mohou v datech najít složité vztahy a potenciálně překonat klasické sys- témy detekce podvodů, Logistická regrese, neuronová síť, random forest a ex- treme gradient boosting (XGBoost) jsou trénovány na simulovaném souboru dat, který věrně kopíruje vlastnosti skutečných ...
format	Thesis
genre	sami
genre_facet	sami
id	ftunivpraha:oai:https://dspace.cuni.cz:20.500.11956/182600
institution	Open Polar
language	English
op_collection_id	ftunivpraha
op_doi	https://doi.org/20.500.11956/182600
op_relation	http://hdl.handle.net/20.500.11956/182600 236963
publishDate	2023
publisher	Univerzita Karlova, Fakulta sociálních věd
record_format	openpolar
spelling	ftunivpraha:oai:https://dspace.cuni.cz:20.500.11956/182600 2025-06-15T14:48:20+00:00 Machine Learning Methods in Payment Card Fraud Detection Metody strojového učení v detekci podvodných karetních transakcí Sinčák, Jan Baruník, Jozef Vácha, Lukáš 2023 application/pdf application/octet-stream https://hdl.handle.net/20.500.11956/182600 English en_US eng Univerzita Karlova, Fakulta sociálních věd http://hdl.handle.net/20.500.11956/182600 236963 machine learning card fraud fraud detection imbalanced data strojové učení karetní podvody detekce podvodů nevyvážená data diplomová práce 2023 ftunivpraha https://doi.org/20.500.11956/182600 2025-05-28T05:49:53Z Protection of clients from fraudulent transactions is a complicated task. Banks tend to rely on rule-based systems which require manual creation of rules to identify fraud. These rules have to be set up by employees of the bank who need to look for any trends in fraudulent transactions themselves. This thesis deals with the problem of detection of fraudulent card transactions as it com- pares multiple machine learning models for fraud detection. These models can find complex relationships in the data and potentially outperform standard fraud detection systems, Logistic regression, neural network, random forest, and extreme gradient boosting (XGBoost) models are trained on a simulated dataset that closely follows properties of real card transactions. Performance of the models is measured by sensitivity, specificity, precision, AUC, and time to predict on the testing dataset. XGBoost shows the highest performance among the tested models. It is then compared to a standard fraud detection system used in a Czech bank. The bank system achieves higher specificity but XGBoost still shows promising performance. It is possible that certain machine learning models could outperform today's fraud detection systems if they are well-tuned. JEL Classification G21, K42 Keywords machine learning, card fraud, fraud. Ochrana klientů před podvodnými transakcemi je náročný úkol. Banky se ob- vykle spoléhají na systémy založené na pravidlech, které vyžadují ruční tvorbu těchto pravidel pro identifikaci podvodu. Tato pravidla musí nastavit zaměst- nanci banky, kteří musí sami vyhledávat trendy v podvodných transakcích. Tato práce se zabývá problémem odhalování podvodných karetních transakcí a porovnává několik modelů strojového učení pro detekci podvodů. Tyto mod- ely mohou v datech najít složité vztahy a potenciálně překonat klasické sys- témy detekce podvodů, Logistická regrese, neuronová síť, random forest a ex- treme gradient boosting (XGBoost) jsou trénovány na simulovaném souboru dat, který věrně kopíruje vlastnosti skutečných ... Thesis sami Unknown
spellingShingle	machine learning card fraud fraud detection imbalanced data strojové učení karetní podvody detekce podvodů nevyvážená data Sinčák, Jan Machine Learning Methods in Payment Card Fraud Detection
title	Machine Learning Methods in Payment Card Fraud Detection
title_full	Machine Learning Methods in Payment Card Fraud Detection
title_fullStr	Machine Learning Methods in Payment Card Fraud Detection
title_full_unstemmed	Machine Learning Methods in Payment Card Fraud Detection
title_short	Machine Learning Methods in Payment Card Fraud Detection
title_sort	machine learning methods in payment card fraud detection
topic	machine learning card fraud fraud detection imbalanced data strojové učení karetní podvody detekce podvodů nevyvážená data
topic_facet	machine learning card fraud fraud detection imbalanced data strojové učení karetní podvody detekce podvodů nevyvážená data
url	https://hdl.handle.net/20.500.11956/182600

Machine Learning Methods in Payment Card Fraud Detection

Similar Items