The use of ML algorithms in the task of detecting fraud when using plastic cards

T.A. Osipova, K.S. Zaytsev, V.O. Bifert

Abstract


Today there is a significant increase in the number of incidents involving the use of plastic cards, as well as a variety of fraudulent methods used by cybercriminals. The presented article is devoted to the application of machine learning methods to counter fraudulent transactions using plastic cards. The aim of the article is to study the effectiveness of various machine learning models in the analysis of transactions with plastic cards to identify various types of fraud. The article sequentially analyzes such machine learning methods as RandomForest, CatBoost, LogisticRegression, 2-layer ordinary perceptron and Rumelhart's multilayer perceptrons L-BFGS and SGD. Considerable attention is paid to the process of preparing data for participation in modeling, which is iterative and includes operations for selecting tables, attributes, records, converting, cleaning data, filtering and combining them in the desired format. The dataset was taken from Worldline and the ULB Machine Learning Group for Big Data Mining and Fraud Detection. The problem with class imbalance was solved using resampling. Rumelhart's L-BFGS and SGD perceptrons showed the best results in experiments when using the metrics "response time" and "accuracy".

Full Text:

PDF (Russian)

References


Tadviser [online resource] //Bank card fraud [website] URL: https://www.tadviser.ru/index.php//index.php/ Article: Bank Card Fraud (Date of request 23.10.2020)

HABR [online resource] // Machine learning algorithms [website] URL: https://habr.com/en/company/ods/blog/324402/#2-sluchaynyy-les (Date of request 6.12.2020)

KPFU [online resource] // Artificial neural networks and their applications [website] URL: https://kpfu.ru/staff_files/F1493580427/NejronGafGal.pdf (Date of request 4.04.2021)

Apmonitor [online resource] // Deep Learning [website] URL: https://apmonitor.com/do/index.php/Main/DeepLearning (Date of request 2.04.2021)

Python-school [online resource] // TensorFlow vs PyTorch: What and When to Choose for Machine Learning [website] URL: https://python-school.ru/tensorflow-vs-pytorch/ ( Date of request 20.03.2021)

Kaggle [online resource] // Kaggle.com [website] https://www.kaggle.com/dejavu23/titanic-survival-seaborn-and-ensembles ( Date of request 17.10.2020)

Kaggle [online resource] // Kaggle.com [website] https://www.kaggle.com/mlg-ulb/creditcardfraud/version/3 (Date of request 10.10.2020)

Neural-university [online resource] // What are neural networks [website] URL: https://neural-university.ru/neural-networks-basics (Date of request 10.04.2021)

"A literature survey on Machine Learning Algorithms", International Journal of Emerging Technologies and Innovative Research (www.jetir.org), ISSN:2349-5162, Vol.6, Issue 4, page no.471-474, April-2019, Available :http://www.jetir.org/papers/JETIR1904C77.pdf

Machine learning approach to literature mining for the genetics of complex diseases, Jessica Schuster, Michael Superdock, Anthony Agudelo, Paul Stey, James Padbury, Indra Neil Sarkar, Alper Uzun, Author Notes, Database, Volume 2019, 2019, baz124. URL: https://doi.org/10.1093/database/baz124

Fraud [online resource] // Fraud Detection with Machine Learning [website] URL: https://www.researchgate.net/project/Fraud-detection-with-machine-learning (Date of request 21.04.2021)

Fontanka.ru [online resource] // Theft with bank cards [website] URL: https://www.fontanka.ru/2020/11/09/69533506/ (Date of request 18.04.2021)

Bazhenov [online resource] // Classifier score (presicion, recall, F-score) [website] URL: http://bazhenov.me/blog/2012/07/21/classification-performance-evaluation.html (Date of request 16.11.2020)

Machinelearningmastery [online resource] // Dealing with unbalanced data [website] URL: https://www.machinelearningmastery.ru/methods-for-dealing-with-imbalanced-data-5b761be45a18/ (Date of request 2.12.2020)

docs.microsoft.com [online resource] // Prevent false relationships and imbalanced data through automated machine learning [website] URL: https://docs.microsoft.com/ru-ru/azure/machine-learning/concept-manage-ml-pitfalls (Date of request 2.04.2021)

Reglament.net [online resource] // Neural networks in anti-fraud modeling [website] URL: http://www.reglament.net/bank/r/2018_2/get_article.htm?id=5615 (Date of request 12.04.2021)

mql5 [online resource] Gradient boosting [website] URL: https://www.mql5.com/ru/articles/8642 (Date of request 7.12.2020)

AI-news [online resource] // Big data [website] URL: https://ai-news.ru/big_data.html (Date of request 26.05.2021)

AI-news [online resource] // Problems and errors of machine learning, neural networks / big data [website] URL: https://ai-news.ru/problemy_i_oshibki_mashinnogo_obucheniya.html (Date of request 26.05.2021)

AI-news [online resource] // Neural network news: who is Prover-7 and why sample quality is more important than sample size [website] URL: https://ai-news.ru/2021/04/novosti_nejrosetej_kto_takoj_providec_7_i_pochemu_kachestvo_vybork.html (Date of request 27.05.2021)

Medium.com [online resource] // ReLU: Not a Differentiable Function [website] URL: https://medium.com/@kanchansarkar/relu-not-a-differentiable-function-why-used-in-gradient-based-optimization-7fef3a4cecec (Date of request 13.04.2021)

Arefyevstudio [online resource] // What is resampling? [website] https://arefyevstudio.com/2019/01/11/chto-takoe-resempling/ (Date of request 20.03.2021)


Refbacks

  • There are currently no refbacks.


Abava  Кибербезопасность IT Congress 2024

ISSN: 2307-8162