Primary competition visual

IndabaX Zimbabwe 2026 Hackathon: Loan Default Prediction in Zimbabwe’s Banking Sector

Helping Zimbabwe
To be awarded
Completed (9 days ago)
Prediction
Machine Learning
Credit Risk
Finance
Banking
Risk Management
238 joined
147 active
Starti
May 09, 26
Closei
May 15, 26
Reveali
May 15, 26
About

The data mix dates, categorical fields, and numeric features (amount, rate, term, income, province, product type, disbursement channel, and more). Missing values appear in both train and test; the test set is crafted with somewhat more missingness to reward robust preprocessing and generalisation. You have from May 02 to May 09, 2026 to register for the challenge. Please note that the dataset will only become available from the May 09, 2026.

Train.csv contains 38,932 loan records, including the binary target defaulted. Test.csv has 12,977 rows with the same feature columns but no defaulted— you must predict it for every ID. Each row is one loan with identifiers, key dates (approval, disbursement, first payment, maturity), amount in USD, interest rate and term, payment frequency and purpose, client and household fileds, employment and income, obligations and collateral, disbursement channel, and province. Both splits include missing values; the test stresses robust handling. Data are tabular and suitable for classical ML, gradient-boosted trees, and other supervised learners.

Files
Description
Files
The file shows the structure and format of the submission file.
provides a description of the dataset.
This starter notebook shows you how to get started.
This file is the training set.
This file is the test set which should be used by your trained model for prediction.