Building a Scalable, Multi-Country Credit Bureau Platform
Designed and implemented a large-scale, customizable credit bureau platform being offered in multiple countries to operate with financial institutions and regulatory bodies.
The system ingests large volume of data from diverse financial institutions, performs advanced identity matching, and generates unified consumer credit reports for banks.

The Challenge
The Solution
Implementation Process
Data Ingestion Layer
Data received from external banks via batch files.
Supported multiple formats and schemas
Designed for high-volume, incremental ingestion.
ETL & Data Processing
Implemented ETL process.
Data validation & cleansing.
Data Transformation.
Data Standardization and matching.
Data caching.
Name & Address Standardization
Third party integration for name and address standardization.
Unique Name and Address parsing based on country norms.
Consumer and Business name and address differentiation.
Identity Matching Engine
Advanced Identity matching algorithm
Identified consumer using Name, Address, ID numbers & Phone numbers.
Proprietary rules to ensure unique consumer identification.
Data Storage Layer
IBM DB2
Permanent storage of records
Structured, relational credit data.
MongoDB
Consumer data stored as documents
Faster retrieval for data validation.
Credit Report Application
User Interface for accessing consumer reports.
Enabled consumers search using name, address, ID and/or phone numbers.
Match the consumers and pull together accounts, loans and other data.

Technical Stack
- Java, Java EE
- Pentaho ETL
- Mongo DB & DB2
- IdentIQ / SAS
- JBoss webserver
