The BankGenome™ Project

Unlocking bank data

The BankGenome™ Project is a source of data relief, not a pain point.

What is the BankGenome™ Project?

Think the Human Genome Project, except for banks. Invictus is working to map the community bank genome by collecting and analyzing loan-level information from thousands of community banks. When we’re done, we’ll gain exclusive knowledge about how banks really work, and we’ll share it with participants. The more data in the system, the more incisive the insights become.

Participating banks contribute to the project by providing their own loan-level information. In return, they get selective access to the BankGenome™ suite, including:

The ability to benchmark your portfolio against the database

Loan pricing information

Loss information required for CECL assumptions

Prepayment and curtailment speeds


What it Takes to Join

Step 1: Data

The data gathering phase is simple. It leverages existing processes: for example, you can reuse data you’ve already cleaned and formatted for CECL, FDIC ALERT files or FHLB advances. Here’s a sample table of common data elements that we need.

  • Data Item: Importance
    • Account Number: High
    • Original Balance: Medium
    • Current Balance: High
    • Origination Date: High
    • Maturity Date: High
    • Interest Rate: High
    • Risk Rating: High
    • Fixed or Floating: High
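As a rough illustration, a bank preparing a submission could check its loan records against the table above before sending them. The sketch below is only that: the field names, the `LoanRecord` class and the `missing_fields` helper are hypothetical and are not part of any BankGenome™ specification. Only the importance levels are taken from the table.

```python
from dataclasses import dataclass, fields
from datetime import date
from typing import List, Optional

# Importance levels copied from the sample table; the snake_case
# field names are hypothetical, chosen only for this sketch.
IMPORTANCE = {
    "account_number": "High",
    "original_balance": "Medium",
    "current_balance": "High",
    "origination_date": "High",
    "maturity_date": "High",
    "interest_rate": "High",
    "risk_rating": "High",
    "fixed_or_floating": "High",
}


@dataclass
class LoanRecord:
    """One loan, with every data element optional until populated."""
    account_number: Optional[str] = None
    original_balance: Optional[float] = None
    current_balance: Optional[float] = None
    origination_date: Optional[date] = None
    maturity_date: Optional[date] = None
    interest_rate: Optional[float] = None
    risk_rating: Optional[str] = None
    fixed_or_floating: Optional[str] = None


def missing_fields(record: LoanRecord, importance: str = "High") -> List[str]:
    """Return the names of unpopulated fields at the given importance level."""
    return [
        f.name
        for f in fields(record)
        if IMPORTANCE[f.name] == importance and getattr(record, f.name) is None
    ]
```

For example, `missing_fields(LoanRecord(account_number="A-1", current_balance=1000.0))` lists the high-importance elements that still need to be filled in for that loan.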

Secure Data

We do not request or store any information that may identify individual borrowers or guarantors. If this information is provided, it is immediately deleted. We work with banks to help refine the data request.

Step 2: Cost

Good news! The biggest contribution a BankGenome™ participant can make to the project is quality data. The more high-quality information provided, the lower the cost. We do require a small monthly fee to keep the lights on and the servers cool, but the financial cost is based on a bank’s ability to contribute. We’ve developed a tiered approach to pricing, based on data contribution.

  • Tier: Contribution Required (Estimated Monthly Cost*)
    • Tier I: >5-Year History, Quarterly go-forward ($0)
    • Tier II: >3-Year History, Quarterly go-forward ($200)
    • Tier III: <3-Year History, Quarterly go-forward ($400)

*Subject to Change
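To make the tiers concrete, here is a minimal sketch of how the pricing above could be expressed as a lookup. The function name `estimate_tier` is hypothetical, and the amounts are the estimated monthly costs listed above, which are subject to change. The tiers do not say where exactly three years of history falls, so this sketch assumes it belongs to Tier III.

```python
from typing import Tuple


def estimate_tier(history_years: float) -> Tuple[str, int]:
    """Map years of loan history to (tier, estimated monthly cost in USD)."""
    if history_years > 5:
        return ("Tier I", 0)
    if history_years > 3:
        return ("Tier II", 200)
    # Assumption: exactly 3 years is treated as Tier III,
    # because the published tiers only cover >3 and <3.
    return ("Tier III", 400)
```

Under these assumptions, a bank with four years of history would fall into Tier II at an estimated $200 per month.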

Step 3: Return
Participants have access to two types of information:

Members can sample BankGenome™ for external loan-level data to help them develop a loss history to support their CECL calculations, solving a problem many banks face. Users can access and download information housed in BankGenome™ and mine the information with their specific questions in mind. Our goal is to provide a user-friendly access point to one of the largest loan-level databases in the United States.


While we won’t be able to anticipate all the questions that an institution might ask about the information housed in BankGenome™ (there are lots!), we provide some preliminary analytics to get you started. We compare the information in BankGenome™ to your own information and provide analyses that include (but are not limited to) comparisons of data quality, risk rating tendencies, pricing tendencies, and varying loan structures. We also provide information on capital adequacy and CECL loss rates.

Want to speak with the BankGenome™ Project Director?